Simple Environment
The Simple Environment is the basic functional EnergyPlus environment included in ICCP
. Most examples work with this environment.
This environment is a wrapper around the EnergyPlus simulation that can be found at EnergyPlus_simulations/simple_simulation
.
It is described a simple environment because there’s only a single HVAC setpoint that’s being acted on.
Description
The case study building is a 1-story building prototype called “Controlled Environments for Living Lab Studies” (CELLS) actually located at the Smart Living Lab site in Fribourg. The building prototype has 2 rooms that are almost identical in size.
Observation space
The observation space is of dimension 6, it has:
- “Tair” - Air Temperature
- “Rh” - Relative Humidity
- “Tmrt” - Mean Radiant Temperature
- “Tout” - External Ambient Temperature
- “Qheat” - Heating Demand from the HVAC system
- “Occ” - Occupancy
Action space
The action space is of dimension 1, it has:
- “Tset”, sets the temperature of a HVAC unit located in one of the rooms.
Thermal comfort or Predicted mean vote (PMV)
One may define the thermal comfort of a room using Berkeley’s PMV method which defines the thermal comfort using parameters such as mean radiant temperature or relative humidity.
The PMV usually ranges from -2 to 2, where 0 is neutral, -2 very cold, and 2 is very hot.
In terms of thermal comfort and human health, a room in the PMV range of -0.5 to 0 is considered the best.
Reward function
The reward of a given observation / state is a weighted combination of the thermal comfort (defined as the PMV) and the heating demand.
Alpha is the parameter for thermal comfort and Beta is the parameter for the heating demand.
Then, we may define the reward as:
Reward = Beta * (1 - (heating/(800’000))) + Alpha * (1 - abs((pmv + 0.5))) * occupancy
Thus, a high Beta heavily penalizes heating and a high Alpha heavily prioritizes thermal comfort.