
Deep learning methods applied to modeling and policy optimization in large buildings

dc.creator: Naug, Avisek
dc.date.accessioned: 2022-05-19T17:19:11Z
dc.date.available: 2022-05-19T17:19:11Z
dc.date.created: 2022-05
dc.date.issued: 2022-05-16
dc.date.submitted: May 2022
dc.identifier.uri: http://hdl.handle.net/1803/17367
dc.description.abstract: Developing an optimal policy for building energy management is a difficult problem because the system exhibits non-stationary behavior and the target policy needs to evolve with changes in the state transition and reward functions. Non-stationary real-world problems present a set of challenges: the non-stationarities are difficult to detect, and systems with low sampling rates create sample-inefficiency problems for learning algorithms. In addition, the system may have to satisfy safety-critical constraints, and the policy must therefore be learned offline. In this thesis, we develop deep reinforcement learning methods for designing supervisory controllers for building energy management. This process requires a model of the system for planning or training purposes. Deriving accurate physics-based models that generalize across non-stationarities may be computationally infeasible for complex systems because of resource constraints or the lack of detailed domain knowledge. Data-driven models, on the other hand, are relatively easier to develop but need sufficient data covering a variety of building operations and environmental conditions to ensure that the building model is neither under-constrained nor overfitted. Our approach solves the problem of deploying a condition-based lifelong reinforcement learning agent for building energy management. We assume that building systems can be modeled as Lipschitz-continuous non-stationary Markov decision processes (NS-MDPs) with bounded changes in the system dynamics and reward, allowing policies to adapt incrementally to the changing behavior via a relearning procedure that completes within a duration determined by the time constants of the system. The approach involves two loops: an outer loop detects the non-stationarity by tracking the deployment-phase reward and triggers an inner loop called the relearning phase. The relearning process starts by updating the data-driven models with recent system data to adapt to the non-stationarity; we employ elastic weight regularization to prevent overfitting with limited data. The agent policy is then trained by interacting with the updated data-driven models, which simulate the system behavior. To generate a large amount of diverse data, we simulate the system over a long future horizon using forecasts of the exogenous variables. To account for the variance introduced by inaccurate returns from the data-driven models, we train the agent across multiple environments in parallel, bootstrapping experiences to reduce the effects of uncertainty and to collect decorrelated transitions. We demonstrate our proposed approach on a building simulation testbed and benchmark it against state-of-the-art approaches to building supervisory control such as G36, PPO, DDPG, and MPC. We also deploy the approach in a real building on our university campus. Our approach performs significantly better than existing supervisory control strategies and highlights the need for a condition-based offline relearning framework in dynamic systems.
dc.format.mimetype: application/pdf
dc.language.iso: en
dc.subject: Deep Learning, Deep Reinforcement Learning, Building Energy Optimization
dc.title: Deep learning methods applied to modeling and policy optimization in large buildings
dc.type: Thesis
dc.date.updated: 2022-05-19T17:19:11Z
dc.type.material: text
thesis.degree.name: PhD
thesis.degree.level: Doctoral
thesis.degree.discipline: Computer Science
thesis.degree.grantor: Vanderbilt University Graduate School
dc.creator.orcid: 0000-0003-3253-7286
dc.contributor.committeeChair: Biswas, Gautam
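
The abstract describes a two-loop scheme: an outer loop that tracks the deployment-phase reward to detect non-stationarity, and an inner relearning loop that refreshes the data-driven models and retrains the policy against them. The following is a minimal, illustrative Python sketch of that control flow only; every name and constant in it (run_policy_step, update_models, retrain_policy, WINDOW, DRIFT_THRESHOLD, the scalar "policy_gain" stand-in for a policy) is a hypothetical placeholder, not the thesis implementation, which uses deep RL agents and learned building models.

# Hypothetical sketch of the condition-based relearning loop described in the
# abstract. Placeholders stand in for the real plant, models, and RL training.
import random
from collections import deque

WINDOW = 168              # hours of deployment reward tracked (assumed: one week)
DRIFT_THRESHOLD = 0.15    # assumed relative reward drop that triggers relearning


def run_policy_step(policy_gain, state, drift):
    """Placeholder plant interaction: returns (next_state, observed_reward).
    `drift` mimics slowly changing building dynamics (the non-stationarity)."""
    action = policy_gain * state
    reward = -abs(state + drift - action) - 0.05 * random.random()
    return state + random.uniform(-0.1, 0.1), reward


def reward_degraded(recent_rewards, baseline):
    """Outer-loop test: has the tracked deployment-phase reward degraded?"""
    if len(recent_rewards) < WINDOW or baseline == 0:
        return False
    recent = sum(recent_rewards) / WINDOW
    return (baseline - recent) / abs(baseline) > DRIFT_THRESHOLD


def update_models(models, recent_data):
    """Placeholder for refitting the data-driven building models on recent data
    (the thesis regularizes this update to avoid overfitting on limited data)."""
    return models


def retrain_policy(policy_gain, models):
    """Placeholder for retraining the agent against the updated model ensemble
    (several simulated environments in parallel in the thesis)."""
    return policy_gain * 0.99  # trivial stand-in for a learning update


state, policy_gain, models = 1.0, 1.0, [object() for _ in range(4)]
rewards, recent_data, baseline = deque(maxlen=WINDOW), [], None

for hour in range(24 * 30):                        # one simulated month, hourly
    state, r = run_policy_step(policy_gain, state, drift=0.002 * hour)
    rewards.append(r)
    recent_data.append((state, r))
    if baseline is None and len(rewards) == WINDOW:
        baseline = sum(rewards) / WINDOW           # nominal deployment reward
    if reward_degraded(rewards, baseline or 0.0):  # outer loop: drift detected
        models = update_models(models, recent_data)        # inner loop, step 1
        policy_gain = retrain_policy(policy_gain, models)  # inner loop, step 2
        recent_data.clear()
        baseline = sum(rewards) / WINDOW           # reset the reference reward

In this sketch the injected drift eventually pushes the tracked reward below the threshold, which triggers a model update and policy retrain and then resets the reward baseline; the thesis performs the analogous steps with learned building models and a deep RL policy.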

