Deep Bayesian Reinforcement Learning in Changing Environments

Lead Research Organisation: University College London
Department Name: Statistical Science

Abstract

Deep Reinforcement Learning (DRL) worked well in a wide range of games with a fixed environment, such as Go and Starcraft. But most real-world environments change over time and are influenced by random factors such as weather. So, the nonstationarity of the environment in DRL requires more attention. This PhD research sheds light on Reinforcement Learning in changing environments with Bayesian approach since Bayesian DRL can deal with environments with uncertainty. All we need is to treat the environmental changes as uncertainty. In general, I plan to pursue four research directions to help fast learning in changing environments with Bayesian RL. I first investigate utilising prior experience to identify current environment status in a changing environment setting and then learn strategies. Then I suggest using the meta-learning tricks to facilitate adaptation to new environments in Bayesian RL. Finally, I propose less conservative Robust RL and more efficient Safe RL in changing environments with a Bayesian approach. These Bayesian RL directions can contribute to an efficient, safe, and robust deep Bayesian RL in changing environments.

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/T517793/1 01/10/2020 30/09/2025
2724208 Studentship EP/T517793/1 26/09/2022 25/09/2026 Xiaohang Tang