Hierarchical nonparametric Bayesian Reinforcement Learning

Lead Research Organisation: University College London
Department Name: Computer Science

Abstract

During his PhD, Samuel will work on building scalable sample-efficient autonomous machines. The approach would be to use a hierarchical approach, and thus break down tasks into sub-tasks to achieve, and as a result enable us to scale to larger problems and transfer knowledge between tasks through the knowledge of sub-tasks. He would also bring scale by using spatial abstractions in sub-tasks, and thus reduce the state-space of individual sub-tasks based on their relevant variables. For the above, Samuel would take a model-based Bayesian approach using Bayesian non-parametrics to model the environment and then Value-Function/policy-search methods to find optimal sub and parent policies. One of the main challenges he would like to contribute to solving is making Gaussian processes more scalable, and designing efficient hierarchical frameworks for learning.

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/S021566/1 01/04/2019 30/09/2027
2267930 Studentship EP/S021566/1 23/09/2019 22/09/2023 Samuel Cohen