Hierarchical nonparametric Bayesian Reinforcement Learning
Lead Research Organisation:
University College London
Department Name: Computer Science
Abstract
During his PhD, Samuel will work on building scalable sample-efficient autonomous machines. The approach would be to use a hierarchical approach, and thus break down tasks into sub-tasks to achieve, and as a result enable us to scale to larger problems and transfer knowledge between tasks through the knowledge of sub-tasks. He would also bring scale by using spatial abstractions in sub-tasks, and thus reduce the state-space of individual sub-tasks based on their relevant variables. For the above, Samuel would take a model-based Bayesian approach using Bayesian non-parametrics to model the environment and then Value-Function/policy-search methods to find optimal sub and parent policies. One of the main challenges he would like to contribute to solving is making Gaussian processes more scalable, and designing efficient hierarchical frameworks for learning.
People |
ORCID iD |
Marc Deisenroth (Primary Supervisor) | |
Samuel Cohen (Student) |
Studentship Projects
Project Reference | Relationship | Related To | Start | End | Student Name |
---|---|---|---|---|---|
EP/S021566/1 | 31/03/2019 | 29/09/2027 | |||
2267930 | Studentship | EP/S021566/1 | 22/09/2019 | 21/09/2023 | Samuel Cohen |