Improving Sample Efficiency of Reinforcement Learning

Lead Research Organisation: University of Edinburgh
Department Name: Sch of Informatics

Abstract

Deep reinforcement learning has had huge empirical success and is a major enabling technology for many applications of AI. However, recent RL algorithms still require millions of samples to obtain good performance. Since obtaining environment interactions is often costly, and since challenging environments are rarely static, this requirement inhibits many practical applications. This project will investigate ways of reducing this cost, aiming to find more sample-efficient RL algorithms. We aim for the algorithms to be deployable in realistic settings, where agents use deep networks to represent knowledge about the environment. The project is also likely to improve the performance of other systems that make automated decisions.

Research Strategy

The project will investigate two main avenues for improving sample efficiency. First, by using a Bayesian framework to extract additional information from samples, we hope to achieve improved exploration, which will in turn lead to more informative samples. Second, using meta-learning, we hope to enable generalisation, reducing the number of samples required to learn a task that is similar to previously learned tasks.
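The Bayesian avenue can be illustrated in its simplest setting: a Bernoulli multi-armed bandit explored with Thompson sampling, where the posterior over each arm's reward probability directs exploration toward uncertain arms and concentrates on the best arm as evidence accumulates. This is a generic sketch for intuition only, not the project's method; the function name and parameters are illustrative.

```python
import random

def thompson_sampling(true_probs, steps=2000, seed=0):
    """Bernoulli bandit with Beta posteriors (Thompson sampling).

    At each step, sample a reward estimate from every arm's Beta
    posterior and pull the arm with the highest sample. Exploration
    falls off naturally as the posteriors concentrate.
    """
    rng = random.Random(seed)
    k = len(true_probs)
    successes = [1] * k  # Beta(1, 1) uniform priors
    failures = [1] * k
    total_reward = 0
    for _ in range(steps):
        # Posterior sample per arm; argmax decides which arm to pull.
        samples = [rng.betavariate(successes[i], failures[i]) for i in range(k)]
        arm = max(range(k), key=lambda i: samples[i])
        reward = 1 if rng.random() < true_probs[arm] else 0
        total_reward += reward
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return successes, failures, total_reward

succ, fail, reward = thompson_sampling([0.2, 0.5, 0.8])
# The most-pulled arm should be the one with the highest true probability.
best_arm = max(range(3), key=lambda i: succ[i] + fail[i])
```

The point of the sketch is that uncertainty itself drives exploration: no explicit exploration bonus or epsilon schedule is needed, because arms with wide posteriors are occasionally sampled high and thus tried.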

Objectives and Applications

We aim to develop more sample-efficient reinforcement learning algorithms and to gain new insights about exploration. The project will be carried out in collaboration with Microsoft Research Cambridge and will have immediate relevance for their computer games research, particularly for training game AI in complex worlds where samples are expensive. The project will have wider applications for any problem that involves decision making with limited data, including real-world applications such as robotics and pricing strategies.

Publications


Studentship Projects

Project Reference   Relationship   Related To     Start        End          Student Name
EP/V519674/1                                      01/10/2020   30/09/2025
2579743             Studentship    EP/V519674/1   01/05/2021   31/07/2025   Adam Jelley