Sequential Decision making in probabilistic models

Lead Research Organisation: University of Cambridge

Department Name: Computer Science and Technology

Abstract

This proposal considers the problem of robust sequential decision making in non-linear environments. Reinforcement
learning has demonstrated high potential for solving complex problems in non-linear environments but has lacked
efficiency and robustness. We argue that in order to deploy reinforcement learning agents in the real world, it is essential to
develop similar efficiency and robustness properties that have been developed in control theory. We propose to leverage
the extensive control and probabilistic reasoning literature to improve RL algorithms and present two interesting research
directions. The first one considers using Sequential Monte-Carlo methods to improve planning for non-linear
environments. The second direction focuses on designing robust controllers by exploring the connections between
adversarial learning, robust control theory, and uncertainty modelling.

Student:

Pierre Thodoroff

Period of Study:

Sep 20 - Sep 23

Funder:

EPSRC

Project Status:

Closed

Project Category:

Studentship

Project Reference:

2744311

Research Topic:

Unclassified

Organisations

University of Cambridge (Lead Research Organisation)

People	ORCID iD
Neil Lawrence (Primary Supervisor)	http://orcid.org/0000-0001-9258-1030
Pierre Thodoroff (Student)

Publications

Author Name Title Publication

Date Published

10 25 50

Studentship Projects

Project Reference	Relationship	Related To	Start	End	Student Name
EP/T517847/1			30/09/2020	29/09/2025
2744311	Studentship	EP/T517847/1	30/09/2020	29/09/2023	Pierre Thodoroff

Abstract

Organisations

People

ORCID iD

Publications

Studentship Projects