General forward model

Lead Research Organisation: Queen Mary University of London

Department Name: Sch of Electronic Eng & Computer Science

Abstract

Reasearch Context

Artificial intelligence (AI) algorithms such as deep learning has been used successfully in many fields such as Computer Vision, Natural Language Processing and Decision making systems.
AI has been particularly successful in learning how to play games such as the ancient chinese game of Go, Atari games such as Breakout and Pong, and more recently complex games such as Starcraft and DOTA2.

Many successful algorithms require an ability to effectively see-forward in time, and plan next moves based on the outcome of these internal experiments.
This usually requires access to a perfect underlying model of the sytem, or a very accurate hand-crafted replica of the underlying system. There have been several attempts recently to actually build models to predict
the possible future outcomes and plan accordingly without access to the underlying model.

Aims and Objectives

The aim of this research is to create an efficient and genric way for simulated environments to be accurately predicted. There are several questions that will be answered by this research:
Can predictive models of the simulated environment improve the results of training?
Are accurate models of the environment necessary to train AI algorithms to perform complex tasks? Or can models that effectively "filter" unecessary information be used?
In games with several players which amy or may not have predictable actions, can predictive models be used to create better adversaries?
Can predictive models of simulated environments be used to understand risk?

Potential Applications and benefits

Using games as a test-bed for research has advantages in safety, cost and control of time. For example, when training self-driving cars to avoid collisions. In a simulated environment,
the cars can make mistakes without endangering the public; they can exist entirely in software and no physical components need to be maintained or replaced; simulations can be 'sped up'
meaning that several hours of training time can be compressed into minutes.

If there are ways to predict outcomes in real-world environments, then the applications of this research are widespread. Being able to accurately predict the outcomes of actions several seconds
into the future would provide a large advantage for safetly-critical applications. Additionally in the context of environments with hard to predict parameters,
AI algorithms that can plan ahead for best and worse case scenarios would be able to understand underlying risk in it's actions.

Student:

Christopher Bamford

Period of Study:

Sep 18 - Jul 22

Funder:

EPSRC

Project Status:

Closed

Project Category:

Studentship

Project Reference:

2109450

Research Topic:

Unclassified

Organisations

Queen Mary University of London (Lead Research Organisation)

People	ORCID iD
Christopher Bamford (Student)

Publications

Author Name

Title Publication Date Published

10 25 50

Studentship Projects

Project Reference	Relationship	Related To	Start	End	Student Name
EP/R513106/1			30/09/2018	29/09/2023
2109450	Studentship	EP/R513106/1	30/09/2018	30/07/2022	Christopher Bamford

Impact Summary


Description	Several researchers are using my project: https://griddly.readthedocs.io/en/latest/ in their work.
First Year Of Impact	2020
Sector	Creative Economy,Digital/Communication/Information Technologies (including Software),Education

Abstract

Organisations

People

ORCID iD

Publications

Studentship Projects