An Explanation-based Reinforcement Learning Approach
Lead Research Organisation:
University of Manchester
Department Name: Computer Science
Abstract
This project proposes a novel reinforcement learning (RL) algorithm which is guided by a knowledge-based user feedback. The project integrates user feedback using structured knowledge bases into the traditional RL models. The main objective of the project is to reduce the barriers for a continuous, knowledge-driven dialogue between end-users and RL models, improving few-shot learning capabilities (the ability to generalize from fewer examples).
The core research questions are:
- Can a Knowledge Base driven reinforcement learning (RL) approach supported by end-user feedback deliver few-shot learning capabilities?
- Which Knowledge Representation models and formalisms can support generalization and user feedback process?
- How semantic parsing methods can be used to support the end-user interaction with the knowledge base?
The research can impact the ability to deliver Artificial Intelligence (AI) systems which are more transparent, and which generalize from fewer examples. These two properties are at the center of the requirements for the application of AI within scenarios which require trust (e.g. health and legal domains).
This research is positioned within the EPSRC `data to knowledge' priority area.
The core research questions are:
- Can a Knowledge Base driven reinforcement learning (RL) approach supported by end-user feedback deliver few-shot learning capabilities?
- Which Knowledge Representation models and formalisms can support generalization and user feedback process?
- How semantic parsing methods can be used to support the end-user interaction with the knowledge base?
The research can impact the ability to deliver Artificial Intelligence (AI) systems which are more transparent, and which generalize from fewer examples. These two properties are at the center of the requirements for the application of AI within scenarios which require trust (e.g. health and legal domains).
This research is positioned within the EPSRC `data to knowledge' priority area.
Organisations
People |
ORCID iD |
| Philip Osborne (Student) |
Studentship Projects
| Project Reference | Relationship | Related To | Start | End | Student Name |
|---|---|---|---|---|---|
| EP/N509565/1 | 30/09/2016 | 29/09/2021 | |||
| 2169184 | Studentship | EP/N509565/1 | 01/01/2019 | 31/12/2022 | Philip Osborne |
| EP/R513131/1 | 30/09/2018 | 29/09/2023 | |||
| 2169184 | Studentship | EP/R513131/1 | 01/01/2019 | 31/12/2022 | Philip Osborne |
| NE/W503186/1 | 31/03/2021 | 30/03/2022 | |||
| 2169184 | Studentship | NE/W503186/1 | 01/01/2019 | 31/12/2022 | Philip Osborne |