Toward Data-Efficient Reinforcement Learning

Lead Research Organisation: University College London
Department Name: Electronic and Electrical Engineering

Abstract

We are surrounded by large-scale interconnected systems, from the Internet to the power grid, from the commuter traffic flow to social networks. While essential, the management of such networked systems is exceedingly hard, mainly because of their intrinsic and constantly growing complexity. To overcome this challenge, we are in dire need of data-efficient online decision strategies under uncertainty for high-dimensional and dynamic networks. This project focuses on developing fundamental learning methodologies targeting data-efficiency in large-scale problems, specifically, data-efficient reinforcement learning methodologies able to infer and exploit the structure of the network. . This project will develop fundamental methodologies on data-efficient AI and autonomous systems, aligning with the S&T Challenge of AI and the "AI technologies", and "decision making strategy" EPSRC research areas - some of the largest/growing portfolios. This research directly links also to UKRI grand challenge on AI, the EPSRC areas as productivity and connectedness, developing "cost-effective ways of delivering existing services" in a fully connected world [EPSRC Delivery Plan, 2016-2019], in the envisioned application of controlling IoT networks. It links indirectly also the following UCL Grand Challenges: the UCL Grand Challenge of Sustainable Cities (AI for autonomous driving cars, grid energy, energy-harvesting IoT networks); the UCL Grand Challenge of Transformative Technology (AI for managing Internet of Food, understanding social behaviours/impact); the UCL Grand Challenge of Cultural Understanding (graph learning to influence of religions, politics opinion across the world, AI for personalized education).

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/N509577/1 01/10/2016 24/03/2022
1916295 Studentship EP/N509577/1 25/09/2017 03/06/2022 Sephora Madjiheurem
EP/R513143/1 01/10/2018 30/09/2023
1916295 Studentship EP/R513143/1 25/09/2017 03/06/2022 Sephora Madjiheurem
 
Description A main challenge of autonomous learning in large scale networks is achieving data efficiency for fast learning.
This is the problem that is being studied with this award, and some impactful progress have been made. In particular, we conducted a project that allowed to challenge some commonly accepted assumptions around learning systems with the community. This resulted in the development of a new learning method with promising experimental results. It is an ongoing work, but we have already published papers on the subject.
Exploitation Route The the code of the methods developed with this funding are made available online and the details are thoroughly explained in published papers. In fact, some users have already started accessing the online material for their own research. The methods are general enough that they can be applied to many domains where autonomous learning is required.
Sectors Digital/Communication/Information Technologies (including Software),Electronics,Energy,Financial Services, and Management Consultancy,Healthcare,Manufacturing, including Industrial Biotechology,Transport

URL https://arxiv.org/pdf/1901.05351.pdf