📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

Zero and Few-Shot Generalization of Agentic Systems

Lead Research Organisation: University of Oxford

Abstract

Brief description of the context of the research including potential impact
Agents, whether human or artificial, must interact efficiently and effectively with their environment. In the context of multi-agent systems, this often involves agents coordinating with one another to achieve shared objectives and maximize collective utility. Such systems can model a wide range of real-world domains where collaboration is essential, including areas like autonomous driving, where vehicles must cooperate to ensure road safety; smart grid management, where distributed energy systems collaborate to optimize power distribution; and robotic teams, where multiple robots coordinate to complete complex tasks like warehouse logistics or disaster response. In these scenarios, agents must handle uncertainty, communicate effectively, and adapt to dynamic changes in their environment. Therefore, the design of these collaborative systems requires careful consideration of coordination mechanisms, learning methods, and strategies for managing inter-agent dependencies, ultimately driving the development of robust, scalable, and effective solutions for real-world challenges.
Aims and Objectives
The aim of this research is to develop methods that enable agents to robustly generalize to new environments and partners. These methods are designed to perform effectively even in scenarios with minimal access to behavioural data and limited prior coordination experience with test-time agents.
Novelty of the research methodology
Existing methods for zero-shot coordination and similar settings typically assume strong assumptions about the underlying environment. This research aims to relax these assumptions by developing scalable methods that can be applied to more realistic and dynamic environments. By reducing reliance on restrictive assumptions, we seek to create more flexible and generalizable approaches that enhance agent coordination in practical, real-world scenarios.
Alignment to EPSRC's strategies and research areas (which EPSRC research area the project relates to) Further information on the areas can be found on http://www.epsrc.ac.uk/research/ourportfolio/researchareas/
This research aligns with the EPSRC's goals of advancing more secure and reliable systems, particularly in the realm of robotics. By developing reinforcement learning models capable of solving complex applied challenges, this work has the potential to significantly impact areas such as autonomous robotics, where robust models could assist in precision tasks, prevent contamination, or even contribute to the discovery of new treatments through automated experimentation and analysis.

Any companies or collaborators involved

Toshiba, EPSRC

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/S024050/1 30/09/2019 30/03/2028
2868707 Studentship EP/S024050/1 30/09/2023 29/09/2027 Darius Muglich