Zero and Few-Shot Generalization of Agentic Systems

Lead Research Organisation: University of Oxford

Abstract

Brief description of the context of the research including potential impact
Agents, whether human or artificial, must interact efficiently and effectively with their environment. In the context of multi-agent systems, this often involves agents coordinating with one another to achieve shared objectives and maximize collective utility. Such systems can model a wide range of real-world domains where collaboration is essential, including areas like autonomous driving, where vehicles must cooperate to ensure road safety; smart grid management, where distributed energy systems collaborate to optimize power distribution; and robotic teams, where multiple robots coordinate to complete complex tasks like warehouse logistics or disaster response. In these scenarios, agents must handle uncertainty, communicate effectively, and adapt to dynamic changes in their environment. Therefore, the design of these collaborative systems requires careful consideration of coordination mechanisms, learning methods, and strategies for managing inter-agent dependencies, ultimately driving the development of robust, scalable, and effective solutions for real-world challenges.
Aims and Objectives
The aim of this research is to develop methods that enable agents to robustly generalize to new environments and partners. These methods are designed to perform effectively even in scenarios with minimal access to behavioural data and limited prior coordination experience with test-time agents.
Novelty of the research methodology
Existing methods for zero-shot coordination and similar settings typically assume strong assumptions about the underlying environment. This research aims to relax these assumptions by developing scalable methods that can be applied to more realistic and dynamic environments. By reducing reliance on restrictive assumptions, we seek to create more flexible and generalizable approaches that enhance agent coordination in practical, real-world scenarios.
Alignment to EPSRC's strategies and research areas (which EPSRC research area the project relates to) Further information on the areas can be found on http://www.epsrc.ac.uk/research/ourportfolio/researchareas/
This research aligns with the EPSRC's goals of advancing more secure and reliable systems, particularly in the realm of robotics. By developing reinforcement learning models capable of solving complex applied challenges, this work has the potential to significantly impact areas such as autonomous robotics, where robust models could assist in precision tasks, prevent contamination, or even contribute to the discovery of new treatments through automated experimentation and analysis.

Any companies or collaborators involved

Toshiba, EPSRC

Student:

Darius Muglich

Period of Study:

Sep 23 - Sep 27

Funder:

EPSRC

Project Status:

Active

Project Category:

Studentship

Project Reference:

2868707

Research Topic:

Unclassified

Organisations

People	ORCID iD
Darius Muglich (Student)

Publications

Author Name

Title Publication Date Published

10 25 50

Studentship Projects

Project Reference	Relationship	Related To	Start	End	Student Name
EP/S024050/1			30/09/2019	30/03/2028
2868707	Studentship	EP/S024050/1	30/09/2023	29/09/2027	Darius Muglich

Abstract

Organisations

People

ORCID iD

Publications

Studentship Projects