Learning of safety critical model predictive controllers for autonomous systems

Lead Research Organisation: Loughborough University

Department Name: Wolfson Sch of Mech, Elec & Manufac Eng

Abstract

Modern autonomous systems such as mobile robots and autonomous vehicles rely heavily on feedback controllers for motion control, particularly for path-following and obstacle avoidance, where they are employed to follow a trajectory set by a higher-level motion planner in a hierarchical control scheme. Model Predictive Control (MPC) is a popular controller choice for obstacle avoidance, as it allows constraints to be specified to ensure that the mobile robot or autonomous vehicle does not collide with obstacles. The behaviour of MPC is well understood from years of theoretical development and industrial practice, providing strong safety assurances, but considerable time and expert knowledge is required to implement it, especially in safety-critical applications such as autonomous vehicles.

In recent years, research on deep Reinforcement Learning (RL) has provided new methods to automatically find nonlinear feedback controllers for challenging control problems. But unlike MPC, existing RL methods typically have no guarantees of stability or of constraint satisfaction, and for safety-critical applications it is difficult to verify their behaviour.

To combine the predictability and safety guarantees of MPC with the power and convenience of modern RL methods, this project will develop methods to automatically learn MPC controllers in actor-critic RL frameworks, considering motion control and obstacle avoidance problems for autonomous vehicles. This will be a direct application of recent mathematical results showing that convex optimisations, such as MPC, can be employed as a trainable layer in RL frameworks such as PyTorch, allowing them to be learned. The goal is to enable rapid design and prototyping of path-following type MPC without requiring expert-knowledge of the underlying MPC algorithm, therefore reducing development time and cost and improving safety and reliability of future mobile robots and autonomous vehicles.

To ensure the new algorithms are practically applicable, an example application of motorcycle path-following and stability assistance will be used to guide their development. The problem of stabilising a two-wheeled vehicle in forward motion to follow a predefined path, for example via steering actuation, is challenging and has important applications in the emerging area of active safety systems for motorcycles and scooters.

For long term impact and to encourage adoption of the new methods by autonomous systems researchers, the new methods developed will be included in an open-source software library published on Github.

Funded Value:

£400,125

Funded Period:

May 23 - Nov 26

Funder:

EPSRC

Project Status:

Active

Project Category:

Research Grant

Project Reference:

EP/X015459/1

Principal Investigator:

James Fleming

Research Subject:

Info. & commun. Technol. (20%)

Mechanical engineering (10%)

Systems engineering (70%)

Research Topic:

Artificial Intelligence (20%)

Control Engineering (70%)

Robotics & Autonomy (10%)

Organisations

People	ORCID iD
James Fleming (Principal Investigator)	http://orcid.org/0000-0003-2936-4644

Publications

Author Name

Title Publication Date Published

10 25 50

Fleming J (2024) Robust tube MPC using gain-scheduled policies for a class of LPV systems

Fleming J (2024) Robust Tube MPC Using Gain-Scheduled Policies for a Class of LPV Systems in IEEE Control Systems Letters

Lot R (2025) Eco-driving optimal control for electric vehicles with driver preferences in Transportation Engineering

Midgley W (2023) Model-free Road Friction Estimation using Machine Learning

Otoofi M (2023) Estimating friction coefficient using generative modelling

Otoofi M (2024) FrictionSegNet: Simultaneous Semantic Segmentation and Friction Estimation Using Hierarchical Latent Variable Models in IEEE Transactions on Intelligent Transportation Systems

Research Tools and Methods
Collaboration
Software and Technical Products
Engagement Activities


Title	Novel gain-scheduled / parameter dependent optimal control methods for MPC
Description	Novel techniques for tube-based model predictive control of LPV (parameter varying) systems such as the UniPD self-balancing bike. A novel mathematical step allows us to perform convex optimisation over scheduled feedback policies, which was not previously possible
Type Of Material	Improvements to research infrastructure
Year Produced	2024
Provided To Others?	Yes
Impact	Novel MPC controllers for UniPD self balancing bike. Method will be published and disseminated in this year's IEEE conference on decision and control, and associated code made available online.


Description	Collaboration with University of Padova
Organisation	University of Padova
Country	Italy
Sector	Academic/University
PI Contribution	Ongoing collaboration on control systems design for the university of Padova (UniPD) 'Self-Balancing Bike' (SBB).
Collaborator Contribution	Access to SBB and associated technical support provided by UniPD for testing of novel MPC methods and control systems design techniques developed in this project.
Impact	https://doi.org/10.1080/00423114.2018.1506588
Start Year	2019


Title	LPV MPC toolbox
Description	A MATLAB toolbox to allow other researchers to use our new MPC algorithms for LPV system arising from this research project.
Type Of Technology	Webtool/Application
Year Produced	2025
Open Source License?	Yes
Impact	None yet
URL	https://github.com/jflmng/lpvmpc_toolbox


Description	IMechE workshop at UKACC CONTROL 2024 conference
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	National
Primary Audience	Industry/Business
Results and Impact	Participation and dissemination of project through IMechE sponsored workshop at CONTROL 2024 conference.
Year(s) Of Engagement Activity	2024


Description	Participation in ACE Network ECR Webinar - Control for Smart and Sustainable Mobility
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	National
Primary Audience	Industry/Business
Results and Impact	This seminar aims to gather ECRs in control engineering to discuss critical challenges and opportunities in the evolution of mobility systems, triggered by cutting-edge control and AI techniques. It will feature ECRs working across diverse domains within the context of future mobility, including road and railway systems, mobility electrification, and other emerging transportation technologies. Industry presenters will share insights from early deployments and the challenges encountered. The seminar will translate these insights into a practical and impactful research roadmap, promoting research collaboration within the ACE network, and preparing us for upcoming funding opportunities.
Year(s) Of Engagement Activity	2025
URL	https://ukcontrol.org/webinar-control-technologies-for-smart-and-sustainable-mobility-systems

Abstract

Organisations

People

ORCID iD

Publications