📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

Learning of safety critical model predictive controllers for autonomous systems

Lead Research Organisation: Loughborough University
Department Name: Wolfson Sch of Mech, Elec & Manufac Eng

Abstract

Modern autonomous systems such as mobile robots and autonomous vehicles rely heavily on feedback controllers for motion control, particularly for path-following and obstacle avoidance, where they are employed to follow a trajectory set by a higher-level motion planner in a hierarchical control scheme. Model Predictive Control (MPC) is a popular controller choice for obstacle avoidance, as it allows constraints to be specified to ensure that the mobile robot or autonomous vehicle does not collide with obstacles. The behaviour of MPC is well understood from years of theoretical development and industrial practice, providing strong safety assurances, but considerable time and expert knowledge is required to implement it, especially in safety-critical applications such as autonomous vehicles.

In recent years, research on deep Reinforcement Learning (RL) has provided new methods to automatically find nonlinear feedback controllers for challenging control problems. But unlike MPC, existing RL methods typically have no guarantees of stability or of constraint satisfaction, and for safety-critical applications it is difficult to verify their behaviour.

To combine the predictability and safety guarantees of MPC with the power and convenience of modern RL methods, this project will develop methods to automatically learn MPC controllers in actor-critic RL frameworks, considering motion control and obstacle avoidance problems for autonomous vehicles. This will be a direct application of recent mathematical results showing that convex optimisations, such as MPC, can be employed as a trainable layer in RL frameworks such as PyTorch, allowing them to be learned. The goal is to enable rapid design and prototyping of path-following type MPC without requiring expert-knowledge of the underlying MPC algorithm, therefore reducing development time and cost and improving safety and reliability of future mobile robots and autonomous vehicles.

To ensure the new algorithms are practically applicable, an example application of motorcycle path-following and stability assistance will be used to guide their development. The problem of stabilising a two-wheeled vehicle in forward motion to follow a predefined path, for example via steering actuation, is challenging and has important applications in the emerging area of active safety systems for motorcycles and scooters.

For long term impact and to encourage adoption of the new methods by autonomous systems researchers, the new methods developed will be included in an open-source software library published on Github.
 
Title Novel gain-scheduled / parameter dependent optimal control methods for MPC 
Description Novel techniques for tube-based model predictive control of LPV (parameter varying) systems such as the UniPD self-balancing bike. A novel mathematical step allows us to perform convex optimisation over scheduled feedback policies, which was not previously possible 
Type Of Material Improvements to research infrastructure 
Year Produced 2024 
Provided To Others? Yes  
Impact Novel MPC controllers for UniPD self balancing bike. Method will be published and disseminated in this year's IEEE conference on decision and control, and associated code made available online. 
 
Description Collaboration with University of Padova 
Organisation University of Padova
Country Italy 
Sector Academic/University 
PI Contribution Ongoing collaboration on control systems design for the university of Padova (UniPD) 'Self-Balancing Bike' (SBB).
Collaborator Contribution Access to SBB and associated technical support provided by UniPD for testing of novel MPC methods and control systems design techniques developed in this project.
Impact https://doi.org/10.1080/00423114.2018.1506588
Start Year 2019
 
Title LPV MPC toolbox 
Description A MATLAB toolbox to allow other researchers to use our new MPC algorithms for LPV system arising from this research project. 
Type Of Technology Webtool/Application 
Year Produced 2025 
Open Source License? Yes  
Impact None yet 
URL https://github.com/jflmng/lpvmpc_toolbox
 
Description IMechE workshop at UKACC CONTROL 2024 conference 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Industry/Business
Results and Impact Participation and dissemination of project through IMechE sponsored workshop at CONTROL 2024 conference.
Year(s) Of Engagement Activity 2024
 
Description Participation in ACE Network ECR Webinar - Control for Smart and Sustainable Mobility 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Industry/Business
Results and Impact This seminar aims to gather ECRs in control engineering to discuss critical challenges and opportunities in the evolution of mobility systems, triggered by cutting-edge control and AI techniques. It will feature ECRs working across diverse domains within the context of future mobility, including road and railway systems, mobility electrification, and other emerging transportation technologies. Industry presenters will share insights from early deployments and the challenges encountered. The seminar will translate these insights into a practical and impactful research roadmap, promoting research collaboration within the ACE network, and preparing us for upcoming funding opportunities.
Year(s) Of Engagement Activity 2025
URL https://ukcontrol.org/webinar-control-technologies-for-smart-and-sustainable-mobility-systems