Learning of safety critical model predictive controllers for autonomous systems
Lead Research Organisation:
Loughborough University
Department Name: Wolfson Sch of Mech, Elec & Manufac Eng
Abstract
Modern autonomous systems such as mobile robots and autonomous vehicles rely heavily on feedback controllers for motion control, particularly for path-following and obstacle avoidance, where they are employed to follow a trajectory set by a higher-level motion planner in a hierarchical control scheme. Model Predictive Control (MPC) is a popular controller choice for obstacle avoidance, as it allows constraints to be specified to ensure that the mobile robot or autonomous vehicle does not collide with obstacles. The behaviour of MPC is well understood from years of theoretical development and industrial practice, providing strong safety assurances, but considerable time and expert knowledge is required to implement it, especially in safety-critical applications such as autonomous vehicles.
In recent years, research on deep Reinforcement Learning (RL) has provided new methods to automatically find nonlinear feedback controllers for challenging control problems. But unlike MPC, existing RL methods typically have no guarantees of stability or of constraint satisfaction, and for safety-critical applications it is difficult to verify their behaviour.
To combine the predictability and safety guarantees of MPC with the power and convenience of modern RL methods, this project will develop methods to automatically learn MPC controllers in actor-critic RL frameworks, considering motion control and obstacle avoidance problems for autonomous vehicles. This will be a direct application of recent mathematical results showing that convex optimisations, such as MPC, can be employed as a trainable layer in RL frameworks such as PyTorch, allowing them to be learned. The goal is to enable rapid design and prototyping of path-following type MPC without requiring expert-knowledge of the underlying MPC algorithm, therefore reducing development time and cost and improving safety and reliability of future mobile robots and autonomous vehicles.
To ensure the new algorithms are practically applicable, an example application of motorcycle path-following and stability assistance will be used to guide their development. The problem of stabilising a two-wheeled vehicle in forward motion to follow a predefined path, for example via steering actuation, is challenging and has important applications in the emerging area of active safety systems for motorcycles and scooters.
For long term impact and to encourage adoption of the new methods by autonomous systems researchers, the new methods developed will be included in an open-source software library published on Github.
In recent years, research on deep Reinforcement Learning (RL) has provided new methods to automatically find nonlinear feedback controllers for challenging control problems. But unlike MPC, existing RL methods typically have no guarantees of stability or of constraint satisfaction, and for safety-critical applications it is difficult to verify their behaviour.
To combine the predictability and safety guarantees of MPC with the power and convenience of modern RL methods, this project will develop methods to automatically learn MPC controllers in actor-critic RL frameworks, considering motion control and obstacle avoidance problems for autonomous vehicles. This will be a direct application of recent mathematical results showing that convex optimisations, such as MPC, can be employed as a trainable layer in RL frameworks such as PyTorch, allowing them to be learned. The goal is to enable rapid design and prototyping of path-following type MPC without requiring expert-knowledge of the underlying MPC algorithm, therefore reducing development time and cost and improving safety and reliability of future mobile robots and autonomous vehicles.
To ensure the new algorithms are practically applicable, an example application of motorcycle path-following and stability assistance will be used to guide their development. The problem of stabilising a two-wheeled vehicle in forward motion to follow a predefined path, for example via steering actuation, is challenging and has important applications in the emerging area of active safety systems for motorcycles and scooters.
For long term impact and to encourage adoption of the new methods by autonomous systems researchers, the new methods developed will be included in an open-source software library published on Github.
Publications
Fleming J
(2024)
Robust Tube MPC Using Gain-Scheduled Policies for a Class of LPV Systems
in IEEE Control Systems Letters
Lot R
(2025)
Eco-driving optimal control for electric vehicles with driver preferences
in Transportation Engineering
Midgley W
(2023)
Model-free Road Friction Estimation using Machine Learning
Otoofi M
(2023)
Estimating friction coefficient using generative modelling
Otoofi M
(2024)
FrictionSegNet: Simultaneous Semantic Segmentation and Friction Estimation Using Hierarchical Latent Variable Models
in IEEE Transactions on Intelligent Transportation Systems
| Title | Novel gain-scheduled / parameter dependent optimal control methods for MPC |
| Description | Novel techniques for tube-based model predictive control of LPV (parameter varying) systems such as the UniPD self-balancing bike. A novel mathematical step allows us to perform convex optimisation over scheduled feedback policies, which was not previously possible |
| Type Of Material | Improvements to research infrastructure |
| Year Produced | 2024 |
| Provided To Others? | Yes |
| Impact | Novel MPC controllers for UniPD self balancing bike. Method will be published and disseminated in this year's IEEE conference on decision and control, and associated code made available online. |
| Description | Collaboration with University of Padova |
| Organisation | University of Padova |
| Country | Italy |
| Sector | Academic/University |
| PI Contribution | Ongoing collaboration on control systems design for the university of Padova (UniPD) 'Self-Balancing Bike' (SBB). |
| Collaborator Contribution | Access to SBB and associated technical support provided by UniPD for testing of novel MPC methods and control systems design techniques developed in this project. |
| Impact | https://doi.org/10.1080/00423114.2018.1506588 |
| Start Year | 2019 |
| Title | LPV MPC toolbox |
| Description | A MATLAB toolbox to allow other researchers to use our new MPC algorithms for LPV system arising from this research project. |
| Type Of Technology | Webtool/Application |
| Year Produced | 2025 |
| Open Source License? | Yes |
| Impact | None yet |
| URL | https://github.com/jflmng/lpvmpc_toolbox |
| Description | IMechE workshop at UKACC CONTROL 2024 conference |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Industry/Business |
| Results and Impact | Participation and dissemination of project through IMechE sponsored workshop at CONTROL 2024 conference. |
| Year(s) Of Engagement Activity | 2024 |
| Description | Participation in ACE Network ECR Webinar - Control for Smart and Sustainable Mobility |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Industry/Business |
| Results and Impact | This seminar aims to gather ECRs in control engineering to discuss critical challenges and opportunities in the evolution of mobility systems, triggered by cutting-edge control and AI techniques. It will feature ECRs working across diverse domains within the context of future mobility, including road and railway systems, mobility electrification, and other emerging transportation technologies. Industry presenters will share insights from early deployments and the challenges encountered. The seminar will translate these insights into a practical and impactful research roadmap, promoting research collaboration within the ACE network, and preparing us for upcoming funding opportunities. |
| Year(s) Of Engagement Activity | 2025 |
| URL | https://ukcontrol.org/webinar-control-technologies-for-smart-and-sustainable-mobility-systems |
