Turing AI Fellowship: Reinforcement Learning for Healthcare

Lead Research Organisation: Imperial College London
Department Name: Bioengineering

Abstract

In this Turing Artificial Intelligence Acceleration Fellowship, I will focus on artificial intelligence for medical treatments and therapies. I take the view that AI is a question on how to realise artificial systems that solve practical problems currently requiring human intelligence to solve, such as those solved by clinicians, nurses and therapists. Critical care is high risk and highly invasive environment caring for the sickest patients at greatest risk of death. Patients within this environment are highly monitored, enabling sudden changes in physiology to be attended to immediately. In addition, this monitoring requires a heavier staffing ratio (often 1:1 nursing; 1:8 medical) and variances in human factors and non-technical pressures (e.g. staffing, skill-mix, finances) leads to critical care delivery being disparate.

AI in healthcare is a hard problem as, due to the diversity and variability of human nature, systems have to cope with unexpected circumstances when solving perceptual, reasoning or planning problems. Crucially, AI has two facets: Understanding from data, and Agency. While rapid strides have been made on learning from data, e.g. how to make medical diagnosis more precise and faster than human experts, there is little work on how to carry on after the diagnosis, e.g. which therapy and treatment to conduct. The latter requires agency and has seen fewer applications as it is a harder problem to solve.

My clinical partners and I want to develop the required AI algorithms that can learn and distil the best plan of action to treat a specific patient, from the expert knowledge of clinicians. We will focus on an area of AI called RL that has been successful in enabling robots and self-driving cars to learn a form of autonomous agency. We want to transform these methods into the healthcare domain. This will require the development of new RL algorithms, able to efficiently understand the state of a patient from noisy and ambiguous hospital data. The system will not only learn to recommend interventions such as prescribing drugs and changing dosages as needed per patient but to make these recommendations in a manner that is meaningful to the clinical decision-makers and helps them make the best final decision on a course of action.

The methods developed as part of this project can be used in different applications beyond healthcare. Many sectors within industry, such as aerospace, or energy, deal with similar bottlenecks. These are highly regulated environments, with great need for decisions making support, but a scarcity of highly skilled human experts. With sufficient data, our methods can be applied to these sectors as well, to distil the required human expertise and best practices from top experts, and use them to drive decision making all over the sector, for increased efficiency and safety.

Publications

10 25 50
 
Description The aim of my Turing AI Fellowship is to develop reinforcement learning-based AI for human interventions. Core of this work is the idea to use off-line reinforcement learning (the AI Clinician) to model the state of a person (here an intensive care patient) and learn from historical data how human expert interventions (here an intensive care doctor) how to treat patients better.The core of the grant is focussed on healthcare (the so called AI clinician) and industry partners (Shell, Airbus) bring in applications to other domains of industry and services. The active clinical trials on the AI Clinician (which is now an NIHR/NHS flagship project and deployed in 4 London hospitals) drives a lot of the underpinning AI work that is needed beyond As the first example of clinically trialled AI-learned Digital Therapeutics our work has by now been replicated in over 30 independent studies.
The first, foundational problem we cracked (Li & Faisal, 2021) required methods to handle the fact that hospital data are only painting a partial observable picture of the patient and we required the ability to handle continuous state and action spaces in near real-time, while representing uncertainty that the model had - which resulted in the called Bayesian Distributional Policy Gradients (AAAI, 2021) which combines a Bayesian RL model for parameter uncertainty and a distributional RL model for return uncertainty in a principled manner with a combined posterior. A clear lesson learned from the clinical evaluation of the RL agent was that we needed better methods for a. uncertainty quantification from offline RL and developed methods to estimate epistemic & aleatoric components (to communicate risk to clinicans) (Festor et Faisal, 2021, ICML MLHC) , b. Handling hierarchical decomposition of the policy (for explainabiltiy purposes), c. Safety-critical aspect (to ensure safety of the learned decisions) (Festor et al, 2022 BMJ HIC) and d. Methods that allow us to empirically measure which and what explanations and AI recommendations had the biggest impact on clinical decision making (Shafti et Faisal, 2022), e. The ethics of AI decisions on humans (Post, Brett & Faisal, 2022, AI & Ethics). f. Developing methods that combine federation & transfer learning to learn from data across different sites. Crucially, these developments are now been combined into forthcoming major paper that define AI Clincian 2.0 which is going into clinical testing this year. Follow on funding allows us to take the AI Clinician to paediatric intensive care. We also expaneded the application base for this digital therapeutics work (pandemic-wide intensive care work was slow), and so we expanded this into settings where we found more capacity to apply ourselves namely the treatment of neurological disorders. This required the development of appropriate patient state modelling that was on par with the intensive care work (Kardivelu et Faisal, Nature Medicine 2023; Ricotti et Faisal, Nature Medicine 2023) before we could apply these to reinforcement learning based neurological interventions (eg Wannawas & Faisal, 2021,2022,2023). In addition this personal fellowship was meant to accelerate Prof Faisal's career, this has been realised by Prof Faisal having been named as a new Director of the Alan Turing Institute, the national AI research lab.
Exploitation Route The AI Clinician is now in active clinical trials with end-users (intensive care clinicians) and in 4 hospitals across London (St Marys, Charing Cross, Hammersmith & UCLH). There is a glboal community of over 30 research groups using our models and data processing to develop the AI Clinician idea further.
Sectors Communities and Social Services/Policy

Digital/Communication/Information Technologies (including Software)

Energy

Healthcare

Manufacturing

including Industrial Biotechology

Pharmaceuticals and Medical Biotechnology

Transport

Other

URL https://aiclinician.icu
 
Description The active clinical trials (the AI Clinician is now an NIHR/NHS flagship project and being deployed in 4 London hospitals since 2022 - the fellowship drives a lot of the underpinning AI work that is needed beyond As the first example of clinically trialled AI-learned Digital Therapeutics our work has by now been replicated in over 30 independent studies. Clinical sites from the US to the Netherlands are now deploying the AI clinician code/software stack/data selectors for their own trials.
First Year Of Impact 2021
Sector Aerospace, Defence and Marine,Education,Energy,Healthcare
Impact Types Cultural

Economic

Policy & public services

 
Description AI for decoding and executing human intentions
Geographic Reach Multiple continents/international 
Policy Influence Type Participation in a guidance/advisory committee
Impact Changes in regulatory pipelines have been evaluated and discussed in response to our inputs and are in active change.
 
Description BBSRC AI strategy
Geographic Reach National 
Policy Influence Type Contribution to a national consultation/review
Impact Advised BBSRC on AI strategy which led to new BBSRC AI call.
 
Description MRC AI strategy panel
Geographic Reach National 
Policy Influence Type Contribution to a national consultation/review
Impact Guidance recommendation for MRC Biostatistics Unit strategy
 
Description Airbus Space & Defense - Airbus grant to support Turing AI Fellowship
Amount £100,000 (GBP)
Organisation Airbus Group 
Sector Academic/University
Country France
Start 01/2021 
End 12/2023
 
Description Rosetrees Interdisciplinary Award
Amount £300,000 (GBP)
Funding ID Only one prize award is awarded every year. 
Organisation Rosetrees Trust 
Sector Charity/Non Profit
Country United Kingdom
Start 09/2023 
End 09/2027
 
Description Shell - PostDoc research grant to support Turing AI Fellowship
Amount £408,727 (GBP)
Organisation Shell Centre 
Sector Private
Country United Kingdom
Start 01/2021 
End 12/2025
 
Description Skyrmionics for Neuromorphic Technologies
Amount £703,710 (GBP)
Funding ID EP/V028189/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 09/2021 
End 09/2024
 
Description Wearable Deep Spinal Interfacing (WDSI)
Amount £202,026 (GBP)
Funding ID EP/X018466/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 11/2022 
End 04/2024
 
Description Airbus Partnership in Machine Learning 
Organisation Airbus Group
Department Airbus Defence and Space UK
Country United Kingdom 
Sector Private 
PI Contribution Airbus Space & Defense want to collaborate with on translation of my foundational AI on reinforcement learning for human-in-the-loop control to work in the aerospace domain, specifically Robotics applications. The support included a jointly supervised PhD student and the time and efforts of the Airbus Data R&D Science and Robotics team in the UK. The collaboration was supported now by a new Airbus CASE PhD studentdship.
Collaborator Contribution Funding for a PostDoc to kickstart a project.
Impact Publications will be listed in the publication section.
Start Year 2021
 
Description Partnership with Shell on Reinforcement Learning 
Organisation Shell Global Solutions International BV
Department Shell Research Ltd
Country United Kingdom 
Sector Private 
PI Contribution The purpose of this is to engage a Postdoctoral researcher and translate the impact of the Turing AI Fellowship's foundational AI work into the energy sector application domain. This is to work directly with Daniel Jeavons, the General Manager of Data Science at Shell Research in addition, Shell commit to working with me directly through discussion meetings on a monthly basis as an in-kind contribution to support the Fellowship. Since the start of this partnership the collaboration was enriched by a further Shell CASE studentship.
Collaborator Contribution Funding for PostDocs and an EPSRC CASE studentship.
Impact Publications will be listed under the publication section.
Start Year 2021
 
Description Siemens Parternship in Digital Twins 
Organisation Siemens AG
Country Germany 
Sector Private 
PI Contribution Dr Annemarie Frie, a Head of Siemens corporate research (aka the AI and robotics research division) commit to working with me directly through monthly discussion meetings as an in-kind contribution to support the Fellowship. For
Collaborator Contribution see above.
Impact Publications will be listed in the publication seciton
Start Year 2021
 
Title AI Clinician XP2 
Description The cornerstone of sepsis resuscitation is the administration of intravenous fluids (IVF) and/or vasopressors (drugs that squeeze the blood vessels to increase blood pressure) to maintain blood flow to prevent organ failure. However, there is huge uncertainty around the individual dosing of these drugs in an individual patient, partially due to high sepsis heterogeneity. The current guidelines provide recommendations at a population-level but fail to individualise the decisions. Wrong decisions lead to poorer outcomes and increased ICU-resource use. A tool to personalise these medications could improve patient survival. We have developed a new method to automatically and continuously review and recommend the correct dose of these medications to doctors, which was created using artificial intelligence (AI) techniques applied to large medical databases. The method we used is called reinforcement learning, and we call the technology the "AI Clinician". 
Type Support Tool - For Medical Intervention
Current Stage Of Development Early clinical assessment
Year Development Stage Completed 2024
Development Status Under active development/distribution
Clinical Trial? Yes
Impact This present experiment will test the feasibility of running the AI Clinician in real-time in operational ICUs, in preparation for a future large scale multicentre randomised trial that will test for an improvement in clinically relevant outcomes. At this stage and in the interest of focusing on prescribers first, we will only be testing the use of the system by ICU doctors. Studies with nurses will be conducted in the future. 
 
Title Passive Evaluation in Operational Environment of the AI Clinician Decision Support System for Sepsis Treatment (AI Clinician XP1) 
Description We have developed a new method to automatically and continuously review and recommend the correct dose of these medications to doctors, which was created using artificial intelligence (AI) techniques applied to large medical databases. The method we used is called reinforcement learning, and we call the technology the "AI Clinician". In thisAI Clinician XP1 trial, we tested the safety of the AI Clinician when running in "shadow mode", i.e. in pseudonymised batches of patient data presented to off-duty ICU clinicians. This enabled us to 1) develop methods and software to connect to real-time electronic health records (EHR); 2) check the safety of the algorithm when used in a contemporary UK ICU patient cohort. 
Type Support Tool - For Medical Intervention
Current Stage Of Development Refinement. Clinical
Year Development Stage Completed 2024
Development Status Under active development/distribution
Clinical Trial? Yes
Impact We want to test the feasibility of running the AI Clinician in real-time in the ICU of 4 sites across a period of 6 months and identify patients suitable for the system. 
 
Company Name Ethomix Ltd 
Description  
Year Established 2024 
Impact The company was recently formed, in the run up to the formation the company won several incubator and startup awards, and is now getting ready to trade with their first customer.
 
Description BBC exclusive feature on our work (including TV, news, documentary and web features) 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Exclusive coverage of our work in BBC
Year(s) Of Engagement Activity 2023
URL https://www.bbc.co.uk/news/science-environment-64326125
 
Description Co-organiser of the International Conference on Complex Acute Illness (ICCAI) 2024 in Bethesda (MD, USA) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact I have become a co-organiser and been nominated as board member of the leading serious and complex illness society and their annual conference (ICCAI).
Year(s) Of Engagement Activity 2023,2024
URL https://iccai.org
 
Description Disability in AI (talk) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Public/other audiences
Results and Impact A public engagement talk on Disability, AI and Assitive Technology at UCL
Year(s) Of Engagement Activity 2022
 
Description IEMTRONICS 2023 Keynote 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Keynote at technology conference
Year(s) Of Engagement Activity 2023,2024
 
Description Industry Keynote at FZI Karsruhe 
Form Of Engagement Activity Participation in an open day or visit at my research institution
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Keynote at Forschungszentrum Informatik (FZI) in Karlsruhe (Germany) one of Germany's major private-public industry research institutes.
Year(s) Of Engagement Activity 2023,2024
 
Description Keynote at Annual All-German-Medical-School conference (NUM) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Keynote at key event held in Berlin at which all German medical schools participated.
Year(s) Of Engagement Activity 2024
URL https://www.linkedin.com/posts/drgoeschl_num-convention-2024-activity-7154096014960771072-w_PO/
 
Description Keynote at Open Medical Forum held at TU Munich 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Keynote at major medical event held in Munich.
Year(s) Of Engagement Activity 2023,2024
URL https://www.tum.de/en/news-and-events/all-news/press-releases/details/the-importance-of-a-transdisci...
 
Description Keynote at annual governing political party closed-door retreat 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Policymakers/politicians
Results and Impact Gave keynote at an annual governing party retreat that has lead to government decisions and cabinet decisions.
Year(s) Of Engagement Activity 2023,2024
 
Description Keynote at the American Association for the Advancement of Science (AAAS) in Washington DC 8/11/2023 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Keynote at global open forum on research in Washington, DC (USA).
Year(s) Of Engagement Activity 2023,2024
URL https://www.science.org/content/webinar/ai-mental-health-opportunities-and-challenges
 
Description Symposium on Artificial Intelligence in Health & Care: Regulation, Evaluation & Policies 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Policymakers/politicians
Results and Impact Artificial Intelligence has already proven its potential to reshape healthcare technologies. It offers improved diagnosis, intervention, health system efficiency, and preventive medicine in the hands of health professionals, patients, carers, and the public.

However, established ways of testing and regulating technologies are not easy to apply to AI. In this symposium, presentations and panel discussion will explore:

Evaluation and regulatory approaches, and how they can be developed to allow for faster and safer innovation.

New horizons in AI, some of which will make it easier to ensure AI is explainable, reliable, while others will create new challenges for evaluation and regulations.

The Symposium is for researchers and professionals in healthcare, government, business and academia. It will be run in three blocks running on the 7, 17 and 21 of September as a hybrid event on campus and online (Teams) starting at 14.30 each day.

Speakers and panel members confirmed include experts from MHRA, NHSX, American College of Radiology, Roche Diagnostics, Kheiron, British Standards Institute, and universities and academic medical centres of Birmingham, Cambridge, Harvard, Leeds, Oxford, Stanford, Duke University, UCL and Imperial College London.
Year(s) Of Engagement Activity 2021
URL https://www.eventbrite.co.uk/e/ai-in-future-healthcare-regulation-evaluation-policies-registration-1...
 
Description World Economic Forum - AI Summit Spring 2024 and Fall 2024 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Policymakers/politicians
Results and Impact Building on the success of the inaugural Responsible AI Leadership Summit and the launch of the AI Governance Alliance (AIGA), the AI Governance Summit brings together more than 200 AI leaders, including Alliance community members and a broader AI community of experts and influential decision-makers. This collaborative gathering serves as a pivotal platform for knowledge sharing, strategic outlook and the formulation of concrete action plans to ensure the responsible development and deployment of generative AI on a global scale. I was an invited member.
Year(s) Of Engagement Activity 2023,2024
URL https://www.weforum.org/events/ai-governance-summit-2023/