Virtual Clinical Trial Emulation with Generative AI Models

Lead Research Organisation: University of Strathclyde
Department Name: Computer and Information Sciences

Abstract

This research will apply newly emerging generative AI technology to transform biomedical and health research by enabling virtual clinical trial emulation with synthetic data. It will overcome key limitations in both Randomised Controlled Trials (RCTs) and observational studies. RCTs have long been considered as the "gold standard" to evaluate treatments and medicines. However, they are far from being able to answer all clinical questions. In addition to time and cost constraints, RCTs have significant limitations to generalise their findings as their scope is limited. In many situations conducting RCTs with real patients is logistically challenging or unethical due to their potentially harmful nature. This leaves a significant knowledge gap, for example, we still have very limited clinical guidelines about how to manage multi-morbidities. While observational studies can overcome the issues faced by RCTs by leveraging routinely collected data from real world, they are typically imbalanced across population, diseases and interventions; there are a significant amount of noise and missing measurements in the data, and we need lengthy time and significant effort to remove patient identifiable information from the data to protect privacy. More importantly, treatment choices and outcomes in real world clinical cases may depend on factors that are not measured within the data, which may invalidate the observational study.
AI research has made great advances in creating new data. With new generative AI models, we can generate synthetic patient populations that faithfully preserve the statistical attributes of real populations. Compared with anonymised real data, synthetic data can be generated in unlimited volume while containing "zero" information about real individuals. Hence, they are in a much better position to overcome legal barriers in data protection and sharing. More importantly, experiments with synthetic data will allow clinical researchers to perform "virtual-trials" to gain quantitative insight into causal relations between treatment and its effect. This will enable prediction and comparison of hypothetical treatments to seek answers to important research questions that currently cannot be answered in real trials. The success of this adventurous and timely research will bring a landscape change to revolutionise future biomedical and health research by broadening its research agenda, liberating its restrictions, saving cost and time, leading to significant benefits to healthcare by speeding up new timelines for treatment discovery, addressing increasingly complex healthcare landscape in elderly population and multi-morbidity, and transforming regulatory and policy making process.

Technical Summary

This research will apply newly emerging generative AI technology to transform biomedical and health research by enabling virtual clinical trial emulations with synthetic data. AI research has made great advances in creating new data. With new generative AI models, we can generate synthetic patient populations that faithfully preserve the statistical attributes of real populations. Experiments with synthetic data will allow clinical researchers to perform "virtual-trials" to gain quantitative insight into causal relations between a target effect and its causes. This will enable prediction and comparison of hypothetical interventions to seek answers to important research questions that currently cannot be answered in real trials, including clinical questions associated with underrepresented populations of children, older adults, and patients with multi-morbidities and polypharmacy - these people are commonly excluded in trials.

The project aims to validate the concept by seeking answers to the primary research questions including: 1) Can we use generative AI models to generate synthetic data that preserve the same value for research as real-world health data? 2)Can we perform virtual clinical trial emulations by discovering correct causal relations from the synthetic data?

Within the scope of this feasibility study, we will focus on a specific Type 2 diabetes mellites (T2DM) use case, which is a confirmatory study to test the ability of the AI-driven virtual trial emulations by emulating established clinical trials to replicate their results. We will target the effect of an exemplar drug (i.e. Liraglutide, a GLP-1 receptor agonist). Over the years, several RCTs have been conducted in the LEAD (Liraglutide Effect and Action in Diabetes) program to collect detailed evidence about the effect of this drug on HbA1c and weight, cardiovascular risk, the lipid profile, and blood pressure. The trial emulations will attempt to rediscover their findings.
 
Description Using Artificial Intelligence to Determine the Factors Associated with Frequent Attendance at Healthcare Service Facilities
Amount £195,000 (GBP)
Funding ID 270619 
Organisation NHS Lanarkshire 
Sector Public
Country United Kingdom
Start 01/2024 
End 04/2024
 
Description o Causal Counterfactual visualisation for human causal decision making - A case study in healthcare
Amount £598,000 (GBP)
Funding ID EP/X029778/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 06/2023 
End 12/2025
 
Description NHS Lanarkshire 
Organisation NHS Lanarkshire
Country United Kingdom 
Sector Public 
PI Contribution Feng Dong has been appointed as an honorary professor in NHS Lanarkshire. The contribution area is AI for health.
Collaborator Contribution The NHS Lanarkshire offers clinical expertise and clinical scenarios.
Impact We have submitted a number of research proposal. We have also set up a joint MSc project in AI for health
Start Year 2021
 
Description NHSGGC 
Organisation NHS Greater Glasgow and Clyde (NHSGGC)
Country United Kingdom 
Sector Public 
PI Contribution research work on data analysis
Collaborator Contribution work on clinical input and data provision
Impact expected joint publications
Start Year 2022
 
Description Engagement Scottish Study Group for Diabetes 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact We presented at the Scottish Study Group for Diabetes to a group of diabetes clinicians (adult + paediatric consultants/trainees), epidemiologists and health economists - largely focused on diabetes in children and young people.
Year(s) Of Engagement Activity 2023
 
Description Health and Care Futures Showcase Event 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact Health and Care Futures Showcase Event with Golden Jubilee National Hospital, Scotland
Year(s) Of Engagement Activity 2023
 
Description Meeting with Scottish Government visits 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Policymakers/politicians
Results and Impact Present the project to the Head of Digital Health and Care in Scottish Government in a meeting.
Year(s) Of Engagement Activity 2024
 
Description Research collaboration working group 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Professional Practitioners
Results and Impact a working group of University of Strathclyde and Golden Jubilee National Hospital, Scotland
Year(s) Of Engagement Activity 2023
 
Description University Faculty AI Event 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Other audiences
Results and Impact A presentation about the project and the results to inform colleagues within the university
Year(s) Of Engagement Activity 2023