Statistical Methods for Causal Inference
Lead Research Organisation:
University of Bristol
Department Name: UNLISTED
Abstract
Working out which treatments or interventions are effective, using only observational data, needs increasingly sophisticated statistical methods. We aim to develop methods to enable researchers to estimate the effects of interventions as accurately as possible.
We will develop models to help identify those who would benefit most from a given intervention, so enabling better targeting of treatments. Most measures in observational data are made with some error (e.g. blood pressure varies throughout the day) and we will extend current methods to reduce the effect this has on causal estimates. We will also develop ways to use current methods based on mendelian randomization (MR) to improve analyses of non-genetic exposures, such as examining the benefits of cycling to work. Current methods tend to focus on one intervention, and we will extend these to examine the long-term effects, for example to assess the impact of remaining heavier than average throughout adolescence and adulthood vs losing weight during adulthood. Finally, we will improve methods for combining evidence from several different study types to answer the same question, by developing a triangulation framework.
We will develop models to help identify those who would benefit most from a given intervention, so enabling better targeting of treatments. Most measures in observational data are made with some error (e.g. blood pressure varies throughout the day) and we will extend current methods to reduce the effect this has on causal estimates. We will also develop ways to use current methods based on mendelian randomization (MR) to improve analyses of non-genetic exposures, such as examining the benefits of cycling to work. Current methods tend to focus on one intervention, and we will extend these to examine the long-term effects, for example to assess the impact of remaining heavier than average throughout adolescence and adulthood vs losing weight during adulthood. Finally, we will improve methods for combining evidence from several different study types to answer the same question, by developing a triangulation framework.
Technical Summary
Evaluating public health and clinical interventions requires estimation of causal effects. While randomised controlled trials (RCTs) permit causal inferences without making strong statistical assumptions, increasingly sophisticated statistical methods are available to estimate causal effects from observational data. However, each such method is based on a set of assumptions, violation of which may result in bias and thus erroneous conclusions.
Aim: To develop methods to minimise bias when estimating causal effects.
1. Modelling and examining heterogeneity. Modelling heterogeneity of a causal effect and identifying effect modifiers can help target interventions to those who would benefit most. We will develop efficient strategies for identifying effect modifiers in a single study, and meta-analytic methods to identify effect-modifiers across several studies. We will examine the impact of heterogeneity on bias of instrumental variable analyses.
2. Measurement Error. The exposure, confounders, effect modifiers or outcome variables may be measured with error, which can lead to bias in estimated causal effects. We will extend methods for modelling random and systematic measurement error, and for assessing the impact of complex measurement error mechanisms on causal effects estimated using different analytical methods.
3. Complex causal and lifecourse hypotheses. We will develop ways to use instruments for outcome and confounders to assess sensitivity to unmeasured confounding in multivariable analyses. We will then use instrumental variables with effects that vary over time and combine these with covariate-selection methods to identify timing of causal effects. These methods will be extended to examine causal effects of combinations of multiple exposures.
4. Selection bias. We will develop sensitivity analysis methods based on inverse probability weighting, and extend these to address collider bias in case-only studies of disease progression. We will develop methods to use external information (including knowledge about dependences between variables) to detect and minimise selection bias.
5. Triangulation. Formally relating causal estimands from MR studies and RCTs will improve the process of identifying drug targets and designing RCTs to test them. We will develop methods to quantitatively triangulate the results from MR studies and RCTs, addressing the different quantities estimated by each of these study types, as well as effect heterogeneity and the factors affecting selection into each study.
Aim: To develop methods to minimise bias when estimating causal effects.
1. Modelling and examining heterogeneity. Modelling heterogeneity of a causal effect and identifying effect modifiers can help target interventions to those who would benefit most. We will develop efficient strategies for identifying effect modifiers in a single study, and meta-analytic methods to identify effect-modifiers across several studies. We will examine the impact of heterogeneity on bias of instrumental variable analyses.
2. Measurement Error. The exposure, confounders, effect modifiers or outcome variables may be measured with error, which can lead to bias in estimated causal effects. We will extend methods for modelling random and systematic measurement error, and for assessing the impact of complex measurement error mechanisms on causal effects estimated using different analytical methods.
3. Complex causal and lifecourse hypotheses. We will develop ways to use instruments for outcome and confounders to assess sensitivity to unmeasured confounding in multivariable analyses. We will then use instrumental variables with effects that vary over time and combine these with covariate-selection methods to identify timing of causal effects. These methods will be extended to examine causal effects of combinations of multiple exposures.
4. Selection bias. We will develop sensitivity analysis methods based on inverse probability weighting, and extend these to address collider bias in case-only studies of disease progression. We will develop methods to use external information (including knowledge about dependences between variables) to detect and minimise selection bias.
5. Triangulation. Formally relating causal estimands from MR studies and RCTs will improve the process of identifying drug targets and designing RCTs to test them. We will develop methods to quantitatively triangulate the results from MR studies and RCTs, addressing the different quantities estimated by each of these study types, as well as effect heterogeneity and the factors affecting selection into each study.
Organisations
People |
ORCID iD |
| Kate Tilling (Principal Investigator) |
Publications
Burrows K
(2024)
A framework for conducting GWAS using repeated measures data with an application to childhood BMI.
in Nature communications
Carter AR
(2023)
Time-sensitive testing pressures and COVID-19 outcomes: are socioeconomic inequalities over the first year of the pandemic explained by selection bias?
in BMC public health
Cornish R
(2024)
Maternal pre-pregnancy body mass index and risk of preterm birth: a collaboration using large routine health datasets
in BMC Medicine
Curnow E
(2024)
Multiple imputation strategies for missing event times in a multi-state model analysis.
in Statistics in medicine
Curnow E
(2024)
Multiple imputation using auxiliary imputation variables that only predict missingness can increase bias due to data missing not at random.
in BMC medical research methodology
De Lange M
(2024)
A Hypothesis-Free Approach to Identifying Potential Effects of Relative Age in School Year: an Instrumental Variable Phenome-Wide Association Study in the UK Biobank
in American Journal of Epidemiology
De Lange MA
(2024)
The effects of daylight saving time clock changes on accelerometer-measured sleep duration in the UK Biobank.
in Journal of sleep research
Easey KE
(2024)
Challenges in using data on fathers/partners to study prenatal exposures and offspring health.
in Journal of developmental origins of health and disease
| Description | Science stall about the impact of environment and health at a music festival |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Public/other audiences |
| Results and Impact | Over the course of three days, over 1000 people engaged with activities on a stall about the impact of the environment on public health. Visitors to the stall included children, teenagers and adults, including people who work in academia and healthcare. People provided insight into our work and expressed interest in learning more. Some visitors on our stall stayed for over an hour. |
| Year(s) Of Engagement Activity | 2024 |
| Description | Secondary school visit (Keynsham) |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | Local |
| Primary Audience | Schools |
| Results and Impact | Researchers visited a secondary school Year 10 science class with 17 students. To tie in with student's learning, the students heard about and did activities about smoking and vaping. Students also got a taste of a range of health-related research and an opportunity for them to provide feedback and help future research in our unit. In addition, there was an opportunity to ask questions about the team's different journeys as scientists. The visit aimed to: reinforce learning for GCSE, particularly around non-communicable diseases; help the students gain deeper understanding of the science behind public health messages; and raise aspirations by giving a sense of the opportunities with studying science. Students fed back that they enjoyed the session and that it helped their learning. They also enjoyed being able to contribute to some PPIE for research around young people and vaping, which will also feed back into research. |
| Year(s) Of Engagement Activity | 2024 |
| Description | Stall at the Green Man festival |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Public/other audiences |
| Results and Impact | Ten researchers took a stall titled "Selection bias - How (not) to get the wrong answers with health data" to the Einstein's Garden section of the Green Man festival in South Wales. Through a range of fun activities aimed at all ages, the team demonstrated the following key messages: • Selection bias is a big challenge for public health research • Underrepresentation of particular groups of people in studies can lead to public health issues being misunderstood and some groups of people being missed or inadequately addressed in prevention, diagnosis and treatment • There are some ways we can fill the data gaps The aim of this engagement was to increase awareness of the value of participating in public health studies and improve the general public's understanding of how their health data is used in research and what challenges can emerge when groups are underrepresented. We also took the opportunity to gather insight from festival visitors about barriers and concerns that they have around sharing health data, which will help inform our research. |
| Year(s) Of Engagement Activity | 2023 |
| Description | Stall in Bristol city centre shopping area as part of the FUTURES festival |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | Local |
| Primary Audience | Public/other audiences |
| Results and Impact | Five researchers and related staff took a stall titled "Selection bias - How (not) to get the wrong answers with health data" to the Cabot Circus Shopping Centre in central Bristol one Saturday afternoon as part of the FUTURES festival. Through a range of fun activities aimed at all ages, the team demonstrated the following key messages: • Selection bias is a big challenge for public health research • Underrepresentation of particular groups of people in studies can lead to public health issues being misunderstood and some groups of people being missed or inadequately addressed in prevention, diagnosis and treatment • There are some ways we can fill the data gaps The aim of this engagement was to increase awareness of the value of participating in public health studies and improve the general public's understanding of how their health data is used in research and what challenges can emerge when groups are underrepresented. We also took the opportunity to gather insight from festival visitors about barriers and concerns that they have around sharing health data, which will help inform our research. We ran the stall alongside the Children of the 90s studies and encouraged people to participate or reengage with this study. |
| Year(s) Of Engagement Activity | 2023 |