Improving the statistical efficiency of randomised controlled trials through their design

Lead Research Organisation: London School of Hygiene and Tropical Medicine

Department Name: Epidemiology and Population Health

Abstract

Randomised controlled trials (RCTs) are a way of assessing whether medical treatments are effective or not, in which patients are randomly allocated to receive the treatment or to a comparison group and their outcomes compared. RCTs are considered to be the gold-standard method of comparing treatments, but they are expensive and time consuming to conduct. Designing them well is therefore crucial. One way of improving the design of RCTs is to increase their statistical efficiency, which is a measure of the ability of an RCT to detect a treatment effect if one is present. If we can design more statistically efficient RCTs, then it would be possible to run them with fewer patients or for a shorter length of time, which will also reduce costs. The major objectives of this fellowship are to extend existing methods to compare the statistical efficiency of different RCT designs, and to explore the effect of different designs in several case-studies.

One way of increasing the statistical efficiency of an RCT is to measure a patient's outcome more than once. When planning such an RCT, it is necessary to make choices about design features such as the number and spacing of follow-up visits. The first aim of this fellowship is to investigate, and extend as necessary, existing methods for assessing the efficiency of different RCT design features. The methods will be applied to several case studies, including data from two RCTs - one of multiple sclerosis patients and the other looking at the effect of statins on muscle pain.

Many RCTs suffer from missing data, for example if some patients miss a follow-up visit. In addition, patients do not always receive the treatment they are assigned, for example if they forget or do not want to take their treatment. The second aim of this project will be to extend the aforementioned methods to cope with these situations, and to investigate how the optimal choice of design features changes in these circumstances.

The final aim of the fellowship is to explore what types of data can be used to plan the design of future RCTs. There may not be any data from previous RCTs available to help design future ones, but there might be routinely collected data sources, such as disease registries and electronic health records. However, it is not clear whether such data sets can be used to help choose design features. This fellowship will use two case-studies to assess the feasibility of using routinely collected data to design RCTs. The first case-study will use two different sources of data from patients with Huntington's disease: one which is collected in a very controlled manner, and the other which is a disease registry. The second case-study will use data from primary care electronic health records.

By developing methods to assess the statistical efficiency of different design features, this research will provide tools and information that can be used by researchers who design RCTs in the future.

Technical Summary

Randomised controlled trials (RCTs) are an important tool for assessing the effectiveness of medical interventions. RCTs are expensive and time consuming, and designing more statistically efficient trials is therefore crucial. The primary objectives of this research are to develop and extend existing methodologies to assess the statistical efficiency of RCT designs, and compare the efficiency of different design features. This will be achieved through addressing the following questions:

1. How do design features, such as the number of interim visits, length of follow-up, and use of complex designs such as run-in periods, affect the statistical efficiency of an RCT?
2. What effect does missing data and/or non-compliance have on the optimal choice of such design features?
3. Is it possible to use routinely collected data such as electronic health records and disease registries to inform the optimal design of RCTs?

These questions will be addressed by developing and extending methods for assessing the statistical efficiency of RCTs, including existing methodology based on linear mixed models. The methods will be explored using both simulation and application to case-studies, including RCT data from multiple sclerosis patients and a series of N-of-1 trials, and observational data from patients with Huntington's disease. This research will focus on two types of RCT: those which have an outcome that is a rate, and series of N-of-1 trials. These types of design are of particular interest since they can offer greatly improved statistical efficiency compared with standard alternatives. Due to the complexity of these designs, there are many choices to be made about the design features. Developing methods that allow the optimal design features to be chosen offers the opportunity to maximise the efficiency of these designs.

The proposed research will provide information and methodologies that can be used to design statistically efficient RCTs.

Planned Impact

Randomised controlled trials (RCTs) are the gold-standard method for assessing whether medical interventions are effective or not, but they are expensive to conduct. Recruitment to RCTs can be difficult and many trials fail to meet their recruitment targets, resulting in under-powered trials with too few patients. One consequence of lack of power is that RCTs investigating truly effective treatments may fail to give statistically significant results; these treatments may then not be used in practice. Another is that under-powered RCTs which find statistically significant benefits are likely to over-estimate treatment effect sizes, as explained by Button et al. in "Power failure: why small sample size undermines the reliability of neuroscience", Nature Rev Neuroscience 14 (2013) p365. This overestimation could lead to treatments being adopted in clinical practice which in fact are not as effective as they are thought to be, and may not meet cost-effectiveness requirements set by bodies such as NICE. Given these points, it is crucial to design RCTs with statistical efficiency in mind, and to improve efficiency through careful choice of design features. The former will ensure RCTs have appropriate power, and the latter will enable them to be run with fewer patients or for a shorter time.

This fellowship therefore has the potential to have a large impact, both within academic circles and more widely. Patients would be major beneficiaries of this research. Designing statistically efficient trials can lead to shorter RCTs, in which case information about the most effective treatments could be delivered to patients and the clinicians who treat them more quickly. Patients would also benefit through RCTs being run with a smaller sample size, since fewer patients would be exposed to potentially harmful interventions.

Other beneficiaries of the project would include researchers involved in planning and running RCTs, as well as statisticians with an interest in RCT design and methodology. This research would provide tools and information which would enable trialists to run more statistically efficient RCTs. Potential beneficiaries would therefore also include research councils, charities and other bodies who fund RCTs. If more statistically efficient trials are designed, this will enable RCTs to be smaller or shorter. This would reduce costs, and RCT funders would get more results for their money.

An additional benefit to both patients and researchers would be having fewer underpowered trials, either through insufficient sample sizes being used or due to recruitment targets not being met. As explained above, this can lead to true treatment effects being either missed, or overestimated if statistically significant. If RCTs are designed with appropriate statistical power this is less likely to happen.

The research relating to N-of-1 trials would be of particular benefit to the management of chronic diseases. The primary aim of conducting N-of-1 trials is to communicate to patients which treatment is best for them - a step towards personalised medicine. Designing statistically efficient N-of-1 trials would provide direct benefits to patients and the clinicians who treat them.

Results obtained by applying the methods to case studies would have an impact within the specific disease area of those studies. For example, when applying the methods to data from MS-STAT, an RCT in patients with secondary progressive multiple sclerosis (MS), valuable information as to which design features are statistically efficient would be gained. This information would be of great value when designing other MS trials. The results may have limited generalisability to other disease areas: however, the methods that would be developed and extended as part of this fellowship would have a wider generalisability and impact. Researchers could take these methods and apply them to other disease areas, and use them to design RCTs in other settings.

Funded Value:

£266,308

Funded Period:

Aug 17 - Dec 23

Funder:

MRC

Project Status:

Closed

Project Category:

Fellowship

Project Reference:

MR/P014372/1

Principal Investigator:

Katharine Morgan

Health Category:

Unclassified

Organisations

People	ORCID iD
Katharine Morgan (Principal Investigator / Fellow)

Publications

Author Name

Title Publication Date Published

10 25 50

Bridge H (2024) Negative variance components and intercept-slope correlations greater than one in magnitude: How do such "non-regular" random intercept and slope models arise, and what should be done when they do? in Statistics in Medicine

Hemming K (2019) Quality of stepped-wedge trial reporting can be reliably assessed using an updated CONSORT: crowd-sourcing systematic review. in Journal of clinical epidemiology

Imahori Y (2020) The association between anthropometric measures of adiposity and the progression of carotid atherosclerosis. in BMC cardiovascular disorders

Leyrat C (2018) Response to: How to design and analyse cluster randomized trials with a small number of clusters? Comment on Leyrat et al. in International journal of epidemiology

Morgan KE (2025) Applying the Estimands Framework to Non-Inferiority Trials: Guidance on Choice of Hypothetical Estimands for Non-Adherence and Comparison of Estimation Methods. in Statistics in medicine

Morgan KE (2019) Reflection on modern methods: calculating a sample size for a repeatability sub-study to correct for measurement error in a single continuous exposure. in International journal of epidemiology

Morgan KE (2023) How important is the linearity assumption in a sample size calculation for a randomised controlled trial where treatment is anticipated to affect a rate of change? in BMC medical research methodology

Nash S (2024) Erratum: Power and sample-size calculations for trials that compare slopes over time: Introducing the slopepower command in The Stata Journal: Promoting communications on statistics and Stata

Nash S (2021) Power and sample-size calculations for trials that compare slopes over time: Introducing the slopepower command. in The Stata journal

Collaboration
Software and Technical Products
Engagement Activities


Description	Collaboration with UCL
Organisation	University College London
Country	United Kingdom
Sector	Academic/University
PI Contribution	I have provided advice on statistical methods, helped write a paper, and supported the partner in submitting a fellowship application.
Collaborator Contribution	The partner has successfully applied for a fellowship, which includes funding for statistical support on methodological issues from me. They also proposed the idea for and led work on the paper.
Impact	A paper has been accepted for publication in The Journal of Prevention of Alzheimer's Disease. The partner's application for fellowship funding was successful.
Start Year	2020


Description	Collaborations - software package
Organisation	University College London
Department	Medical Research Council Clinical Trials Unit (MRC CTU) at UCL
Country	United Kingdom
Sector	Public
PI Contribution	I have helped develop and test a Stata software package that is an extension of slopepower.
Collaborator Contribution	The partners have developed an extension to the slopepower package.
Impact	Talk given by partner at ICTMC 2024. Stata software package in development, to be released soon.
Start Year	2023


Title	Stata package "slopepower"
Description	I have helped to develop and test a user-written command for Stata software called "slopepower", along with some collaborators at LSHTM. We have also written an associated help file. Although Stata is proprietary software, user-written packages are freely downloadable to anyone who wishes to use it. The package enables Stata users to perform sample size or power calculations for trials which compare slopes over time.
Type Of Technology	Software
Year Produced	2018
Impact	Use of the "slopepower" package will allow trialists to fit a mixed model to an existing data set containing data on people with the condition of interest, and use parameter estimates from that model to predict sample sizes for a variety of different trial designs. People who plan these types of trial will be able to use this software to help compare the sample sizes required for different trial designs.
URL	https://ideas.repec.org/c/boc/bocode/s458484.html


Description	Poster presented at ICTMC 2024
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Presented a poster entitled "How important is the linearity assumption in a sample size calculation for an RCT where treatment is anticipated to affect a rate of change?" at the 2024 International Clinical Trials Methodology Conference.
Year(s) Of Engagement Activity	2024


Description	Talk given at ISCB conference 2020
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Gave an oral presentation entitled "What happens to sample size calculations for series of N-of-1 trials when outcomes aren't normally distributed?" at the 2020 conference of the International Society for Clinical Biostatistics.
Year(s) Of Engagement Activity	2020


Description	Talk given at ISCB conference 2021
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Gave an oral presentation entitled "Dangers of wrongly assuming linearity in a trial sample size calculation when treatment affects rate of change" at the 2021 conference of the International Society for Clinical Biostatistics.
Year(s) Of Engagement Activity	2021


Description	Talk given at meeting of Neurodegenerative Diseases Statistics Working Group (Neustats)
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	National
Primary Audience	Other audiences
Results and Impact	Talk given at a meeting of the Neurodegenerative Diseases Statistics Working Group (Neustats), entitled "Dangers of wrongly assuming linearity in a trial sample size calculation when treatment affects rate of change". Members of Neustats include statisticians and clinicians who are involved in running clinical trials, including trials for which this research could be of direct import.
Year(s) Of Engagement Activity	2023