Evolution forecasting for real-time blood-cancer risk prediction

Lead Research Organisation: University of Cambridge
Department Name: Oncology

Abstract

Cancer is a disease of evolution that plays out in our bodies over decades. As our cells divide, occasional errors ("mutations") in DNA replication occur which can disrupt the normal function of the cell. If these "mutant clones" survive long enough they can acquire further cancer-causing mutations that eventually drive the total break-down of controlled cell proliferation. The first steps of this evolutionary process begin years before the development of cancer. This raises the possibility that these early events could be used as a bellwether for predicting who is at risk of cancer. However, because it is difficult to measure the evolution taking place inside our bodies over such long timescales, our understanding of the evolutionary dynamics driving early cancer remains cursory. A key gap in our understanding is our inability to identify which mutant clones will progress to lethal cancers and which will remain benign.

Serial blood samples collected annually from hundreds of thousands of healthy people give us a superpower that can fill in these gaps in understanding. We can "zoom in" on the people who develop cancer, then "rewind" time by analysing blood samples collected years before the cancer was diagnosed. This provides a detailed "fossil record" of the disease, enabling one to determine when the cancer first arose and to watch the entire evolutionary life-history of the tumour unfold, one DNA error at a time. An attractive initial target for this ambitious vision is the aggressive blood cancer Acute Myeloid Leukemia (AML) because it is characterised by a relatively small number of cancer-causing mutations, which are readily detectable in the blood.

This UKRI FLF application sets out an innovative long-term research programme for blood cancer prediction and early detection. To achieve this, first we will combine population genetic theory with the vast amounts of sequencing data from the blood of >50,000 individuals to characterise the evolutionary dynamics that defines "normal". Second, by exploiting an extraordinary set of serial blood samples in hundreds of AML cases and cancer-free controls we will generate a unique dynamic dataset that paints a highly quantitative genetic portrait of how AML evolves from healthy tissue. Third, by mining this rich data resource, we will train a set of mathematical and statistical models that use evolutionary dynamics theory and simulation (which we have previously pioneered) to make probabilistic "forecasts" of leukemia risk from a blood sample. This FLF application will thus establish blood-cancers as a "model system" for early cancer detection by combining unique longitudinal samples, novel sequencing technologies and emerging statistical methods to predict blood cancer risk.

Planned Impact

When cancer is detected earlier, it is easier to treat successfully. As such, early detection of cancer is a major strategic focus for the UK. The major impact of this FLF application will be to accelerate the development of early detection tools for blood cancers by addressing three fundamental questions which will help the UK address the major challenge of early detection: predicting which pre-cancerous states will progress to lethal cancers (thus requiring intervention) and which remain benign or indolent. The ultimate impact of this research will be on patient outcomes.

Early detection requires samples collected before symptoms arise. Blood is an ideal system for this because large collections of serial bloods have been collected in the UK, from otherwise healthy people, for a number of long-term studies. These samples can be exploited to "rewind" time in people who develop cancer by analysing samples collected before diagnosis. This insight has been noted by US-based biotech firms with deep pockets (e.g. Grail, Google Life Sciences, Guardant Health and Freenome), all of whom have identified the need to start longitudinal studies of their own. It is crucial the UK regain the initiative by exploiting our unique existing samples resources and combine them with strengths in cancer-evolution, sequencing and quantitative skills. This FLF application will establish just that, by using blood-cancers as a "model system" for early cancer detection and linking unique longitudinal samples, novel sequencing technologies and emerging statistical methods to predict blood cancer risk. To establish this application's impact, we propose to make all (non-identifiable) data and analysis framework from this proposal rapidly available to the wider scientific community via a dedicated study portal. With a team and UK-based community of quantitatively trained clinicians, biologists, engineers and mathematicians dedicated to cancer-prediction though longitudinal profiling of biomarkers, these data will accelerate the development and commercialisation of early cancer detection tests.
 
Description The key objective from this award was to be able to predict blood cancer years before diagnosis. We now have substiantial evidence that we will be able to achieve this by the end of the award.

We have generated large volumes of genomic data from blood samples collected up to 10 years before cancer diagnosis. In many of these samples there is clear signal of a cancer which is detectable in some people up to 10 years prior to their diagnossis. We have found that this strong signal exists in both genetic and epigenetic assays and its strength and duration strongly suggests that we will be able to predict cancer in some individuals many years before a traditional diagnosis.
Exploitation Route Our work has clear implications for cancer screening and the performance of early cancer detection tests. It also has implications for possible interventions (i.e. treatments) for pre-cancer which will likely be relevant for pharma.
Sectors Healthcare,Pharmaceuticals and Medical Biotechnology

 
Description Our work has impacted the following areas: --Establishing clonal haematopoiesis (CH) clinics in the NHS-- Our work on CH (Watson et al., Science 2020) drew a link between the fitness advantage of mutations in CH and future blood cancer risk. A CH clinic is being set up at Addenbrookes Hospital Cambridge, with the aim that this will become a national referral clinic for patients with high risk CH or patients with acquired risk factors for haematological malignancy. Patients will undergo regular DNA sequencing, to assess clonal evolution, as well as blood count monitoring. The aim is to also set up clinical trials in order assess the effect of targeted therapies on halting the progression of CH to malignancy. --A new clinical longitudinal cohort study to analyse clonal dynamics during healthy ageing and disease-- We have established a new prospective longitudunal clinical cohort (Legacy Blood-- https://www.legacy-study.org/ -- run out of Addenbrookes Hospital) which aims to better understand which specific factors drive or impede the growth of mutations in the blood over time. The LEGACY study aims to answer these questions by studying blood samples collected at regular intervals (once every ~6 weeks) over a period of 18 months, from the same individuals, together with concurrent information on their health and lifestyle, collected through questionnaires, NHS medical and health-related records and wearable devices (Fitbit). By then continuing to follow the health of participants long-term (through medical and other health-related records), we hope to assess whether particular changes detectable in the blood might be predictive of future illness.
First Year Of Impact 2022
Sector Healthcare,Pharmaceuticals and Medical Biotechnology
Impact Types Policy & public services

 
Description Longitudinal tracking of genetic and epigenetic signals for blood-cancer risk prediction
Amount £249,082 (GBP)
Funding ID 29749 
Organisation Cancer Research UK 
Sector Charity/Non Profit
Country United Kingdom
Start 02/2020 
End 01/2022
 
Title Method for estimating selection in healthy tissues 
Description In the early stages of cancer development, cells with mutations conferring fitness advantages can outcompete their neighbors and clonally expand. However, our current knowledge of such driver mutations is still incomplete. My colleagues and I devised and published strategy to address this gap. The core idea of our method is to look not at driver mutations, but at passenger mutations, which are selectively neutral (for example, synonymous mutations that don't change protein sequences) but can reach high frequencies by genetic linkage with driver mutations. Our method is able to estimate genome-wide fitness effects and mutation rates of driver mutations (even including those that are not directly observed) by using information about passenger mutation abundance and frequency. 
Type Of Material Technology assay or reagent 
Year Produced 2021 
Provided To Others? Yes  
Impact Improved ability to identify new driver mutations in early cancer 
URL https://www.nature.com/articles/s41588-021-00957-1
 
Description Collaboiration with Elisa Laurenti 
Organisation University of Cambridge
Country United Kingdom 
Sector Academic/University 
PI Contribution We are undertaking a study with the lab of Elisa Laurenti looking at pre-leukaemic evolution using single-cell DNA sequencing technologies. We are performing the single-cell DNA sequencing and all subsequent analysis of the data
Collaborator Contribution Our partners in this collaboration are providing expertise on cell sorting.
Impact We have amassed results from 16 patient samples which will be written up into a manuscript for publication in the year 2023
Start Year 2020
 
Description Collaboration with Oregon Health Sciences University on AML prediction 
Organisation Oregon Health and Science University
Country United States 
Sector Academic/University 
PI Contribution We initiated this collaboration and lead the joint project.
Collaborator Contribution They are performing various genomic analyses on clinical samples
Impact Multidisciplinary: Genomics Statistics Clinical medicine
Start Year 2019
 
Description Collaboration with UCL and MRC clinical trials unit 
Organisation Medical Research Council (MRC)
Department MRC Clinical Trials Unit
Country United Kingdom 
Sector Academic/University 
PI Contribution We initiated this collaboration and our group is leading the joint project.
Collaborator Contribution They have provided crucial clinical samples and advice on study design.
Impact Multidisciplinary: Clinical medicine Clinical trials Genomics Physics
Start Year 2019
 
Description Collaboration with UCL and MRC clinical trials unit 
Organisation University College London
Country United Kingdom 
Sector Academic/University 
PI Contribution We initiated this collaboration and our group is leading the joint project.
Collaborator Contribution They have provided crucial clinical samples and advice on study design.
Impact Multidisciplinary: Clinical medicine Clinical trials Genomics Physics
Start Year 2019
 
Company Name SYNTENY BIOTECHNOLOGY LIMITED 
Description At Synteny Biotechnology, we propose a radically new approach for early cancer detection. Instead of detecting cancer directly, we propose to harness the adaptive immune system to detect it for us. By "reading out" what the immune system is recognising via a simple blood test, we propose to harness the body's in-built diagnostic to detect cancer anywhere in the body. Two features of the immune system make it an ideal detector of early cancer. First, it is exquisitely sensitive: able to detect a single molecule of a foreign antigen caused by, for example, a tumour or a viral infection. Second, the immune system amplifies an initially weak signal by exponentially expanding the immune cells that successfully recognise the tumour. We are establishing machine learning methods for "decoding" the immune system and use these methods to develop an accurate and cheap pan-cancer early detection test. We have raised in excess of £1.2m in an initial seed round from both angel investors and a Innovate UK Biomedical Catalyst award. 
Year Established 2021 
Impact Our current impact is providing full-time employment to 2 machine learning engineers. This is expected to grow to 5 FTE in the next few months.
 
Description Cambridge Conversations Event on early detection of cancer 
Form Of Engagement Activity A broadcast e.g. TV/radio/film/podcast (other than news/press)
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Supporters
Results and Impact a 50-min online interview about the future of early detection of cancer.
Year(s) Of Engagement Activity 2021
URL https://www.youtube.com/watch?v=dopY5oTCgOo&t=1118s&ab_channel=DearWorld%2CYoursCambridge