Deciphering and targeting cancer metabolism using executable models

Lead Research Organisation: University of Cambridge

Department Name: MRC Cancer Unit

Abstract

The progression of cancer is driven by mutations and errors in the cell that promote the "hallmarks of cancer". These are a set of distinct behaviours that cells in tumours possess, enabling the survival and growth of the cancer. A longstanding observation in cancer cells is the Warburg effect; the switch away from oxidative phosphorylation to anaerobic glycolysis. Recent evidence has shown that metabolic reprogramming- large scale modification of how the cells process energy- is achieved by mutations in oncogenes including KRAS. This enables cancer cell survival in the tumour. Furthermore mutations in individual metabolic enzymes, such as fumarate hydratase, can affect a transition from epithelial to a mesenchymal morphology. Both the links between individual metabolites and cell behaviour, and the role of the metabolic network in cancer development however remains unclear. Knowledge of both of these are necessary to interpret metabolic changes in cancer and to identify new drug targets that are robust to network effects.

Computational modelling offers the opportunity to formalise the relationships between elements in the network and to address this issue. However, conventional approaches based on ordinary differential equations (ODEs) and flux balance analysis (FBA) are not suitable. ODE models require precise physico-chemical parameters that are not available for human metabolism. FBA does not have this requirement but is not capable of modelling the accumulation of metabolites that can occur in response to mutation.

I propose to use an "executable" modelling approach to construct models of the metabolic network and link them to cellular behaviour. Executable modelling approaches do not require detailed physical parameters, and can model accumulation events. They have the further advantage that they are amenable to a class of model analysis known as formal verification. Here mathematical proofs are used to offer guarantees of cell properties that are encoded in formal logic. These guarantees can apply over all possible states of the system and so offer a uniquely powerful way to test model correctness. This is particularly useful for highly robust systems such as the metabolic network, but also in taking account of rare events. This latter issue is particularly important when considering how rare events determine the development of cancer. Model building will be guided by the use of machine learning approaches (t-SNE, decision forests) to address publicly available expression data and metabolomic datasets shared with us by AstraZeneca. Finally, these models will be used to make novel predictions about the metabolism can change cell phenotype. The development of these models will allow us to understand how metabolic networks drive cancer progression and help us identify novel oncogenes and drug targets.

Technical Summary

The progression of cancer is associated with extensive metabolic reprogramming. A key feature of cancer is the Warburg effect, where cells switch from relying on oxidative phosphorylation to anaerobic glycolysis. Similarly, KRAS and BRAF mutations drive widescale reprogramming of metabolism, whilst the mutation of individual metabolic enzymes such as fumarate hydratase can promote cancer progression.

Computational models of cancer metabolism could elucidate the links between metabolism and cellular phenotype. Here we propose the construction of an executable model of cancer metabolism, guided by the use of machine learning applied to high throughput expression and metabolomic datasets and the existing literature on cancer metabolism. Executable models do not require the detailed physical parameterisation necessary for continuous models, such as those based on ordinary differential equations. They also allow more complex model behaviours to be described than flux balance analysis, including metabolite accumulation. Finally, they are amenable to analyses based on formal verification, where mathematical proofs are used to offer powerful guarantees of model behaviour by effectively searching all possible states of the model. The models and the new predictions that arise from the model construction will be validated through experiments performed with the Frezza group.

Planned Impact

This project will lead to demonstrable contributions within academia, industrial research, and wider society. Complex models of metabolism will be constructed using existing publicly available and privately shared high throughput datasets analysed using machine learning approaches, alongside the literature on cancer metabolism. These models will be tested using formal verification based analyses. The work will give rise to valuable insights into how metabolism can determine the cancer phenotype, and will be published in high impact papers and through conferences.

Within the academic community, the interdisciplinary nature of the work will support biologists working in metabolism, and computer scientists working on formal verification approaches applied to large models. The models that are generated will encompass large numbers of observations that have been made and as such can be used to test and validate experimental approaches before they are taken into the laboratory. Similarly, model building in the life sciences has generated new problems for existing approaches and algorithms, requiring the development of new techniques for analysis. As such, the construction of new, complex models will support algorithm developments, at a minimum by extending the test sets with large, realistic models, or by requiring new research.

Through working with high throughput datasets, supplied in part by AstraZeneca, we will be able to apply our models to relevant data on the metabolic behaviours of different cell lines. This will give new insights into a large body of experimental data covering multiple cell lines under constant conditions. This in turn will be used to inform drug development within the company, and will lead to future collaborations based around the research.

In addition to impact from the direct results of the project, the work will have additional impacts important to the wider UK. One post-doctoral research associate will be trained over the course of the fellowship in the building and analysis of executable models of biological processes, and the development of formal specifications describing biological processes. Current models of metabolism have been used in outreach activities (including demonstrations for school parties) and will be used to demonstrate to the wider public how computational methods aid scientific understanding of cancer and wider biology. This includes the development of worksheets for A-level students currently under discussion with the OCR.

Funded Value:

£344,485

Funded Period:

Dec 18 - Dec 20

Funder:

MRC

Project Status:

Closed

Project Category:

Research Grant

Project Reference:

MR/S000216/1

Principal Investigator:

Benjamin Hall

Health Category:

Unclassified

Organisations

People	ORCID iD
Benjamin Hall (Principal Investigator)	http://orcid.org/0000-0003-0355-2946

Publications

Author Name

Title Publication Date Published

|< < 1 2 3 4 > >|

10 25 50

Abby E (2023) Notch1 mutations drive clonal expansion in normal esophageal epithelium but impair tumor growth. in Nature genetics

Abby E (2023) Notch1 mutations drive clonal expansion in normal esophageal epithelium but impair tumor growth.

Abby E (2021) Notch1 mutation drives clonal expansion in normal esophageal epithelium but impairs tumor growth

Abby E (2023) Notch1 mutations drive clonal expansion in normal esophageal epithelium but impair tumor growth.

Acha-Sagredo A (2021) Clone competition as a mechanism to reduce tumor formation. in Developmental cell

Clarke M (2019) Automated Reasoning for Systems Biology and Medicine

Colom B (2020) Spatial competition shapes the dynamic mutational landscape of normal esophageal epithelium in Nature Genetics

Colom B (2021) Mutant clones in normal epithelium outcompete and eliminate emerging tumours. in Nature

Colom B (2021) Mutant clones in normal epithelium outcompete and eliminate emerging tumours.

De Dominici M (2021) Shedding Light on Mutant Clonal Dynamics and Cancer Risk in the Skin. in Cancer discovery

Related Projects

Project Reference	Relationship	Related To	Start	End	Award Value
MR/S000216/1			01/12/2018	31/12/2020	£344,486
MR/S000216/2	Transfer	MR/S000216/1	01/02/2021	28/02/2023	£143,109

Policy Influence
Further Funding
Research Databases and Models
Research Tools and Methods
Collaboration
Software and Technical Products
Engagement Activities


Description	A-level worksheet development with OCR
Geographic Reach	National
Policy Influence Type	Influenced training of practitioners or researchers
Impact	Computational science plays a major role in all branches of the natural sciences. Whilst mathematical models have long played a role in hard sciences, driving uptake of computational techniques, this has spread significantly over the last two decades to include the life sciences. This shift in the sciences reflects a wider change in society as computers and computational thinking become more embedded in all areas of life. Whilst there have been substantive shifts in recent years through efforts such as "Computing at schools" to encourage computer science in early years education, this has been predominantly limited to that discipline. Despite its value being recognised, and the requirement of examination boards for modelling approaches, "computational science" has been slower to be adopted due to lack of appropriate software and developed activites. Over the past QQ we have developed a body of work aimed at A-level students to introduce cancer modelling, based on research at MRC CU. At the heart of this work is the use of the open source BioModelAnalyzer tool (http://biomodelanalyzer.org/), a graphical tool for network model construction and analysis. We have presented demonstrations at the Big Biology day, the Cambridge Science Festival, and the MRC Festival of Science. Through connections made with individual teachers during the MRC Festival, we were approached by Richard Tateson (and later Andri Achilleos) of OCR and worked with OCR to develop A-level worksheets. These were first released in 2018 through OCR forums for teachers, and with a blog post made in early 2019 https://www.ocr.org.uk/blog/getting-started-with-computational-biology-in-cancer-research/. Over the past year this has been evaluated by both the Hall group with A-level work experience students and in local schools and separately by OCR, and as part of this work, the Hall group took part in the new schools hub at the Royal Society Exhibition (Summer 2019), demonstrating the tool for students and sharing worksheets with teachers. In response to the detailed evaluation results and broader feedback from OCR, the worksheets are currently undergoing rewriting in preparation for assessment for inclusion in the curriculum. If approved at the "PAG10" standard, these will be suitable for adoption by up to 60,000 A-level biology students a year in the UK.
URL	https://www.ocr.org.uk/blog/getting-started-with-computational-biology-in-cancer-research/


Description	Research Fellows Enhanced Research Expenses 2021
Amount	£168,282 (GBP)
Funding ID	RF\ERE\210188
Organisation	The Royal Society
Sector	Charity/Non Profit
Country	United Kingdom
Start	12/2021
End	03/2023


Description	University Research Fellowships Renewals 2019
Amount	£475,798 (GBP)
Funding ID	URF\R\191013
Organisation	The Royal Society
Sector	Charity/Non Profit
Country	United Kingdom
Start	03/2020
End	03/2023


Title	BioModelAnalyzer software for executable modelling
Description	BioModelAnalyzer is an open source software tool that enables users with a wide range of computational proficiency to engage in modelling of gene regulatory networks.
Type Of Material	Improvements to research infrastructure
Year Produced	2017
Provided To Others?	Yes
Impact	This tool has led to and enabled many publications, and been used in outreach activities
URL	https://biomodelanalyzer.org/


Title	Biomodelanalyzer software 2025
Description	This is a tool for the development and analysis of executable models of biological processes
Type Of Material	Improvements to research infrastructure
Year Produced	2017
Provided To Others?	Yes
Impact	Various publications, ongoing workshop with Wellcome
URL	http://biomodelanalyzer.org


Title	Darwinian shift software tool
Description	This tool generates complex null hypotheses for the analysis of sequencing data sets
Type Of Material	Improvements to research infrastructure
Year Produced	2022
Provided To Others?	Yes
Impact	This has led to two papers to date and is being integrated into other projects.
URL	https://github.com/michaelhall28/darwinian_shift


Title	BioModelAnalyzer
Description	BioModelAnalyzer is a tool for the development and analysis of executable models
Type Of Material	Computer model/algorithm
Year Produced	2017
Provided To Others?	Yes
Impact	This has been used in several different publications, public lectures and demonstrations, and undergraduate and post graduate teaching
URL	https://biomodelanalyzer.org/


Title	Darwinian shift
Description	Darwinian shift enables hypothesis testing of sequencing datasets by developing powerful null hypotheses
Type Of Material	Data analysis technique
Year Produced	2022
Provided To Others?	Yes
Impact	This has been published and is being reused in other tool development
URL	https://github.com/michaelhall28/darwinian_shift


Description	AstraZeneca collaboration
Organisation	AstraZeneca
Department	Research and Development AstraZeneca
Country	United Kingdom
Sector	Private
PI Contribution	Our team analysed and visualised metabolomics datasets, making new discoveries.
Collaborator Contribution	AstraZeneca supplied metabolomics data used in the project and consulted on the analysis
Impact	This has led to the publication "Heterogeneity of the cancer cell line metabolic landscape" in Molecular Systems Biology, 2022
Start Year	2018


Description	Collaboration with Prof Phil Jones, Sanger
Organisation	The Wellcome Trust Sanger Institute
Country	United Kingdom
Sector	Charity/Non Profit
PI Contribution	Working with Prof Jones, my group and I have developed computational models of the aging tissue and provided statistical insights
Collaborator Contribution	Prof Jones led the generation of experimental and clinical data
Impact	Multiple publications, listed elsewhere
Start Year	2014


Description	Executable modelling of cancer metabolic networks
Organisation	AstraZeneca
Department	Research and Development AstraZeneca
Country	United Kingdom
Sector	Private
PI Contribution	Developing network models and analysing large datasets of metabolic changes in cancer
Collaborator Contribution	Shared datasets, expertise, and training of members of my lab.
Impact	This is a multidisciplinary project, involving mathematical modelling, data science, and experimental techniques in cancer metabolism
Start Year	2018


Description	Executable modelling of cancer metabolic networks
Organisation	University of Cambridge
Department	MRC Cancer Unit
Country	United Kingdom
Sector	Academic/University
PI Contribution	Developing network models and analysing large datasets of metabolic changes in cancer
Collaborator Contribution	Shared datasets, expertise, and training of members of my lab.
Impact	This is a multidisciplinary project, involving mathematical modelling, data science, and experimental techniques in cancer metabolism
Start Year	2018


Description	Large scale analysis of gene regulatory networks
Organisation	University of Toulouse
Country	France
Sector	Academic/University
PI Contribution	My team develop and maintain the BioModelAnalyzer tool, used in this project
Collaborator Contribution	The partners have developed large scale models and contributed code to the BioModelAnalyzer, enabling the code to run on Unix systems
Impact	This has led to one publication (Zerrouk et al, 2024)
Start Year	2021


Description	Metabolic maps of cancer
Organisation	AstraZeneca
Country	United Kingdom
Sector	Private
PI Contribution	Analysis of cell line metabolomics data
Collaborator Contribution	Generation of metabolomics data
Impact	None at present
Start Year	2018


Description	Modelling stromal function in tumour draining lymph nodes
Organisation	Medical Research Council (MRC)
Department	MRC Cancer Unit
Country	United Kingdom
Sector	Academic/University
PI Contribution	This a collaboration between computational biologists and wet lab experimentalists. Experiments yielding large amounts of data were performed by the Shields Lab and analysed by us. This data was then used as a basis for modelling systems, where the predictions were functionally verified by members of the Shields lab.
Collaborator Contribution	This a collaboration between computational biologists and wet lab experimentalists. Experiments yielding large amounts of data were performed by the Shields Lab and analysed by us. This data was then used as a basis for modelling systems, where the predictions were functionally verified by members of the Shields lab.
Impact	Riedel et al Nature Immunology 2016. This is a multidisciplinary collaboration crossing wet lab and in silica approaches
Start Year	2015


Title	BioModelAnalyzer
Description	Software for the construction and analysis of executable models
Type Of Technology	Software
Year Produced	2019
Open Source License?	Yes
Impact	Added features that enable model and motif construction through the introduction of selection and copy tools
URL	http://biomodelanalyzer.org/


Title	BioModelAnalyzer
Description	Software for the development and analysis of executable models
Type Of Technology	Software
Year Produced	2020
Open Source License?	Yes
Impact	Design refinements and unification of model and motif library, easing the use and reuse of models
URL	http://biomodelanalyzer.org/


Title	BioModelAnalyzer (2021
Description	BioModelAnalyzre is a tool for the construction and analysis of executable models of biological systems
Type Of Technology	Software
Year Produced	2021
Open Source License?	Yes
Impact	Taught at workshop, masters courses, and used heavily in my research
URL	https://biomodelanalyzer.org/


Title	CaSQ
Description	Software tool for converting formats for describing network maps to models automatically. I added support for BioModelAnalyzer export.
Type Of Technology	Software
Year Produced	2021
Open Source License?	Yes
Impact	Will be adopted in the near future by students from collaborators groups.
URL	https://github.com/soli/casq


Title	Darwinian shift
Description	Darwinian shift enables construction of null hypotheses for analysing sequencing datasets
Type Of Technology	Software
Year Produced	2022
Open Source License?	Yes
Impact	Several publications have used this
URL	https://github.com/michaelhall28/darwinian_shift


Title	OmnipathR
Description	Tool for the extraction of networks from the omnipath database of gene interactions. I added BMA motif export support.
Type Of Technology	Software
Year Produced	2019
Open Source License?	Yes
Impact	Aids adoption in the community
URL	https://bioconductor.org/packages/OmnipathR


Description	Lectures on teaching modelling cancer biology
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	National
Primary Audience	Schools
Results and Impact	Spoke at two Wellcome Connecting Science events to teachers (A-level and T-level) about how to model cancer development.
Year(s) Of Engagement Activity	2023
URL	https://www.stem.org.uk/cpd/ondemand/519920/cutting-edge-science-genomics-and-artificial-intelligenc...


Description	School practical session (Perse)
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	Local
Primary Audience	Schools
Results and Impact	Around 20 A-level students attended a short talk and a practical session run by myself using the BioModelAnalyzer. Students enjoyed the practical and gave extensive feedback which will be included in future updates.
Year(s) Of Engagement Activity	2019
URL	https://www.instagram.com/p/B3uJGNJpToD/