Deciphering and targeting cancer metabolism using executable models

Lead Research Organisation: University College London
Department Name: Medical Physics and Biomedical Eng

Abstract

The progression of cancer is driven by mutations and errors in the cell that promote the "hallmarks of cancer". These are a set of distinct behaviours that cells in tumours possess, enabling the survival and growth of the cancer. A longstanding observation in cancer cells is the Warburg effect; the switch away from oxidative phosphorylation to anaerobic glycolysis. Recent evidence has shown that metabolic reprogramming- large scale modification of how the cells process energy- is achieved by mutations in oncogenes including KRAS. This enables cancer cell survival in the tumour. Furthermore mutations in individual metabolic enzymes, such as fumarate hydratase, can affect a transition from epithelial to a mesenchymal morphology. Both the links between individual metabolites and cell behaviour, and the role of the metabolic network in cancer development however remains unclear. Knowledge of both of these are necessary to interpret metabolic changes in cancer and to identify new drug targets that are robust to network effects.

Computational modelling offers the opportunity to formalise the relationships between elements in the network and to address this issue. However, conventional approaches based on ordinary differential equations (ODEs) and flux balance analysis (FBA) are not suitable. ODE models require precise physico-chemical parameters that are not available for human metabolism. FBA does not have this requirement but is not capable of modelling the accumulation of metabolites that can occur in response to mutation.

I propose to use an "executable" modelling approach to construct models of the metabolic network and link them to cellular behaviour. Executable modelling approaches do not require detailed physical parameters, and can model accumulation events. They have the further advantage that they are amenable to a class of model analysis known as formal verification. Here mathematical proofs are used to offer guarantees of cell properties that are encoded in formal logic. These guarantees can apply over all possible states of the system and so offer a uniquely powerful way to test model correctness. This is particularly useful for highly robust systems such as the metabolic network, but also in taking account of rare events. This latter issue is particularly important when considering how rare events determine the development of cancer. Model building will be guided by the use of machine learning approaches (t-SNE, decision forests) to address publicly available expression data and metabolomic datasets shared with us by AstraZeneca. Finally, these models will be used to make novel predictions about the metabolism can change cell phenotype. The development of these models will allow us to understand how metabolic networks drive cancer progression and help us identify novel oncogenes and drug targets.

Technical Summary

The progression of cancer is associated with extensive metabolic reprogramming. A key feature of cancer is the Warburg effect, where cells switch from relying on oxidative phosphorylation to anaerobic glycolysis. Similarly, KRAS and BRAF mutations drive widescale reprogramming of metabolism, whilst the mutation of individual metabolic enzymes such as fumarate hydratase can promote cancer progression.

Computational models of cancer metabolism could elucidate the links between metabolism and cellular phenotype. Here we propose the construction of an executable model of cancer metabolism, guided by the use of machine learning applied to high throughput expression and metabolomic datasets and the existing literature on cancer metabolism. Executable models do not require the detailed physical parameterisation necessary for continuous models, such as those based on ordinary differential equations. They also allow more complex model behaviours to be described than flux balance analysis, including metabolite accumulation. Finally, they are amenable to analyses based on formal verification, where mathematical proofs are used to offer powerful guarantees of model behaviour by effectively searching all possible states of the model. The models and the new predictions that arise from the model construction will be validated through experiments performed with the Frezza group.

Planned Impact

This project will lead to demonstrable contributions within academia, industrial research, and wider society. Complex models of metabolism will be constructed using existing publicly available and privately shared high throughput datasets analysed using machine learning approaches, alongside the literature on cancer metabolism. These models will be tested using formal verification based analyses. The work will give rise to valuable insights into how metabolism can determine the cancer phenotype, and will be published in high impact papers and through conferences.

Within the academic community, the interdisciplinary nature of the work will support biologists working in metabolism, and computer scientists working on formal verification approaches applied to large models. The models that are generated will encompass large numbers of observations that have been made and as such can be used to test and validate experimental approaches before they are taken into the laboratory. Similarly, model building in the life sciences has generated new problems for existing approaches and algorithms, requiring the development of new techniques for analysis. As such, the construction of new, complex models will support algorithm developments, at a minimum by extending the test sets with large, realistic models, or by requiring new research.

Through working with high throughput datasets, supplied in part by AstraZeneca, we will be able to apply our models to relevant data on the metabolic behaviours of different cell lines. This will give new insights into a large body of experimental data covering multiple cell lines under constant conditions. This in turn will be used to inform drug development within the company, and will lead to future collaborations based around the research.

In addition to impact from the direct results of the project, the work will have additional impacts important to the wider UK. One post-doctoral research associate will be trained over the course of the fellowship in the building and analysis of executable models of biological processes, and the development of formal specifications describing biological processes. Current models of metabolism have been used in outreach activities (including demonstrations for school parties) and will be used to demonstrate to the wider public how computational methods aid scientific understanding of cancer and wider biology. This includes the development of worksheets for A-level students currently under discussion with the OCR.
 
Description Research Fellows Enhanced Research Expenses 2021
Amount £168,282 (GBP)
Funding ID RF\ERE\210188 
Organisation The Royal Society 
Sector Charity/Non Profit
Country United Kingdom
Start 12/2021 
End 03/2023
 
Title BioModelAnalyzer software for executable modelling 
Description BioModelAnalyzer is an open source software tool that enables users with a wide range of computational proficiency to engage in modelling of gene regulatory networks. 
Type Of Material Improvements to research infrastructure 
Year Produced 2017 
Provided To Others? Yes  
Impact This tool has led to and enabled many publications, and been used in outreach activities 
URL https://biomodelanalyzer.org/
 
Title Darwinian shift software tool 
Description This tool generates complex null hypotheses for the analysis of sequencing data sets 
Type Of Material Improvements to research infrastructure 
Year Produced 2022 
Provided To Others? Yes  
Impact This has led to two papers to date and is being integrated into other projects. 
URL https://github.com/michaelhall28/darwinian_shift
 
Title BioModelAnalyzer 
Description BioModelAnalyzer is a tool for the development and analysis of executable models 
Type Of Material Computer model/algorithm 
Year Produced 2017 
Provided To Others? Yes  
Impact This has been used in several different publications, public lectures and demonstrations, and undergraduate and post graduate teaching 
URL https://biomodelanalyzer.org/
 
Title Darwinian shift 
Description Darwinian shift enables hypothesis testing of sequencing datasets by developing powerful null hypotheses 
Type Of Material Data analysis technique 
Year Produced 2022 
Provided To Others? Yes  
Impact This has been published and is being reused in other tool development 
URL https://github.com/michaelhall28/darwinian_shift
 
Description AstraZeneca collaboration 
Organisation AstraZeneca
Department Research and Development AstraZeneca
Country United Kingdom 
Sector Private 
PI Contribution Our team analysed and visualised metabolomics datasets, making new discoveries.
Collaborator Contribution AstraZeneca supplied metabolomics data used in the project and consulted on the analysis
Impact This has led to the publication "Heterogeneity of the cancer cell line metabolic landscape" in Molecular Systems Biology, 2022
Start Year 2018
 
Description Large scale analysis of gene regulatory networks 
Organisation University of Toulouse
Country France 
Sector Academic/University 
PI Contribution My team develop and maintain the BioModelAnalyzer tool, used in this project
Collaborator Contribution The partners have developed large scale models and contributed code to the BioModelAnalyzer, enabling the code to run on Unix systems
Impact This has led to one publication (Zerrouk et al, 2024)
Start Year 2021
 
Title BioModelAnalyzer (2021 
Description BioModelAnalyzre is a tool for the construction and analysis of executable models of biological systems 
Type Of Technology Software 
Year Produced 2021 
Open Source License? Yes  
Impact Taught at workshop, masters courses, and used heavily in my research 
URL https://biomodelanalyzer.org/
 
Title CaSQ 
Description Software tool for converting formats for describing network maps to models automatically. I added support for BioModelAnalyzer export. 
Type Of Technology Software 
Year Produced 2021 
Open Source License? Yes  
Impact Will be adopted in the near future by students from collaborators groups. 
URL https://github.com/soli/casq
 
Title Darwinian shift 
Description Darwinian shift enables construction of null hypotheses for analysing sequencing datasets 
Type Of Technology Software 
Year Produced 2022 
Open Source License? Yes  
Impact Several publications have used this 
URL https://github.com/michaelhall28/darwinian_shift
 
Description Lectures on teaching modelling cancer biology 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Schools
Results and Impact Spoke at two Wellcome Connecting Science events to teachers (A-level and T-level) about how to model cancer development.
Year(s) Of Engagement Activity 2023
URL https://www.stem.org.uk/cpd/ondemand/519920/cutting-edge-science-genomics-and-artificial-intelligenc...