Deciphering and targeting cancer metabolism using executable models
Lead Research Organisation:
University College London
Department Name: Medical Physics and Biomedical Eng
Abstract
The progression of cancer is driven by mutations and errors in the cell that promote the "hallmarks of cancer". These are a set of distinct behaviours that cells in tumours possess, enabling the survival and growth of the cancer. A longstanding observation in cancer cells is the Warburg effect; the switch away from oxidative phosphorylation to anaerobic glycolysis. Recent evidence has shown that metabolic reprogramming- large scale modification of how the cells process energy- is achieved by mutations in oncogenes including KRAS. This enables cancer cell survival in the tumour. Furthermore mutations in individual metabolic enzymes, such as fumarate hydratase, can affect a transition from epithelial to a mesenchymal morphology. Both the links between individual metabolites and cell behaviour, and the role of the metabolic network in cancer development however remains unclear. Knowledge of both of these are necessary to interpret metabolic changes in cancer and to identify new drug targets that are robust to network effects.
Computational modelling offers the opportunity to formalise the relationships between elements in the network and to address this issue. However, conventional approaches based on ordinary differential equations (ODEs) and flux balance analysis (FBA) are not suitable. ODE models require precise physico-chemical parameters that are not available for human metabolism. FBA does not have this requirement but is not capable of modelling the accumulation of metabolites that can occur in response to mutation.
I propose to use an "executable" modelling approach to construct models of the metabolic network and link them to cellular behaviour. Executable modelling approaches do not require detailed physical parameters, and can model accumulation events. They have the further advantage that they are amenable to a class of model analysis known as formal verification. Here mathematical proofs are used to offer guarantees of cell properties that are encoded in formal logic. These guarantees can apply over all possible states of the system and so offer a uniquely powerful way to test model correctness. This is particularly useful for highly robust systems such as the metabolic network, but also in taking account of rare events. This latter issue is particularly important when considering how rare events determine the development of cancer. Model building will be guided by the use of machine learning approaches (t-SNE, decision forests) to address publicly available expression data and metabolomic datasets shared with us by AstraZeneca. Finally, these models will be used to make novel predictions about the metabolism can change cell phenotype. The development of these models will allow us to understand how metabolic networks drive cancer progression and help us identify novel oncogenes and drug targets.
Computational modelling offers the opportunity to formalise the relationships between elements in the network and to address this issue. However, conventional approaches based on ordinary differential equations (ODEs) and flux balance analysis (FBA) are not suitable. ODE models require precise physico-chemical parameters that are not available for human metabolism. FBA does not have this requirement but is not capable of modelling the accumulation of metabolites that can occur in response to mutation.
I propose to use an "executable" modelling approach to construct models of the metabolic network and link them to cellular behaviour. Executable modelling approaches do not require detailed physical parameters, and can model accumulation events. They have the further advantage that they are amenable to a class of model analysis known as formal verification. Here mathematical proofs are used to offer guarantees of cell properties that are encoded in formal logic. These guarantees can apply over all possible states of the system and so offer a uniquely powerful way to test model correctness. This is particularly useful for highly robust systems such as the metabolic network, but also in taking account of rare events. This latter issue is particularly important when considering how rare events determine the development of cancer. Model building will be guided by the use of machine learning approaches (t-SNE, decision forests) to address publicly available expression data and metabolomic datasets shared with us by AstraZeneca. Finally, these models will be used to make novel predictions about the metabolism can change cell phenotype. The development of these models will allow us to understand how metabolic networks drive cancer progression and help us identify novel oncogenes and drug targets.
Technical Summary
The progression of cancer is associated with extensive metabolic reprogramming. A key feature of cancer is the Warburg effect, where cells switch from relying on oxidative phosphorylation to anaerobic glycolysis. Similarly, KRAS and BRAF mutations drive widescale reprogramming of metabolism, whilst the mutation of individual metabolic enzymes such as fumarate hydratase can promote cancer progression.
Computational models of cancer metabolism could elucidate the links between metabolism and cellular phenotype. Here we propose the construction of an executable model of cancer metabolism, guided by the use of machine learning applied to high throughput expression and metabolomic datasets and the existing literature on cancer metabolism. Executable models do not require the detailed physical parameterisation necessary for continuous models, such as those based on ordinary differential equations. They also allow more complex model behaviours to be described than flux balance analysis, including metabolite accumulation. Finally, they are amenable to analyses based on formal verification, where mathematical proofs are used to offer powerful guarantees of model behaviour by effectively searching all possible states of the model. The models and the new predictions that arise from the model construction will be validated through experiments performed with the Frezza group.
Computational models of cancer metabolism could elucidate the links between metabolism and cellular phenotype. Here we propose the construction of an executable model of cancer metabolism, guided by the use of machine learning applied to high throughput expression and metabolomic datasets and the existing literature on cancer metabolism. Executable models do not require the detailed physical parameterisation necessary for continuous models, such as those based on ordinary differential equations. They also allow more complex model behaviours to be described than flux balance analysis, including metabolite accumulation. Finally, they are amenable to analyses based on formal verification, where mathematical proofs are used to offer powerful guarantees of model behaviour by effectively searching all possible states of the model. The models and the new predictions that arise from the model construction will be validated through experiments performed with the Frezza group.
Planned Impact
This project will lead to demonstrable contributions within academia, industrial research, and wider society. Complex models of metabolism will be constructed using existing publicly available and privately shared high throughput datasets analysed using machine learning approaches, alongside the literature on cancer metabolism. These models will be tested using formal verification based analyses. The work will give rise to valuable insights into how metabolism can determine the cancer phenotype, and will be published in high impact papers and through conferences.
Within the academic community, the interdisciplinary nature of the work will support biologists working in metabolism, and computer scientists working on formal verification approaches applied to large models. The models that are generated will encompass large numbers of observations that have been made and as such can be used to test and validate experimental approaches before they are taken into the laboratory. Similarly, model building in the life sciences has generated new problems for existing approaches and algorithms, requiring the development of new techniques for analysis. As such, the construction of new, complex models will support algorithm developments, at a minimum by extending the test sets with large, realistic models, or by requiring new research.
Through working with high throughput datasets, supplied in part by AstraZeneca, we will be able to apply our models to relevant data on the metabolic behaviours of different cell lines. This will give new insights into a large body of experimental data covering multiple cell lines under constant conditions. This in turn will be used to inform drug development within the company, and will lead to future collaborations based around the research.
In addition to impact from the direct results of the project, the work will have additional impacts important to the wider UK. One post-doctoral research associate will be trained over the course of the fellowship in the building and analysis of executable models of biological processes, and the development of formal specifications describing biological processes. Current models of metabolism have been used in outreach activities (including demonstrations for school parties) and will be used to demonstrate to the wider public how computational methods aid scientific understanding of cancer and wider biology. This includes the development of worksheets for A-level students currently under discussion with the OCR.
Within the academic community, the interdisciplinary nature of the work will support biologists working in metabolism, and computer scientists working on formal verification approaches applied to large models. The models that are generated will encompass large numbers of observations that have been made and as such can be used to test and validate experimental approaches before they are taken into the laboratory. Similarly, model building in the life sciences has generated new problems for existing approaches and algorithms, requiring the development of new techniques for analysis. As such, the construction of new, complex models will support algorithm developments, at a minimum by extending the test sets with large, realistic models, or by requiring new research.
Through working with high throughput datasets, supplied in part by AstraZeneca, we will be able to apply our models to relevant data on the metabolic behaviours of different cell lines. This will give new insights into a large body of experimental data covering multiple cell lines under constant conditions. This in turn will be used to inform drug development within the company, and will lead to future collaborations based around the research.
In addition to impact from the direct results of the project, the work will have additional impacts important to the wider UK. One post-doctoral research associate will be trained over the course of the fellowship in the building and analysis of executable models of biological processes, and the development of formal specifications describing biological processes. Current models of metabolism have been used in outreach activities (including demonstrations for school parties) and will be used to demonstrate to the wider public how computational methods aid scientific understanding of cancer and wider biology. This includes the development of worksheets for A-level students currently under discussion with the OCR.
Publications
Abby E
(2023)
Notch1 mutations drive clonal expansion in normal esophageal epithelium but impair tumor growth.
in Nature genetics
Colom B
(2021)
Mutant clones in normal epithelium outcompete and eliminate emerging tumours.
in Nature
Foley K
(2024)
SMAD4 and KCNQ3 alterations are associated with lymph node metastases in oesophageal adenocarcinoma.
in Biochimica et biophysica acta. Molecular basis of disease
Hall B
(2021)
Data integration in logic-based models of biological mechanisms
in Current Opinion in Systems Biology
Hall M
(2023)
Mutations observed in somatic evolution reveal underlying gene mechanisms
in Communications Biology
Description | Research Fellows Enhanced Research Expenses 2021 |
Amount | £168,282 (GBP) |
Funding ID | RF\ERE\210188 |
Organisation | The Royal Society |
Sector | Charity/Non Profit |
Country | United Kingdom |
Start | 12/2021 |
End | 03/2023 |
Title | BioModelAnalyzer software for executable modelling |
Description | BioModelAnalyzer is an open source software tool that enables users with a wide range of computational proficiency to engage in modelling of gene regulatory networks. |
Type Of Material | Improvements to research infrastructure |
Year Produced | 2017 |
Provided To Others? | Yes |
Impact | This tool has led to and enabled many publications, and been used in outreach activities |
URL | https://biomodelanalyzer.org/ |
Title | Darwinian shift software tool |
Description | This tool generates complex null hypotheses for the analysis of sequencing data sets |
Type Of Material | Improvements to research infrastructure |
Year Produced | 2022 |
Provided To Others? | Yes |
Impact | This has led to two papers to date and is being integrated into other projects. |
URL | https://github.com/michaelhall28/darwinian_shift |
Title | BioModelAnalyzer |
Description | BioModelAnalyzer is a tool for the development and analysis of executable models |
Type Of Material | Computer model/algorithm |
Year Produced | 2017 |
Provided To Others? | Yes |
Impact | This has been used in several different publications, public lectures and demonstrations, and undergraduate and post graduate teaching |
URL | https://biomodelanalyzer.org/ |
Title | Darwinian shift |
Description | Darwinian shift enables hypothesis testing of sequencing datasets by developing powerful null hypotheses |
Type Of Material | Data analysis technique |
Year Produced | 2022 |
Provided To Others? | Yes |
Impact | This has been published and is being reused in other tool development |
URL | https://github.com/michaelhall28/darwinian_shift |
Description | AstraZeneca collaboration |
Organisation | AstraZeneca |
Department | Research and Development AstraZeneca |
Country | United Kingdom |
Sector | Private |
PI Contribution | Our team analysed and visualised metabolomics datasets, making new discoveries. |
Collaborator Contribution | AstraZeneca supplied metabolomics data used in the project and consulted on the analysis |
Impact | This has led to the publication "Heterogeneity of the cancer cell line metabolic landscape" in Molecular Systems Biology, 2022 |
Start Year | 2018 |
Description | Large scale analysis of gene regulatory networks |
Organisation | University of Toulouse |
Country | France |
Sector | Academic/University |
PI Contribution | My team develop and maintain the BioModelAnalyzer tool, used in this project |
Collaborator Contribution | The partners have developed large scale models and contributed code to the BioModelAnalyzer, enabling the code to run on Unix systems |
Impact | This has led to one publication (Zerrouk et al, 2024) |
Start Year | 2021 |
Title | BioModelAnalyzer (2021 |
Description | BioModelAnalyzre is a tool for the construction and analysis of executable models of biological systems |
Type Of Technology | Software |
Year Produced | 2021 |
Open Source License? | Yes |
Impact | Taught at workshop, masters courses, and used heavily in my research |
URL | https://biomodelanalyzer.org/ |
Title | CaSQ |
Description | Software tool for converting formats for describing network maps to models automatically. I added support for BioModelAnalyzer export. |
Type Of Technology | Software |
Year Produced | 2021 |
Open Source License? | Yes |
Impact | Will be adopted in the near future by students from collaborators groups. |
URL | https://github.com/soli/casq |
Title | Darwinian shift |
Description | Darwinian shift enables construction of null hypotheses for analysing sequencing datasets |
Type Of Technology | Software |
Year Produced | 2022 |
Open Source License? | Yes |
Impact | Several publications have used this |
URL | https://github.com/michaelhall28/darwinian_shift |
Description | Lectures on teaching modelling cancer biology |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Schools |
Results and Impact | Spoke at two Wellcome Connecting Science events to teachers (A-level and T-level) about how to model cancer development. |
Year(s) Of Engagement Activity | 2023 |
URL | https://www.stem.org.uk/cpd/ondemand/519920/cutting-edge-science-genomics-and-artificial-intelligenc... |