Origins and impacts of regulatory mutations

Lead Research Organisation: University of Edinburgh
Department Name: UNLISTED

Abstract

We study how alterations to DNA called mutations cause birth defects and cancers, based upon new computational analyses of large DNA sequencing datasets. Overall, the challenge is to find the small minority of mutations among many that affect gene functions and cause disease.
Mutations in sperm, eggs or the early embryo can cause a huge range of rare genetic disorders in children, but the mutations underlying most cases are unknown. We aim to understand the processes that lead to these mutations as parents age, to find the few mutations that really matter, and help clinicians to diagnose cases.
A tumour can accumulate many thousands of mutations in complex overlapping patterns, and we work to understand how combinations of mutations cooperate to drive cancer growth. This helps to match cancer patients to the most effective treatments, and may suggest new treatments in under-studied tumour types.
Our work links diverse researchers, NHS clinical staff and industry in multidisciplinary research that generates important new data as well as extracting new insights from existing data. Our projects are designed to reveal fundamental new knowledge about human disease, but also to open new possibilities in current medical practice.

Technical Summary

Mutagenic processes give rise to a diverse array of mutations, from short variants affecting a single base-pair to structural variants altering multi-megabase regions. These processes are often biased to preferentially accumulate mutations in particular regions, including regulatory sites that direct gene expression. We and others have shown that these sites can incur elevated rates of mutation in germline and tumour cells, impacting gene regulation. Human genome structure itself is now known to be surprisingly labile in normal and tumour cells, though the origins and impacts of structural variants remain under studied. However, recent technological and algorithmic developments mean that a full understanding of the interacting roles of all variants underlying a disease state is within reach.
The overarching aim of this programme is to characterise the roles of mutational processes in human development and disease. We study genomic instability in the germline, to understand how mutational processes have shaped human gene regulation and may disrupt development. This complements our development of new algorithms to prioritise de novo mutations as candidate variants underlying developmental disorders. We also study mutation in structurally unstable tumours, such as high grade serous ovarian cancer and glioblastoma, to understand the roles of complex somatic mutation patterns in gene regulation and discover new biomarkers of cancer patient outcomes. Recurring themes include the effects of noncoding variation on gene expression, and the combinatorial effects of mutations on phenotypes. We generate and analyse a wide range of large human cohort sequencing datasets, linking diverse HGU and IGC research groups in multidisciplinary projects, in line with the HGU Genome Interpretation research theme. We form novel hypotheses that are tested with the aid of model systems and established HGU/IGC collaborations, as well as the excellent environment for bioinformatics we have established with others at the HGU. Our strong focus on computational biology provides the HGU with the specialised expertise required for big data handling, analysis based upon high performance computing, clinical data security and statistically rigorous science.
 
Description FANTOM6 
Organisation RIKEN
Country Japan 
Sector Public 
PI Contribution Analysis of RNA sequencing data
Collaborator Contribution Production of RNA sequencing data
Impact Multidisciplinary: molecular biology, bioinformatics
Start Year 2015
 
Description Liver Cancer Evolution Consortium 
Organisation EMBL European Bioinformatics Institute (EMBL - EBI)
Country United Kingdom 
Sector Academic/University 
PI Contribution Computational analysis of genomic, epigenomic and transcriptomic data
Collaborator Contribution Generation and analysis of genomic, epigenomic and transcriptomic data; pathology; funding; supervision/management
Impact This collaboration is multidisciplinary, involving experimental and computational biologists as well as pathologists.
Start Year 2017
 
Description Liver Cancer Evolution Consortium 
Organisation German Cancer Research Center
Country Germany 
Sector Academic/University 
PI Contribution Computational analysis of genomic, epigenomic and transcriptomic data
Collaborator Contribution Generation and analysis of genomic, epigenomic and transcriptomic data; pathology; funding; supervision/management
Impact This collaboration is multidisciplinary, involving experimental and computational biologists as well as pathologists.
Start Year 2017
 
Description Liver Cancer Evolution Consortium 
Organisation Institute for Research in Biomedicine (IRB)
Country Spain 
Sector Academic/University 
PI Contribution Computational analysis of genomic, epigenomic and transcriptomic data
Collaborator Contribution Generation and analysis of genomic, epigenomic and transcriptomic data; pathology; funding; supervision/management
Impact This collaboration is multidisciplinary, involving experimental and computational biologists as well as pathologists.
Start Year 2017
 
Description Liver Cancer Evolution Consortium 
Organisation University of Cambridge
Country United Kingdom 
Sector Academic/University 
PI Contribution Computational analysis of genomic, epigenomic and transcriptomic data
Collaborator Contribution Generation and analysis of genomic, epigenomic and transcriptomic data; pathology; funding; supervision/management
Impact This collaboration is multidisciplinary, involving experimental and computational biologists as well as pathologists.
Start Year 2017
 
Description NHS/MRC HGU Genomic Data Analysis Centre 
Organisation NHS Lothian
Country United Kingdom 
Sector Public 
PI Contribution We provide bioinformatics analysis services to the NHS Lothian Molecular Genetics service - which is the main centre for genetic testing in SE Scotland. The aim is to provide candidate variants for the diagnosis of developmental disorders based upon trio whole exome sequencing data.
Collaborator Contribution Our NHS collaborators collect relevant samples and generate trio whole exome sequencing data.
Impact This collaboration is multidisciplinary and involves NHS clinical staff, NHS laboratory staff and MRC HGU bioinformatics staff.
Start Year 2020
 
Description PREDICT-Meso 
Organisation Beatson Institute for Cancer Research
Country United Kingdom 
Sector Academic/University 
PI Contribution Collection and re-analysis of previously published WES/WGS/RNA-seq data for mesothelioma tumour samples
Collaborator Contribution Generation of novel sequencing and other data for pre-malignant mesothelioma samples
Impact Multidisciplinary: genomics, transcriptomics, proteomics, clinical oncology, bioinformatics
Start Year 2023
 
Description PREDICT-Meso 
Organisation University of Glasgow
Department School of Medicine Glasgow
Country United Kingdom 
Sector Academic/University 
PI Contribution Collection and re-analysis of previously published WES/WGS/RNA-seq data for mesothelioma tumour samples
Collaborator Contribution Generation of novel sequencing and other data for pre-malignant mesothelioma samples
Impact Multidisciplinary: genomics, transcriptomics, proteomics, clinical oncology, bioinformatics
Start Year 2023