Mutagenesis and its Biomedical Impact
Lead Research Organisation:
University of Edinburgh
Department Name: UNLISTED
Abstract
Of the many thousands of DNA differences between individuals, only a minority have important contributions to disease risk or other traits that differ between them.
Finding those rare consequential differences is the basis for genetic diagnosis and predicting traits like height and drug response. It is also often the first step in understanding the basis of a disease at a molecular level.
As a community we can only efficiently find the consequential differences if they directly alter an encoded protein, the vast majority don’t. We are developing ways to understand the consequences of the majority of DNA changes regardless of whether they alter a protein.
Our approaches are often based on studying how large numbers of DNA sequence differences are inherited through generations of the human population. It allows us to tease apart two patterns, the pattern of new mutations and the pattern shaped by the health of people that carried those mutations.
The pattern of new mutations tells us about the underlying biology of the DNA, how it is replicated and repaired as well as showing how likely a piece of DNA is likely to be disrupted by a new mutation. The second pattern is what tells us if a type of DNA change is likely to have a consequence for human health. Although described here in the context of inherited DNA differences, we apply similar approaches to interpret the new mutations that arise in and drive the development of cancer.
Finding those rare consequential differences is the basis for genetic diagnosis and predicting traits like height and drug response. It is also often the first step in understanding the basis of a disease at a molecular level.
As a community we can only efficiently find the consequential differences if they directly alter an encoded protein, the vast majority don’t. We are developing ways to understand the consequences of the majority of DNA changes regardless of whether they alter a protein.
Our approaches are often based on studying how large numbers of DNA sequence differences are inherited through generations of the human population. It allows us to tease apart two patterns, the pattern of new mutations and the pattern shaped by the health of people that carried those mutations.
The pattern of new mutations tells us about the underlying biology of the DNA, how it is replicated and repaired as well as showing how likely a piece of DNA is likely to be disrupted by a new mutation. The second pattern is what tells us if a type of DNA change is likely to have a consequence for human health. Although described here in the context of inherited DNA differences, we apply similar approaches to interpret the new mutations that arise in and drive the development of cancer.
Technical Summary
Discriminating mutations of medical and functional importance from the many more that are of negligible biological consequence is a major unmet challenge for genomic medicine. This problem is particularly acute for sequence changes in the non-protein-coding majority of the genome. Solving this is a key component of realizing the MRC’s objective “to use genetics, imaging and biological indicators to understand predispositions to disease”. We will help address this problem by disentangling the mutually confounding patterns of mutation and selection, both of which if separately resolved can give insight into the biology of the genome and the phenotypic consequences of a DNA sequence change. Our three main aims are to: 1. Reveal the fine-scale mutation landscape of the genome and identify the processes that shape it. 2. Understand the molecular consequences of mutation at regulatory sites in the non-protein-coding genome. 3. Relate molecular phenotypes to organism biology through improved measures of selection that are applicable to the non-protein-coding genome.
Our approach is predominately computational, applying statistical analyses to somatic (cancer), population and between species variation obtained from genome sequencing studies. These data are intersected with transcription and chromatin state measurements and key missing data generated by collaborators such as the international FANTOM consortium, or ourselves in which case may involve obtaining and processing primary human tissue samples. Specific hypotheses generated from our initial analyses, such as mutagenic processes are explored experimentally using yeast or in vitro systems where tractable.
Our approach is predominately computational, applying statistical analyses to somatic (cancer), population and between species variation obtained from genome sequencing studies. These data are intersected with transcription and chromatin state measurements and key missing data generated by collaborators such as the international FANTOM consortium, or ourselves in which case may involve obtaining and processing primary human tissue samples. Specific hypotheses generated from our initial analyses, such as mutagenic processes are explored experimentally using yeast or in vitro systems where tractable.
Organisations
- University of Edinburgh (Lead Research Organisation)
- Francis Crick Institute (Collaboration)
- Cancer Research UK Cambridge Institute (Collaboration)
- University College London (Collaboration)
- EMBL European Bioinformatics Institute (EMBL - EBI) (Collaboration)
- Congenica Ltd (Collaboration)
- RIKEN (Collaboration)
- Biodonostia Health Research Institute (Collaboration)
- German Cancer Research Center (Collaboration)
- Laboratory of the Government Chemist (LGC) Ltd (Collaboration)
- Medical Research Council (MRC) (Collaboration)
- Institute for Research in Biomedicine (IRB) (Collaboration)
Publications
Aitken S
(2019)
Pervasive lesion segregation shapes cancer genome evolution
Aitken S
(2020)
Pervasive lesion segregation shapes cancer genome evolution
Aitken SJ
(2020)
Pervasive lesion segregation shapes cancer genome evolution.
in Nature
Alam T
(2020)
Comparative transcriptomics of primary cells in vertebrates.
in Genome research
Anderson C
(2022)
Strand-resolved mutagenicity of DNA damage and repair
Anderson CJ
(2024)
Strand-resolved mutagenicity of DNA damage and repair.
in Nature
Halachev M
(2019)
Increased ultra-rare variant load in an isolated Scottish population impacts exonic and regulatory regions.
in PLoS genetics
James E
(2018)
Preterm birth and the timing of puberty: a systematic review.
in BMC pediatrics
Kaiser V
(2021)
Mutational bias in spermatogonia impacts the anatomy of regulatory sites in the human genome
in Genome Research
Related Projects
Project Reference | Relationship | Related To | Start | End | Award Value |
---|---|---|---|---|---|
MC_UU_00007/1 | 31/03/2018 | 30/03/2023 | £662,000 | ||
MC_UU_00007/2 | Transfer | MC_UU_00007/1 | 31/03/2018 | 30/03/2023 | £3,730,000 |
MC_UU_00007/3 | Transfer | MC_UU_00007/2 | 31/03/2018 | 30/05/2022 | £3,053,000 |
MC_UU_00007/4 | Transfer | MC_UU_00007/3 | 31/03/2018 | 30/03/2023 | £1,772,000 |
MC_UU_00007/5 | Transfer | MC_UU_00007/4 | 31/03/2018 | 30/03/2023 | £4,524,000 |
MC_UU_00007/6 | Transfer | MC_UU_00007/5 | 31/03/2018 | 30/03/2023 | £2,878,000 |
MC_UU_00007/7 | Transfer | MC_UU_00007/6 | 31/03/2018 | 30/03/2023 | £2,829,000 |
MC_UU_00007/8 | Transfer | MC_UU_00007/7 | 31/03/2018 | 31/12/2022 | £4,072,000 |
MC_UU_00007/9 | Transfer | MC_UU_00007/8 | 31/03/2018 | 30/03/2023 | £3,137,000 |
MC_UU_00007/10 | Transfer | MC_UU_00007/9 | 31/03/2018 | 30/03/2023 | £6,948,000 |
MC_UU_00007/11 | Transfer | MC_UU_00007/10 | 31/03/2018 | 30/03/2023 | £2,421,000 |
MC_UU_00007/12 | Transfer | MC_UU_00007/11 | 31/03/2018 | 30/03/2023 | £1,205,000 |
MC_UU_00007/13 | Transfer | MC_UU_00007/12 | 31/03/2018 | 30/03/2023 | £1,174,000 |
MC_UU_00007/14 | Transfer | MC_UU_00007/13 | 31/03/2018 | 30/03/2023 | £1,838,000 |
MC_UU_00007/15 | Transfer | MC_UU_00007/14 | 31/03/2018 | 30/03/2023 | £2,551,000 |
MC_UU_00007/16 | Transfer | MC_UU_00007/15 | 31/03/2018 | 30/03/2023 | £1,496,000 |
MC_UU_00007/17 | Transfer | MC_UU_00007/16 | 31/03/2018 | 30/03/2023 | £1,886,000 |
Description | 'ECAT-Plus' John Connelly - DNA damage as an engine of genetic diversification during cancer evolution |
Amount | £232,403 (GBP) |
Funding ID | 9897215 |
Organisation | Wellcome Trust |
Sector | Charity/Non Profit |
Country | United Kingdom |
Start | 07/2020 |
End | 07/2023 |
Description | ECAT Fellowship for Thomas Williams |
Amount | £233,584 (GBP) |
Funding ID | 204802/Z/16/Z |
Organisation | Wellcome Trust |
Sector | Charity/Non Profit |
Country | United Kingdom |
Start | 07/2016 |
End | 07/2019 |
Description | MRC EQUIP - WORLD CLASS LABS AWARD 2022/23 |
Amount | £394,059 (GBP) |
Funding ID | MC_PC_MR/X013677/1 |
Organisation | University of Edinburgh |
Sector | Academic/University |
Country | United Kingdom |
Start | 11/2022 |
End | 03/2023 |
Description | MRC Proximity to Discovery Fund |
Amount | £10,000 (GBP) |
Funding ID | Project36 |
Organisation | Medical Research Council (MRC) |
Sector | Public |
Country | United Kingdom |
Start | 08/2018 |
End | 12/2018 |
Title | Pervasive lesion segregation shapes cancer genome evolution |
Description | Cancers arise through the acquisition of oncogenic mutations and grow by clonal expansion. Here we reveal that most mutagenic DNA lesions are not resolved into a mutated DNA base pair within a single cell cycle. Instead, DNA lesions segregate, unrepaired, into daughter cells for multiple cell generations, resulting in the chromosome-scale phasing of subsequent mutations. We characterize this process in mutagen-induced mouse liver tumours and show that DNA replication across persisting lesions can produce multiple alternative alleles in successive cell divisions, thereby generating both multiallelic and combinatorial genetic diversity. The phasing of lesions enables accurate measurement of strand-biased repair processes, quantification of oncogenic selection and fine mapping of sister-chromatid-exchange events. Finally, we demonstrate that lesion segregation is a unifying property of exogenous mutagens, including UV light and chemotherapy agents in human cells and tumours, which has profound implications for the evolution and adaptation of cancer genomes. The data submitted here is analysis output based on primary data submitted to the EBI under the indicated accession PRJEB37808. |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
Impact | Downloaded 508 times to date. |
URL | https://datashare.ed.ac.uk/handle/10283/3742 |
Title | Study of mutation patterns from DEN mutagenesis in mouse liver. |
Description | Liver Cancer Evolution - Lesion segregation |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
Impact | Download and usage statistics currently unknown. |
URL | https://www.ebi.ac.uk/ena/browser/view/PRJEB37808 |
Description | Accurate measurement of nucleic acids |
Organisation | Laboratory of the Government Chemist (LGC) Ltd |
Country | United Kingdom |
Sector | Private |
PI Contribution | Development of novel methods to explore the quantisation of DNA methylation. |
Collaborator Contribution | Preparation and generation of known reference data. |
Impact | PMID:22841564 PMID:25539843 |
Start Year | 2012 |
Description | Detection of DNA embedded ribonucleotides in the mitochondrial genome |
Organisation | Biodonostia Health Research Institute |
Country | Spain |
Sector | Hospitals |
PI Contribution | Developed computational tools, co-developed original emRibo-seq methodology and performed analysis on generated data. |
Collaborator Contribution | Generation of genetic model mouse, preparation of tissues and high purity mitochondrial DNA from cells and tissues. Experimental perturbation of cultured cells. |
Impact | Publication: Moss et al, Nucleic Acids Research 2017 doi:10.1093/nar/gkx1009 |
Start Year | 2015 |
Description | Detection of DNA embedded ribonucleotides in the mitochondrial genome |
Organisation | Francis Crick Institute |
Country | United Kingdom |
Sector | Academic/University |
PI Contribution | Developed computational tools, co-developed original emRibo-seq methodology and performed analysis on generated data. |
Collaborator Contribution | Generation of genetic model mouse, preparation of tissues and high purity mitochondrial DNA from cells and tissues. Experimental perturbation of cultured cells. |
Impact | Publication: Moss et al, Nucleic Acids Research 2017 doi:10.1093/nar/gkx1009 |
Start Year | 2015 |
Description | Detection of DNA embedded ribonucleotides in the mitochondrial genome |
Organisation | University College London |
Country | United Kingdom |
Sector | Academic/University |
PI Contribution | Developed computational tools, co-developed original emRibo-seq methodology and performed analysis on generated data. |
Collaborator Contribution | Generation of genetic model mouse, preparation of tissues and high purity mitochondrial DNA from cells and tissues. Experimental perturbation of cultured cells. |
Impact | Publication: Moss et al, Nucleic Acids Research 2017 doi:10.1093/nar/gkx1009 |
Start Year | 2015 |
Description | FANTOM6 Consortium |
Organisation | RIKEN |
Department | Institute of Physical and Chemical Research (RIKEN) |
Country | Japan |
Sector | Public |
PI Contribution | Planning of large scale systematic study on lncRNA and their effect on gene regulation. Planning and initiating analysis of the resulting data. |
Collaborator Contribution | Planning, coordination and primary data generation. |
Impact | Project is ongoing - no impact yet. |
Start Year | 2015 |
Description | Liver Cancer Evolution Consortium |
Organisation | Cancer Research UK Cambridge Institute |
Country | United Kingdom |
Sector | Academic/University |
PI Contribution | Computational analysis of tumor whole genome and transcriptome sequence data to profile mutation patterns. |
Collaborator Contribution | Generation, histological profiling and whole genome and transcriptome sequencing of carcinogen induced tumors in rodents. |
Impact | No published outcomes yet, less that 1 year into project and data generation still under way. Scientific discovery & insight. Manuscript writing, project coordination. |
Start Year | 2017 |
Description | Liver Cancer Evolution Consortium |
Organisation | EMBL European Bioinformatics Institute (EMBL - EBI) |
Country | United Kingdom |
Sector | Academic/University |
PI Contribution | Computational analysis of tumor whole genome and transcriptome sequence data to profile mutation patterns. |
Collaborator Contribution | Generation, histological profiling and whole genome and transcriptome sequencing of carcinogen induced tumors in rodents. |
Impact | No published outcomes yet, less that 1 year into project and data generation still under way. Scientific discovery & insight. Manuscript writing, project coordination. |
Start Year | 2017 |
Description | Liver Cancer Evolution Consortium |
Organisation | German Cancer Research Center |
Country | Germany |
Sector | Academic/University |
PI Contribution | Computational analysis of tumor whole genome and transcriptome sequence data to profile mutation patterns. |
Collaborator Contribution | Generation, histological profiling and whole genome and transcriptome sequencing of carcinogen induced tumors in rodents. |
Impact | No published outcomes yet, less that 1 year into project and data generation still under way. Scientific discovery & insight. Manuscript writing, project coordination. |
Start Year | 2017 |
Description | Liver Cancer Evolution Consortium |
Organisation | Institute for Research in Biomedicine (IRB) |
Country | Spain |
Sector | Academic/University |
PI Contribution | Computational analysis of tumor whole genome and transcriptome sequence data to profile mutation patterns. |
Collaborator Contribution | Generation, histological profiling and whole genome and transcriptome sequencing of carcinogen induced tumors in rodents. |
Impact | No published outcomes yet, less that 1 year into project and data generation still under way. Scientific discovery & insight. Manuscript writing, project coordination. |
Start Year | 2017 |
Description | Prototyping genome analysis hub |
Organisation | Congenica Ltd |
Country | United Kingdom |
Sector | Academic/University |
PI Contribution | Developing computational systems that could serve as a national hub for computational analysis of clinical genomic data. Demonstrating a proof of principal system. |
Collaborator Contribution | Commercial partner providing clinical interpretation system for evaluation purposes. Edinburgh Parallel Compute Centre providing compute infrastructure and secure systems for the handling of sensitive data. |
Impact | MRC Proximity to Discovery grant awarded to fund pilot work and develop industry-academia partnership. Initiative further developed by Dr Alison Meynert, Professor David Fitzpatrick and colleagues from the MRC Human Genetics Unit to provide national scale Genomic Data Analysis Centre. |
Start Year | 2018 |
Description | Prototyping genome analysis hub |
Organisation | Medical Research Council (MRC) |
Country | United Kingdom |
Sector | Public |
PI Contribution | Developing computational systems that could serve as a national hub for computational analysis of clinical genomic data. Demonstrating a proof of principal system. |
Collaborator Contribution | Commercial partner providing clinical interpretation system for evaluation purposes. Edinburgh Parallel Compute Centre providing compute infrastructure and secure systems for the handling of sensitive data. |
Impact | MRC Proximity to Discovery grant awarded to fund pilot work and develop industry-academia partnership. Initiative further developed by Dr Alison Meynert, Professor David Fitzpatrick and colleagues from the MRC Human Genetics Unit to provide national scale Genomic Data Analysis Centre. |
Start Year | 2018 |
Description | Sarah Aitken |
Organisation | Medical Research Council (MRC) |
Department | MRC Toxicology Unit |
Country | United Kingdom |
Sector | Academic/University |
PI Contribution | Joint supervision of ECAT PhD student John Connelly. Machine learning based analysis of cancer histology whole slide images. |
Collaborator Contribution | Joint supervision of ECAT PhD student John Connelly. Qualified pathologist evaluation of cancer histology whole slide images. |
Impact | Multidisciplinary: Computer machine learning, mathematical modelling, histopathology, oncology, genomics. |
Start Year | 2021 |
Title | GenomeArtiFinder |
Description | GenomeArtiFinder is a collection of command line RScripts that use the variant allele frequency (VAF) distribution of germline heterozygous variants in matched tumour:normal samples to detect sample contamination and other abnormalities in sequencing data. |
Type Of Technology | Software |
Year Produced | 2020 |
Open Source License? | Yes |
Impact | Impact as yet is unknown. |
URL | https://git.ecdf.ed.ac.uk/taylor-lab/GenomeArtiFinder |
Description | Chelsea Flower Show |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Public/other audiences |
Results and Impact | Manned information stand & display on genetics including live demonstrations of genome sequencing. Stand staffed by 2-4 people continuously through the week long event. Engaged with >500 members of the public varying from a few seconds to 20min interaction with individuals and small groups. Many people highly engaged, both in the implications of genetic variation, the "rules" of inheritance and the wider implications of genetics. Though initially plant-genetics focused, discussions often ventured into implications for human biology and health. |
Year(s) Of Engagement Activity | 2019 |
URL | https://genetics.org.uk/news/centenary-garden-exhibit-at-chelsea-flower-show-2019/ |
Description | Conference organisation: European Association of Cancer Research - Bioinformatics |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Organised virtual conference (virtual) on behalf of EACR. Good feedback despite COVID necessitating a virtual-only meeting. |
Year(s) Of Engagement Activity | 2021 |
URL | https://www.eacr.org/conference/Bioinformatics2021virtual |
Description | Genetics Society Centenary |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Professional Practitioners |
Results and Impact | Organising the Genetics Society Centenary gathering for society members and relevant policy makers and influences in the field of genetics, both academia and industry. |
Year(s) Of Engagement Activity | 2019 |
URL | https://genetics.org.uk/centenary/birthday-celebration/ |
Description | International press release and coverage |
Form Of Engagement Activity | A press release, press conference or response to a media enquiry/interview |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Media (as a channel to the public) |
Results and Impact | Press release, educational video and Twitter campaign to promote major publication and it's human health implications - how damaged DNA promotes cancer development. Twitter reach was particularly broad with Altmetric estimated reach to over 3m Twitter users. https://nature.altmetric.com/details/84616909/twitter |
Year(s) Of Engagement Activity | 2020 |
URL | https://www.ed.ac.uk/igmm/news-and-events/news-2020/study-shows-how-chemicals-cause-cell-mutations |
Description | Negotiated and commissioned "Genetics Unzipped" podcast series |
Form Of Engagement Activity | Engagement focused website, blog or social media channel |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Public/other audiences |
Results and Impact | Over 4,500 unique subscribers to podcast channel to date on RSS, additional listeners and subscribers on Spotify and iTunes. Fortnightly podcasts showing steady growth episode on episode. |
Year(s) Of Engagement Activity | 2018,2019 |
URL | https://geneticsunzipped.com/ |
Description | Public engagement event at Edinburgh Botanic Gardens |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | Regional |
Primary Audience | Public/other audiences |
Results and Impact | Students from the lab Thomas Williams and Kathryn Jones presented a genetics workshop event open on a drop-in basis to the general public. This included supervised activities such as preparation of DNA from strawberries and live demonstration of DNA sequencing using ONT MinIon technology. This was a pilot event for a larger scale engagement activity at the Chelsea Flower Show and subsequent roadshow through Q2 2019. |
Year(s) Of Engagement Activity | 2018 |
URL | http://www.genetics.org.uk/centenary/centenary-garden-exhibit-at-chelsea-flower-show-2019/ |
Description | Royal Institution Christmas Lectures 2018 |
Form Of Engagement Activity | A broadcast e.g. TV/radio/film/podcast (other than news/press) |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Public/other audiences |
Results and Impact | Through my role as Honorary Treasurer of The Genetics Society I had a substantial roll in organising the theme "Who am I?" and logistics of the 2018 Christmas Lectures broadcast by the BBC and available online. The activity includes information packs sent out to schools and a series of internationally touring lectures. |
Year(s) Of Engagement Activity | 2018,2019 |
URL | https://www.rigb.org/christmas-lectures/2018-who-am-i |