Genetic variations in transposable elements: Germ line differences and somatic variations induced during neurogenesis
Lead Research Organisation:
MRC Harwell Institute
Department Name: UNLISTED
Abstract
The sequencing of human and other genomes has produced an overwhelming amount of data. One challenge is to understand why 98.94% of the genome does not seem to be coding for functional proteins. Another is why over half of the sequence contains "parasitic" Transposable Elements (TE) that can produce copies of themselves anywhere else in the genome. In recent years there has been a shift in perception concerning the effects of "parasitic" and "junk" DNAs on biology. This project aims to use the data produced by next-generation sequencing technologies to determine if and how TE affect the development of the brain, whether they contribute differences between individuals and exactly how commonly they copy themselves around the genome. The occurrence and expression of TE elements have been associated with several human diseases such as cancer, diabetes and schizophrenia. A fundamental understanding of their role in normal "healthy" processes is necessary before we can hope to understand how they contribute to disease.
Technical Summary
This proposed project aims to capitalise on the wealth of data emanating from the recent and rapid developments in next-generation DNA sequencing technologies. Transposable elements (TE) are a class of repetitive elements, which when intact, are capable of making retrocopies in the genome. Formerly considered "junk" or "parasitic" DNA these elements are suspected of being a source of non-coding regulation and phenotypic variation. TE have been associated with several human diseases such as cancer, haemophilia A, muscular dystrophy and schizophrenia.
The specific aims of this project are two-fold: (i) to identify somatic TE insertions causing alterations in expressed sequences between neuronal tissues within an individual ; (ii) to identify differences in TE insertions between mouse strains and possible association to phenotype; and, (iii) to identify TE insertion polymorphisms among 1000 individuals in the human population. The objective is to identify somatic variation arising from TE insertions occurring during neurogenesis; thereafter, I will determine if novel insertion events have preferentially occurred in neuronally active transcripts. I will achieve this by isolating RNA from cortical layers in mouse brain, by enriching sequence with TE-containing transcripts, and by performing paired-end deep sequencing with a Illumina Genome Analyzer. This will be done in collaboration with Dr. Elliott Margulies of NHGRI and Dr. Zoltan Molnar (University of Oxford).
The objective of the project focusing on TE insertion variants between mouse strain genetics is to investigate whether these variants contribute to phenotypic variation (collaboration with Dr. Jonathan Flint, Oxford University). Since the mouse strains that are currently being sequenced display a large range of phenotypes this provides a unique opportunity to determine the role of TE insertion events on trait variation. The aims of the human population genetics project are to map rare variant TE insertions in humans. A collaboration with Dr. Richard Durbin and Dr. David Adams of the Wellcome Trust Sanger Institute will allow me to gain access to, and receive advice about, efforts to sequence, assembly and analyse multiple human genomes. These world-class research groups are primary contributors to the 1000 Genome and the MRC-funded Mouse Strain Sequencing Projects, thereby providing an invaluable training opportunity.
With my background in neuroscience, repetitive elements and method development, and the many opportunities, both training and collaborative, presented in this application, there is a potential for spearheading research into the role of TE in phenotypic variation and, by extension, human disease.
The specific aims of this project are two-fold: (i) to identify somatic TE insertions causing alterations in expressed sequences between neuronal tissues within an individual ; (ii) to identify differences in TE insertions between mouse strains and possible association to phenotype; and, (iii) to identify TE insertion polymorphisms among 1000 individuals in the human population. The objective is to identify somatic variation arising from TE insertions occurring during neurogenesis; thereafter, I will determine if novel insertion events have preferentially occurred in neuronally active transcripts. I will achieve this by isolating RNA from cortical layers in mouse brain, by enriching sequence with TE-containing transcripts, and by performing paired-end deep sequencing with a Illumina Genome Analyzer. This will be done in collaboration with Dr. Elliott Margulies of NHGRI and Dr. Zoltan Molnar (University of Oxford).
The objective of the project focusing on TE insertion variants between mouse strain genetics is to investigate whether these variants contribute to phenotypic variation (collaboration with Dr. Jonathan Flint, Oxford University). Since the mouse strains that are currently being sequenced display a large range of phenotypes this provides a unique opportunity to determine the role of TE insertion events on trait variation. The aims of the human population genetics project are to map rare variant TE insertions in humans. A collaboration with Dr. Richard Durbin and Dr. David Adams of the Wellcome Trust Sanger Institute will allow me to gain access to, and receive advice about, efforts to sequence, assembly and analyse multiple human genomes. These world-class research groups are primary contributors to the 1000 Genome and the MRC-funded Mouse Strain Sequencing Projects, thereby providing an invaluable training opportunity.
With my background in neuroscience, repetitive elements and method development, and the many opportunities, both training and collaborative, presented in this application, there is a potential for spearheading research into the role of TE in phenotypic variation and, by extension, human disease.
People |
ORCID iD |
Publications
Ansari M
(2014)
Genetic heterogeneity in Cornelia de Lange syndrome (CdLS) and CdLS-like phenotypes with observed and predicted levels of mosaicism.
in Journal of medical genetics
Danecek P
(2012)
High levels of RNA-editing site conservation amongst 15 laboratory mouse strains.
in Genome biology
Ferry Q
(2014)
Diagnostically relevant facial gestalt information from ordinary photos.
in eLife
Keane TM
(2011)
Mouse genomic variation and its effect on phenotypes and gene regulation.
in Nature
Nellåker C
(2012)
The genomic landscape shaped by selection on transposable elements across 18 mouse strains.
in Genome biology
Ponting CP
(2011)
Rapid turnover of functional sequence in human and other genomes.
in Annual review of genomics and human genetics
Yalcin B
(2011)
Sequence-based characterization of structural variation in the mouse genome.
in Nature
Description | Extension of MRC Centenary Fellowship award |
Amount | £25,970 (GBP) |
Organisation | University of Oxford |
Sector | Academic/University |
Country | United Kingdom |
Start | 09/2013 |
End | 02/2014 |
Description | MRC Centenary Award |
Amount | £51,940 (GBP) |
Organisation | Medical Research Council (MRC) |
Sector | Public |
Country | United Kingdom |
Start | 09/2012 |
End | 08/2013 |
Description | New group leader bridging funding - BRC |
Amount | £50,000 (GBP) |
Organisation | Oxford University Hospitals NHS Foundation Trust |
Department | NIHR Oxford Biomedical Research Centre |
Sector | Academic/University |
Country | United Kingdom |
Start | 03/2014 |
End | 02/2015 |
Description | New group leader bridging funding - DPAG |
Amount | £50,000 (GBP) |
Organisation | University of Oxford |
Department | Department of Physiology, Anatomy and Genetics |
Sector | Academic/University |
Country | United Kingdom |
Start | 03/2014 |
End | 02/2015 |
Title | CFPS |
Description | This is an open source release of the algorithms underlying the recent publication "Diagnostically relevant facial gestalt information from ordinary photos" |
Type Of Material | Technology assay or reagent |
Year Produced | 2014 |
Provided To Others? | Yes |
Impact | This is the proof of principle tool and publication that my future main research direction is based around. The approach hold great promise for delivering a cheap, high throughput clinical investigation tool for rare disases. |
URL | http://elifesciences.org/content/3/e02020 |
Title | Mouse Strains TE variation catalogue |
Description | A database of all transposable element variations among 18 mouse strains. Database is published with the publication "The genomic landscape shaped by selection on transposable elements across 18 mouse strains." |
Type Of Material | Biological samples |
Year Produced | 2012 |
Provided To Others? | Yes |
Impact | This is a valuable resource for the TE community but is too new for impact to be assessed. |
URL | http://genomebiology.com/content/13/6/R45 |
Title | CFPS data |
Description | Diagnostically relevant facial gestalt information from ordinary photos database. |
Type Of Material | Database/Collection of data |
Year Produced | 2014 |
Provided To Others? | Yes |
Impact | This is the data underlying the algorithm development and testing described in our recent publication "Diagnostically relevant facial gestalt information from ordinary photos." |
URL | http://elifesciences.org/content/3/e02020/article-data |
Title | CFPS pipeline |
Description | This is the algorithms comprising the Clinical Face Phenotype Space pipeline described in our publication "Diagnostically relevant facial gestalt information from ordinary photos". |
Type Of Material | Computer model/algorithm |
Year Produced | 2014 |
Provided To Others? | Yes |
Impact | This is the proof of principle pipeline from which the future research development will build. |
URL | http://dx.doi.org/10.7554/eLife.02020 |
Description | Systems Biology Laboratory / Fischer Family Trust |
Organisation | Fischer Family Trust (FFT) |
Department | Systems Biology Laboratory UK |
Country | United Kingdom |
Sector | Charity/Non Profit |
PI Contribution | We were contacted by SBL regarding collaboration on a project closely related with the projects I am funded by the MRC to pursue. After an initial meeting to evaluate the potential for a collaborative effort SBL agreed to fund a post-doctoral position within the FGU computational biology group. This position has been appointed to this position and is working with the SBL questions. This collaboration is expanding to encompass one of my originial projects which will improve the potential and time to results being produced. |
Collaborator Contribution | Postdoctoral position within the group funded working with separate question but that adds expertise to the group. Exchange of ideas on similar type problems experienced by both projects (theirs and mine). Collaboration regarding the wet-lab portion of my project has been offered and will speed up the work immensely. |
Impact | - post-doctoral position funded within the FGU computational biology group. |
Start Year | 2010 |
Title | CFPS |
Description | This is the method by which we extract diagnostically relevant facial gestalt information from ordinary photos. We are continuing to develop this to become a valuable clinical investigation tool for developmental disorders. The CFPS algorithms have been shown to work as a proof of principle, we are now at the stage of expanding clinical collaborations to drive development to the clinical setting. |
Type | Diagnostic Tool - Imaging |
Current Stage Of Development | Refinement. Non-clinical |
Year Development Stage Completed | 2014 |
Development Status | Actively seeking support |
Impact | This is at an early stage, but is garnering significant interest from the clincial genetics community. |
Title | CFPS |
Description | The Clinical Face Phenotype Space algorithms and pipeline as described in the publication "Diagnostically relevant facial gestalt information from ordinary photos" |
Type Of Technology | New/Improved Technique/Technology |
Year Produced | 2014 |
Impact | This is leading to a new research theme which is now the main focus of my research career. This is the first proof of principle approach for automatics detection, annotation and analysis of craniofacial phenotype from photographs. I am continuing to develop this to become a valuable tool in clinical investigations of developmental disorders. |
Description | AskScience panelist |
Form Of Engagement Activity | A magazine, newsletter or online publication |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Public/other audiences |
Results and Impact | I answer questions with regards to genetics and transposable elements. This is a forum for the public to ask any science related questions they like and a panel of scientists can answer. When I joined AskScience there were 5K people subscribing to the forum. Over the past 4 years this has grown to a subscribed following of more than 1M. The number of passive participants vastly exceed the number of subscribers. |
Year(s) Of Engagement Activity | 2010,2011,2012,2013 |
URL | http://www.reddit.com/r/AskScience |