Genetic variations in transposable elements: Germ line differences and somatic variations induced during neurogenesis

Lead Research Organisation: MRC Harwell Institute
Department Name: UNLISTED

Abstract

The sequencing of human and other genomes has produced an overwhelming amount of data. One challenge is to understand why 98.94% of the genome does not seem to be coding for functional proteins. Another is why over half of the sequence contains "parasitic" Transposable Elements (TE) that can produce copies of themselves anywhere else in the genome. In recent years there has been a shift in perception concerning the effects of "parasitic" and "junk" DNAs on biology. This project aims to use the data produced by next-generation sequencing technologies to determine if and how TE affect the development of the brain, whether they contribute differences between individuals and exactly how commonly they copy themselves around the genome. The occurrence and expression of TE elements have been associated with several human diseases such as cancer, diabetes and schizophrenia. A fundamental understanding of their role in normal "healthy" processes is necessary before we can hope to understand how they contribute to disease.

Technical Summary

This proposed project aims to capitalise on the wealth of data emanating from the recent and rapid developments in next-generation DNA sequencing technologies. Transposable elements (TE) are a class of repetitive elements, which when intact, are capable of making retrocopies in the genome. Formerly considered "junk" or "parasitic" DNA these elements are suspected of being a source of non-coding regulation and phenotypic variation. TE have been associated with several human diseases such as cancer, haemophilia A, muscular dystrophy and schizophrenia.
The specific aims of this project are two-fold: (i) to identify somatic TE insertions causing alterations in expressed sequences between neuronal tissues within an individual ; (ii) to identify differences in TE insertions between mouse strains and possible association to phenotype; and, (iii) to identify TE insertion polymorphisms among 1000 individuals in the human population. The objective is to identify somatic variation arising from TE insertions occurring during neurogenesis; thereafter, I will determine if novel insertion events have preferentially occurred in neuronally active transcripts. I will achieve this by isolating RNA from cortical layers in mouse brain, by enriching sequence with TE-containing transcripts, and by performing paired-end deep sequencing with a Illumina Genome Analyzer. This will be done in collaboration with Dr. Elliott Margulies of NHGRI and Dr. Zoltan Molnar (University of Oxford).
The objective of the project focusing on TE insertion variants between mouse strain genetics is to investigate whether these variants contribute to phenotypic variation (collaboration with Dr. Jonathan Flint, Oxford University). Since the mouse strains that are currently being sequenced display a large range of phenotypes this provides a unique opportunity to determine the role of TE insertion events on trait variation. The aims of the human population genetics project are to map rare variant TE insertions in humans. A collaboration with Dr. Richard Durbin and Dr. David Adams of the Wellcome Trust Sanger Institute will allow me to gain access to, and receive advice about, efforts to sequence, assembly and analyse multiple human genomes. These world-class research groups are primary contributors to the 1000 Genome and the MRC-funded Mouse Strain Sequencing Projects, thereby providing an invaluable training opportunity.
With my background in neuroscience, repetitive elements and method development, and the many opportunities, both training and collaborative, presented in this application, there is a potential for spearheading research into the role of TE in phenotypic variation and, by extension, human disease.

People

ORCID iD

 
Description Extension of MRC Centenary Fellowship award
Amount £25,970 (GBP)
Organisation University of Oxford 
Sector Academic/University
Country United Kingdom
Start 09/2013 
End 02/2014
 
Description MRC Centenary Award
Amount £51,940 (GBP)
Organisation Medical Research Council (MRC) 
Sector Public
Country United Kingdom
Start 09/2012 
End 08/2013
 
Description New group leader bridging funding - BRC
Amount £50,000 (GBP)
Organisation Oxford University Hospitals NHS Foundation Trust 
Department NIHR Oxford Biomedical Research Centre
Sector Academic/University
Country United Kingdom
Start 03/2014 
End 02/2015
 
Description New group leader bridging funding - DPAG
Amount £50,000 (GBP)
Organisation University of Oxford 
Department Department of Physiology, Anatomy and Genetics
Sector Academic/University
Country United Kingdom
Start 03/2014 
End 02/2015
 
Title CFPS 
Description This is an open source release of the algorithms underlying the recent publication "Diagnostically relevant facial gestalt information from ordinary photos" 
Type Of Material Technology assay or reagent 
Year Produced 2014 
Provided To Others? Yes  
Impact This is the proof of principle tool and publication that my future main research direction is based around. The approach hold great promise for delivering a cheap, high throughput clinical investigation tool for rare disases. 
URL http://elifesciences.org/content/3/e02020
 
Title Mouse Strains TE variation catalogue 
Description A database of all transposable element variations among 18 mouse strains. Database is published with the publication "The genomic landscape shaped by selection on transposable elements across 18 mouse strains." 
Type Of Material Biological samples 
Year Produced 2012 
Provided To Others? Yes  
Impact This is a valuable resource for the TE community but is too new for impact to be assessed. 
URL http://genomebiology.com/content/13/6/R45
 
Title CFPS data 
Description Diagnostically relevant facial gestalt information from ordinary photos database. 
Type Of Material Database/Collection of data 
Year Produced 2014 
Provided To Others? Yes  
Impact This is the data underlying the algorithm development and testing described in our recent publication "Diagnostically relevant facial gestalt information from ordinary photos." 
URL http://elifesciences.org/content/3/e02020/article-data
 
Title CFPS pipeline 
Description This is the algorithms comprising the Clinical Face Phenotype Space pipeline described in our publication "Diagnostically relevant facial gestalt information from ordinary photos". 
Type Of Material Computer model/algorithm 
Year Produced 2014 
Provided To Others? Yes  
Impact This is the proof of principle pipeline from which the future research development will build. 
URL http://dx.doi.org/10.7554/eLife.02020
 
Description Systems Biology Laboratory / Fischer Family Trust 
Organisation Fischer Family Trust (FFT)
Department Systems Biology Laboratory UK
Country United Kingdom 
Sector Charity/Non Profit 
PI Contribution We were contacted by SBL regarding collaboration on a project closely related with the projects I am funded by the MRC to pursue. After an initial meeting to evaluate the potential for a collaborative effort SBL agreed to fund a post-doctoral position within the FGU computational biology group. This position has been appointed to this position and is working with the SBL questions. This collaboration is expanding to encompass one of my originial projects which will improve the potential and time to results being produced.
Collaborator Contribution Postdoctoral position within the group funded working with separate question but that adds expertise to the group. Exchange of ideas on similar type problems experienced by both projects (theirs and mine). Collaboration regarding the wet-lab portion of my project has been offered and will speed up the work immensely.
Impact - post-doctoral position funded within the FGU computational biology group.
Start Year 2010
 
Title CFPS 
Description This is the method by which we extract diagnostically relevant facial gestalt information from ordinary photos. We are continuing to develop this to become a valuable clinical investigation tool for developmental disorders. The CFPS algorithms have been shown to work as a proof of principle, we are now at the stage of expanding clinical collaborations to drive development to the clinical setting. 
Type Diagnostic Tool - Imaging
Current Stage Of Development Refinement. Non-clinical
Year Development Stage Completed 2014
Development Status Actively seeking support
Impact This is at an early stage, but is garnering significant interest from the clincial genetics community. 
 
Title CFPS 
Description The Clinical Face Phenotype Space algorithms and pipeline as described in the publication "Diagnostically relevant facial gestalt information from ordinary photos" 
Type Of Technology New/Improved Technique/Technology 
Year Produced 2014 
Impact This is leading to a new research theme which is now the main focus of my research career. This is the first proof of principle approach for automatics detection, annotation and analysis of craniofacial phenotype from photographs. I am continuing to develop this to become a valuable tool in clinical investigations of developmental disorders. 
 
Description AskScience panelist 
Form Of Engagement Activity A magazine, newsletter or online publication
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact I answer questions with regards to genetics and transposable elements. This is a forum for the public to ask any science related questions they like and a panel of scientists can answer.

When I joined AskScience there were 5K people subscribing to the forum. Over the past 4 years this has grown to a subscribed following of more than 1M. The number of passive participants vastly exceed the number of subscribers.
Year(s) Of Engagement Activity 2010,2011,2012,2013
URL http://www.reddit.com/r/AskScience