Using transcriptomics to transform the diagnosis and understanding of inherited adult neurological disorders

Lead Research Organisation: University College London
Department Name: Institute of Neurology

Abstract

We all inherit an instruction manual for making the human body and this is encoded in our DNA. Unfortunately, spelling mistakes or mutations in our DNA can increase our risk or even directly cause certain disorders including disorders, which mainly affect the brain and nervous system in adults. Recognising the mutations that cause these disease can be difficult. This is because we all carry many mutations within our DNA most of which are probably harmless and because the brain is a very complicated organ. However, it is important that doctors and researchers find ways of assessing the importance of specific mutations so that we can improve the way in which we diagnose people with rare inherited neurological diseases and so that we can find new ways to treat these conditions. This is particularly important now that more people are being offered testing to look for mutations in their DNA and that technological advances mean that we can potentially test all of a patient's DNA (not just a small part).

One way to try and decide whether mutations in the DNA are contributing to neurological diseases is by understanding more about which parts of the DNA are expressed in the human brain and the impact that changes in the DNA have on the expression of particular genes (the basic building blocks of the DNA) in the brain. In order to check this, researchers need to make measurements about the genetic variation an individual carries and link this information to the genes they express in their brain cells. Although making these types of measurements in the brain is very difficult, it is possible and this is the aim of this project. One of the challenges is accurately measuring the amount of each of the genes expressed in specific parts of the brain and even in the specific types of cells in the brain. I will try and overcome this challenge in two ways. Firstly, I will study gene expression data from 12 different regions of the human brain and central nervous system taken from people who have donated their brain to a brain bank. Secondly, I will study cell types found in human brain that have been made in culture. These cells start as a patient's skin cells, but are then converted into induced pluripotent stem cells, a very special kind of cell that can be used to make lots of different types of cells in the human body including brain cells.

This project also uses a new technology for measuring all the genes expressed in a single cell type or tissue sample. This technology is called "RNA-seq" or "whole transcriptome sequencing" and it allows researchers to measure all kinds of genes at the same time. It also allows us to measure the relative amounts of alternative versions of the same gene, and to measure these quantities in such a way that the influence of genetic risk factors can be more sensitively detected by directly comparing the relative amounts in individuals who happen to have both a "good" and a "bad" copy of a given genetic variant. In order to make more sensitive measurements about some types of RNA I will also be making measurements from two main parts of the cell, called the nucleus and cytoplasm.

By making all these measurements about the way genes are expressed in the human brain and brain cells, I will create a very detailed map of gene expression in the human brain together with a map of how this expression is controlled. Most importantly, I will work with other doctors and researcher and use the detailed map to try and help reach a diagnosis in specific patients. In this way, I will find out whether having detailed information on the expression and regulation of genes in the human brain really does help us to better diagnose and understand inherited brain diseases.

Technical Summary

The aim of this study is to determine the effect of genetic factors on whole transcriptome expression in the human central nervous system and in brain-relevant cell types with the primary aim of transforming diagnostics for adult inherited neurological disorders. This project is inspired by the growth in whole genome sequencing in patients, which is likely to drive the discovery of an increasing number of potentially pathogenic variants within non-coding regions of the genome. Prioritising such variants for further analysis and understanding the molecular processes and regional/cellular site of action is key to ensuring diagnostic accuracy and driving further research. I aim to address both these important issues. Using control post-mortem human tissue from 12 regions of the human central nervous system, and through analysis of paired nuclear and cytoplasmic RNAseq data, I will define the transcribed portion of the human genome in adult brain. This data will also be analysed using allele-specific expression (ASE) analysis to identify genetic loci that regulate the expression of specific gene transcripts within specific tissues. A similar approach will be taken to the analysis of RNA sequencing data generated from brain-relevant cell-types originating from human induced pluripotent stem cells. This data will be used to explore cell and state-specific regulation of gene expression in brain-relevant cell types. Through collaboration with the neurology domain of the Genomics England Clinical Interpretation Partnership, I hope to test the value of this approach in real clinical scenarios and in some cases through additional RNA sequencing of patient samples. All the data generated by this project will be publicly released as both raw data files for re-analysis and processed information suitable for non-expert users. Thus, I will generate novel disease-relevant findings and provide the neuroscience community with a world-class resource.

Planned Impact

The goal of this work is to use transcriptomics to transform the diagnostic process and improve the understanding of inherited adult neurological disorders. Reaching this goal will require many intermediate steps and I describe who I envisage using the research findings and how this data will help them.

Clinicians and clinical scientists: This study will provide a detailed map of the transcribed portion of the human genome and the underlying regulatory architecture in human brain. This information will inform decisions about the prioritisation of rare variants for further investigation following whole genome sequencing. This will be facilitated through the neurology domain of the Genomics England Clinical Interpretation Partnership (GECIP). Given that the 100,000 Genomes Project is also committed to the collection of "omics" samples, including blood samples suitable for transcriptomic analysis, in some cases such samples may be requested and analysed.

Researchers interested in the follow up of genetic variant data: This study was designed to improve our understanding of the molecular mode of action and cellular location in which genetic variants for adult neurological disorders operate. Therefore, users of the data would include international consortia for neurological diseases, such as the International Parkinson's Disease Genetics Consortium amongst others. Since I will be publicly releasing all findings through the UK Brain Expression Consortium's web portal (www.braineac.org), I envisage other less specialist researchers also accessing the data. This could include drug companies interested in following up genetic findings for clinical trial design or biomarker development.

Researchers interested in the creation and use of in vitro model systems: The generation of region-specific and cell-specific whole transcriptome sequencing data produced in this project will be a baseline for the detailed molecular assessment of stem cell-based disease models. I will work collaboratively to identify the strengths and weaknesses of existing cell models and work to improve the accuracy of these systems. Thus, I will contribute to the development of new research tools for use by the neuroscience research community.

Basic neurobiologists: The large gene expression data sets I am producing will provide insights into other aspects of basic neurobiology. Sequencing of paired nuclear and cytoplasmic RNA will provide a unique resource for the analysis of pre-mRNA processing. This may provide the basic information required to develop novel approaches to manipulating these processes to "salvage" mutations through facilitating exon-skipping or use of alternative promoters. In the longer term, I believe that this data could facilitate research into novel therapeutic approaches.

Bioinformaticians & statistical geneticists: The paired exome and RNA sequencing data will be used by statistical genetics and bioinformatics wishing to test and develop new statistical and analytical methods. In particular, the existence of paired nuclear and cytoplasmic RNA sequencing data will allow researchers to objectively test and improve methods for transcript calling. Therefore, I will directly contribute to one of the MRC's stated aims, the training of more researchers and clinicians in bioinformatics.

Charities & patient groups: The rapid discovery of genetic variants for inherited neurological disorders represents the most significant advance in the understanding of these diseases in the last 10 years. Therefore, they are hugely important to patients and carers desperately seeking progress in the diagnosis and treatment of their diseases. From my experience of contact with patients and charities, it is clear that engaging with patients and communicating the findings generated from research is vital to maintain hope and improve the uptake of clinical trials.

Publications

10 25 50
 
Description Joint Lead for Bioinformatics in Neurology GeCIP
Geographic Reach National 
Policy Influence Type Membership of a guideline committee
 
Description Report to NHSE on psychiatric genetics
Geographic Reach National 
Policy Influence Type Contribution to a national consultation/review
 
Description Dissecting the Mechanisms Underlying Disease Progression
Amount $8,500,000 (USD)
Organisation Aligning Sciences Across Parkinson's 
Sector Charity/Non Profit
Country United States
Start 01/2021 
End 12/2024
 
Description Mapping The PD Brain: Oligomer-driven Functional Genomics
Amount $8,700,000 (USD)
Organisation Aligning Sciences Across Parkinson's 
Sector Charity/Non Profit
Country United States
Start 01/2021 
End 12/2024
 
Description Pilot Project Grant
Amount £48,507 (GBP)
Organisation Alzheimer's Research UK 
Sector Charity/Non Profit
Country United Kingdom
Start 05/2018 
End 05/2019
 
Description Understanding the cellular basis for Parkinson's associated dementia
Amount £280,000 (GBP)
Organisation Parkinson's UK 
Sector Charity/Non Profit
Country United Kingdom
Start 01/2023 
End 01/2025
 
Description Using transcriptomics to improve the diagnosis of neurometabolic disorders
Amount £225,000 (GBP)
Organisation Sparks Charity 
Sector Charity/Non Profit
Country United Kingdom
Start 07/2021 
End 06/2024
 
Description Identification and annotation of novel transcription 
Organisation Johns Hopkins Medicine
Country United States 
Sector Hospitals 
PI Contribution Expertise in transcriptomic analyses
Collaborator Contribution Access to a resource of mapped transcriptomic data.
Impact This is multidisciplinary - clinicians and bioinformaticians
Start Year 2017
 
Description Increasing the diagnostic yield for patients with probable genetic mitochondrial disorders 
Organisation King's College London
Country United Kingdom 
Sector Academic/University 
PI Contribution We are currently running RNAseq analyses of patient fibroblasts to trying and improve the diagnostic yield for patients who have not had a diagnosis after WES and mitochondrial genome sequence analys
Collaborator Contribution Patient samples and expertise regarding inherited mitochondrial disorders
Impact This is collaboration is multidisciplinary and involves neuroscientists, biochemists and clincians.
Start Year 2017
 
Description Increasing the diagnostic yield for patients with probable genetic mitochondrial disorders 
Organisation Newcastle University
Country United Kingdom 
Sector Academic/University 
PI Contribution We are currently running RNAseq analyses of patient fibroblasts to trying and improve the diagnostic yield for patients who have not had a diagnosis after WES and mitochondrial genome sequence analys
Collaborator Contribution Patient samples and expertise regarding inherited mitochondrial disorders
Impact This is collaboration is multidisciplinary and involves neuroscientists, biochemists and clincians.
Start Year 2017
 
Description Increasing the diagnostic yield for patients with probable genetic mitochondrial disorders 
Organisation University of Oxford
Country United Kingdom 
Sector Academic/University 
PI Contribution We are currently running RNAseq analyses of patient fibroblasts to trying and improve the diagnostic yield for patients who have not had a diagnosis after WES and mitochondrial genome sequence analys
Collaborator Contribution Patient samples and expertise regarding inherited mitochondrial disorders
Impact This is collaboration is multidisciplinary and involves neuroscientists, biochemists and clincians.
Start Year 2017
 
Description Nanotweezer technology 
Organisation Imperial College London
Country United Kingdom 
Sector Academic/University 
PI Contribution We provide expertise on the analysis of mitochondria using next generation sequencing
Collaborator Contribution Access to the nanotweezer technology enable directed single mitochondrion extraction.
Impact An MRC grant application is being formulated.
Start Year 2018
 
Description Neurometabolic disorders 
Organisation University College London
Department Institute of Child Health
Country United Kingdom 
Sector Academic/University 
PI Contribution I have been working with Dr Philippa Mills and Professor Paul Gissen on the use of transcriptomics for the analysis of patients enrolled in their Study of Inherited Metabolic Diseases.
Collaborator Contribution They have provided access and expertise in this area.
Impact I have submitted a grant application with them to fund further work on neurometabolic diseases.
Start Year 2020
 
Description The genetics of epilepsy progression 
Organisation University College London
Country United Kingdom 
Sector Academic/University 
PI Contribution Expertise in the analysis of transcriptomic data.
Collaborator Contribution Supply of imaging data for epilepsy patients.
Impact This is multidisciplinary - basic scientists and clinicians
Start Year 2017
 
Description The regulation of gene expression by variable repeats 
Organisation Cardiff University
Country United Kingdom 
Sector Academic/University 
PI Contribution Provision of high depth RNAseq data and some analysis
Collaborator Contribution Expertise in the genetics of HD progression
Impact Nil
Start Year 2018
 
Description snRNAseq of post-mortem human brain 
Organisation Imperial College London
Country United Kingdom 
Sector Academic/University 
PI Contribution We are collaborating on the analysis of paired snRNAseq and bulk RNAseq data from human brain focusing on PD.
Collaborator Contribution Access to single nuclear RNAseq expertise.
Impact Nil yet
Start Year 2018
 
Title CoExp 
Description It is a web application to enable the use of gene co-expression networks and their visualisation. 
Type Of Technology Webtool/Application 
Year Produced 2020 
Impact This web app has enabled new collaborations as well as improving outputs within the group. 
URL https://rytenlab.com/coexp
 
Title IntroVerse 
Description This is a database of splicing across all human tissues. 
Type Of Technology Webtool/Application 
Year Produced 2022 
Impact This work has been published in the database issue of Nucleic Acid Research and we have had a number of requests for full data downloads 
 
Title vizER 
Description WebApp to search for unannotated expressed genomic regions in close proximity to known genes. 
Type Of Technology Webtool/Application 
Year Produced 2020 
Impact This web app has contributed to a paper and is being used internationally. 
URL https://rytenlab.com/browser/app/vizER
 
Description Research in Clinical Genetics 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Postgraduate students
Results and Impact It was a talk to junior doctors interested in research on research opportunities in clinical genetics.
Year(s) Of Engagement Activity 2015
 
Description Talk to Hospital consultants 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Professional Practitioners
Results and Impact Explanation of the applications and future of genomic medicine.
Year(s) Of Engagement Activity 2018