Promoter-associated histone modifications and establishment of the developmental gene expression program during early embryogenesis

Lead Research Organisation: University of Birmingham
Department Name: Clinical and Experimental Medicine

Abstract

Animal bodies including that of humans develop from a single cell, the fertilised egg. The development of embryos from fertilised eggs happens through proliferation of embryonic cells and a coordinated sequence of changes of these cells, which leads to formation of a variety of tissues. This process is governed by the coordinated activation of thousands of genes at the right time and in the right cells and is called genetic regulation of development. Each of the cells carry the same set of genes yet the activity of genes varies greatly in various cell types. When and where genes get switched on is a key determinant of normal development and is encoded in the DNA sequence present in every cell.

However, there is an additional mechanism of gene control, which operates by selective packaging of DNA (the genes and their switches) in the cell nucleus. The packaging of DNA (called chromatin) varies between cells and is important for marking the genes for activation. The marking and order of DNA packaging is inherited between cells during cell proliferation and may also be inherited from the oocyte and the sperm after fertilisation. This process is called epigenetic inheritance and has important implications in how inherited information is conveyed from parent to the embryo.

In this project we will study how one particular type of epigenetic information may contribute to the regulation of genes during embryo development. We will use a small animal model zebrafish, which is popular for genetic analysis of development due to the similarities in the way fish and human embryos regulate their genes during development and because the small transparent fish embryos can be easily cultured and monitored in laboratory conditions. We will study how proteins important in DNA packaging (histones) are marked in gametes and embryos to ask whether they are important for conveying epigenetic information from the parents to the developing embryo. We will explore the DNA codes of epigenetic regulation using biochemical assays that will tell us about histone marks on gene switches, and by analysing the DNA sequences which carry the codes for DNA packaging at the sites of the switches of genes active during embryo development. Our findings will be important for understanding the regulation of genes in normal development and will be useful for those studying the reasons for abnormal gene regulation, which causes a variety of disease both in development and in adult life.

Technical Summary

BACKGROUND - Understanding the mechanisms of epigenetic regulation and their phenotypic consequences in development is a key prerequisite to unravelling the disease-causing mechanisms involving epigenetic change. The mechanisms of epigenetic signals (DNA methylation, histone posttranslational modifications (PTM) and RNAs) in the maintenance and reprogramming of cell fates remains elusive. Premarking of histones on promoters is present at early ontogenic stages and in sperm, suggesting predictive roles in establishment of a developmental gene expression.

OBJECTIVES - We address histone modifications in transcription regulation during earliest stages of embryo development. We shall exploit the transcription-free stages of zebrafish development to study the temporal regulation of chromatin modifications from gametes to transcriptionally active embryo and ask: 1. Does the pattern of PTMs in gametes predict pattern of chromatin modifications. 2. Can parent-specific histone PTMs be traced in the developing vertebrate embryo and do parental PTM patterns impact developmental gene expression? 3. What sequence information (CpG content) is required for the deposition of H3K4me3 and transcription start site choice?

METHODS - We will use small cell number ChIP and RNA sequencing in embryos and gametes, and apply our recent advances in deciphering core promoter codes to decipher the signals associated with deposition of the transcription initiation machinery, histone PTMs and nucleosome positioning signals by genome-scale analysis.

EXPECTED OUTCOME - This work will shed light on the relationship between chromatin regulation and transcription initiation at zygotic genome activation of the vertebrate embryo. It will elucidate the contribution of histone PTMs to developmental program of gene expression for normal ontogeny of a complex organism and provide critical evidence and a resource to study transgenerational epigenetic inheritance in a tractable vertebrate model.

Planned Impact

The projects will have a large academic impact, described in detail in "Academic beneficiaries". It is perfectly aligned with the BBSRC strategic priority "Data Driven Biology" - it includes analysis and interrogation of next-generation sequencing datasets, analysis of the impact of genome variation, the extraction of quantitative information, and the development of new visualisation approaches for answering biological questions. Even though it addresses a fundamental scientific problem, the results and the methodologies developed within the project have the potential for a broader impact:

IMPACT ON HEALTH - The research focuses on the understanding of fundamental processes during early development, including the mechanism of activation of developmental programmes and the genes that control it, often associated with genetic and multifactorial disease. Understanding their early activation will have impact on understanding their function and the generation of hypotheses about the mechanism of disease. The events during the transition from gametes to dividing embryo and its activation will characterise key epigenetic events following fertilisation and provide a wealth of data that can be exploited for studies of stem cell biology, and transgenerational epigenetic inheritance and their applications.

IMPACT OF NEW METHODOLOGIES IN DEVELOPMENTAL GENOMICS - The experimental protocols we plan to develop for enabling genome-wide studies on limited amount of material will have impact on future efforts with cell type specific epigenomics. Applications will include include human (patient) samples, other animal model organism, and commercial species.

IMPACT OF THE GENERATED DATA AND SOFTWARE TOOLS - To maximise the impact of genome-wide datasets produced in the course of the project, it is necessary to provide them in an easily accessible and documented way. The combination of the data and tools to manipulate them is expected to have additional impact beyond the areas directly addressed by the project itself.

IMPACT ON THE EXPERTISE IN COMPUTATIONAL AND DEVELOPMENTAL BIOLOGY - The expertise in computational genomics and epigenomics in particular, is currently in high demand that vastly exceeds the supply of qualified, talented people. Training postdoctoral researchers in an interdisciplinary setting by collaborating groups with strong track record in computational genomics and developmental genetics bridges the training gap between computational and wet lab and has a potential to produce future research leaders. The computational methods developed on the project will benefit the community by increasing productivity through reusing and extending the softwares created here.

GENERAL IMPACT ON THE UK SCIENCE - University of Birmingham and Imperial College London are currently expanding their activities to become world leaders in the areas of genomics, integrative and systems biology. The competitive research program that we propose here will represent a large step towards that goal. Our training activities will provide exposure to our research and skills to talented students in developmental and computational biology across Europe, and serve as a platform for their recruitment to the UK.

IMPACT ON GENERAL PUBLIC AND APPRECIATION OF SCIENCE - Our outreach activities will span the audiences from general public to students. Within general public, we will organise activities that cover age groups from primary school pupils to adults. In addition to conveying the importance of our research, we shall address topics such as why zebrafish is a good organism to study human biology and disease, how whole genomes are studied, how biologists struggle with vast amounts of data etc. We expect this to benefit both the general public and the scientific community by increasing public appreciation of science and scientists, and by early exposure to current research.

The details of implementation are provided in "Pathways to Impact".
 
Title Expert advice and technical support for art piece generation by creative artist 
Description Provided laboratory space, equipment and reagents and provided expert advice on genetics to inspire and aid in the creation of a genetics-inspired artwork by Laurie Ramsell, resident artist at Birmingham Open Media. 
Type Of Art Artistic/Creative Exhibition 
Year Produced 2017 
Impact The art piece was exhibited at BOM in Birmingham. 
URL http://www.bom.org.uk/tag/laurie-ramsell
 
Description We have shown that chromatin (the DNA and proteins whcih packaging in the early embryo is showing similarities to what is detected in the gametes such as sperm and egg and this suggest that the way DNA is packaged in the gametes influences how the embryo's own DNA is packaged. It has previously been shown that histones the proteins that are used for DNA packaging may carry chemical tags which may influence how genes are activated. We should that these chemical marks are similar between sperm and the embryo and that these marks are present before the genes of the embryo are switched on. This remarking may have important implications on how the gametes influence gene expression in the embryo. We have shown that primordial germ cells, the cell type which differentiates very early in zebrafish embryo development carry unique chromatin packaging profiles suggesting that these cells, which will become the gametes, are disnitiguished from the rest of the embryo by modulating their gene expression profiles through chromatin packaging mechanisms.
Genes are switched on and off in development in a highly regulated fashion and this regulation is carried out by DNA sequences called promoters and enhancers. We have studied promoters of the genes of the developing embryos and identified several promoter codes (DNA sequences) which define gene regulation in early embryos. We showed that more than one code may be embedded in a single promoter and that these codes will be distinctly used during different phases of early embryo development.
Exploitation Route Datasets generated in the project have been deposited to GEO and other genomics dataset repositories and also uploaded to the DANIO-CODE Data Coordination Centre , which we co-developed with collaborating laboratories. The DCC is publicly available and maintains all accessible published and unpublished zebrafish genomics datasets in a standardised form.
Sectors Education,Pharmaceuticals and Medical Biotechnology

 
Description Providing advice
Geographic Reach Multiple continents/international 
Policy Influence Type Membership of a guideline committee
URL http://www.greekc.org
 
Description - Marie Sklodowska Curie Initial Training Network (ITN),
Amount € 4,000,000 (EUR)
Organisation European Commission 
Sector Public
Country European Union (EU)
Start 01/2015 
End 12/2019
 
Description Investigator AWARDS
Amount £930,000 (GBP)
Organisation Wellcome Trust 
Department Wellcome Trust Senior Investigator Award
Sector Charity/Non Profit
Country United Kingdom
Start 10/2015 
End 10/2020
 
Description PrOGRAMME GRANTS
Amount $1,000,000 (USD)
Organisation Human Frontier Science Program (HFSP) 
Sector Charity/Non Profit
Country France
Start 12/2015 
End 12/2018
 
Title DANIO-CODE Data Coordination Centre 
Description I am coordinator of the DANIO-CODE consortium ( see details below) which has recently generated a Data co-ordination Centre (DCC). My role in the establishment of the DCC was to coordinate the establishment of the network, create website for the network and organise the DCC activity. Our lab contributed to the DCC directly by data submission, organising data annotation jamboree and coordinating the establishment of the DCC. The DCC design and setup was led by Carsten Daub in Karolinska Institutet and included 5 collaborating laboratories at Imperial, Karolinska, Berkeyel California, ZFIN at Oregon and University of Birmingham). DANIO-CODE: DANIO-CODE is an international collaborative effort that aims to annotate the functional elements of the zebrafish genome. Zebrafish are increasingly used to successfully model human disease, to screen drugs and to study environmental toxicity in addition to being a model for embryonic development. Zebrafish genome is the third most complete but further annotation is essential to better realize the power of this model organism in diverse areas of biomedical research and to further strengthen its role in biomedical research. DANIO-CODE partners work together to provide a central resource of publicly available data and genome annotation of the zebrafish genome. DANIO-CODE combines a large variety of genome wide datasets including protein-coding and non-coding transcribed genome elements, non-coding functional elements such as cis-regulatory modules and associated epigenetic features. DANIO-CODE was established at a workshop held in Imperial College London in December 2014 (see meeting report published in Zebrafish) where members of the zebrafish genomics research community and previous contributors to ENCODE and FANTOM genome annotation projects identified key aims. The DANIO-CODE key aims Establishment of a Data Coordination Centre for standardised data input into a shared Data Resource. This will provide centralised and convenient access and visualisation of currently inaccessible zebrafish genomics datasets such as ChIP-seq, CAGE-seq, RNA-seq, ATAC-seq, Ribo-seq, meDIP-seq. As a first step, published data is being reanalyzed for GRCz10 by standard pipelines with the aim that data will become mineable through a Track hub hosted by ZFIN and managed by DANIO-CODE partners. Identification and generation of common policies on data standardization, integration and data sharing. Seeking and obtaining funding through joint ventures for genomics resource generation and management identified as community priorities Among the future aims is to establish a consortium agreement and a platform for sharing unpublished data and to work together in community inspired annotation efforts. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
Impact Most of all published zebrafish epigenetic datasets up to 2terabyte of data have been uploaded to the database. This provides the basis for standardisation, reanalysis and dissemination in a centralised track hub for use by the wider community. 
URL https://danio-code.zfin.org
 
Title Track Hub 
Description A truck hub of published epigenome datato be released at the UCSC server of genome browsers and which has been generated in collaboration with the BBSRC co-PI, Professor Boris Lenhard's group. Details: hub ZebrafishPromoterome shortLabel Promoterome CAGE and nucleosome positioning longLabel Zebrafish promoterome based on Haberle et al. and Nepal et al. genomesFile genomes.txt email zebrafish.promoterome@gmail.com descriptionUrl http://trackhub.genereg.net/promoterome/danRer7/index.html 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
Impact totally new track hub, impact expected in the future 
URL http://trackhub.genereg.net/promoterome/danRer7/index.html
 
Description Collaboration with Juanma Vaquerizas 
Organisation Max Planck Society
Department Max Planck Institute for Molecular Biomedicine
Country Germany 
Sector Academic/University 
PI Contribution We led the project for characterisation of chromatin states and transcription in early embryos with focus on primordial germ cells.
Collaborator Contribution JV contributed with RNA sequencing analysis support and more recent with offering Hi-C technology to analyse chromatin conformation in whole embryos and primordial germ cells.
Impact Germ plasm localisation dynamics mark distinct phases of transcriptional and post-transcriptional regulation control in primordial germ cells Fabio M. D'Orazio, Piotr Balwierz, Yixuan Guo, Benjamín Hernández-Rodríguez, Aleksandra Jasiulewicz, Juan M. Vaquerizas, Bradley Cairns, Boris Lenhard, Ferenc Müller bioRxiv 2020.01.12.903336; doi: https://doi.org/10.1101/2020.01.12.903336
Start Year 2016
 
Description Collaboration with Professor Boris Lenhard (co-PI) on the grant "Promoter-associated histone modifications..." 
Organisation Imperial College London
Department School of Public Health
Country United Kingdom 
Sector Academic/University 
PI Contribution This collaboration is a core partnership in our BBSRC-funded project. Our laboratory in Birmingham carries out the wet lab aspects of the programme and carries out epigenome characterisation experiments on zebrafish embryos. These experiments include genome wide analysis of histone post translational modifications characteristic to promoters and enhancers by ChIP sequencing technologies specialising on small cell number protocol (H3K27me3, H3K4me3), as well as promoter associated transcriptiomics by CAGE sequencing and open chromatin analyses by ATAC sequencing. Prof Lenhard contributes by computational biology analysis of the wet lab data generated by our laboratory.
Collaborator Contribution Prof Lenhard's group provides computational analysis of the wet la data generate in our laboratory. They map sequenced reads on to the zebrafish genome, carry out peak calling and identify epigenetic features using computational tools. They provide quality control analysis for our experimental designs and provide computation validation of hypotheses which are the result of joint discussions as well as those initiated by either of the two parties.
Impact Outcomes: Publications: Transcriptional, post-transcriptional and chromatin-associated regulation of pri-miRNAs, pre-miRNAs and moRNAs. Nepal C, Coolen M, Hadzhiev Y, Cussigh D, Mydel P, Steen VM, Carninci P, Andersen JB, Bally-Cuif L, Müller F, Lenhard B. Nucleic Acids Res. 2016 Apr 20;44(7):3070-81. doi: 10.1093/nar/gkv1354. PMID: 26673698 Free PMC Article An atlas of active enhancers across human cell types and tissues. Andersson R, Gebhard C, Miguel-Escalada I, Hoof I, Bornholdt J, Boyd M, Chen Y, Zhao X, Schmidl C, Suzuki T, Ntini E, Arner E, Valen E, Li K, Schwarzfischer L, Glatz D, Raithel J, Lilje B, Rapin N, Bagger FO, Jørgensen M, Andersen PR, Bertin N, Rackham O, Burroughs AM, Baillie JK, Ishizu Y, Shimizu Y, Furuhata E, Maeda S, Negishi Y, Mungall CJ, Meehan TF, Lassmann T, Itoh M, Kawaji H, Kondo N, Kawai J, Lennartsson A, Daub CO, Heutink P, Hume DA, Jensen TH, Suzuki H, Hayashizaki Y, Müller F; FANTOM Consortium., Forrest AR, Carninci P, Rehli M, Sandelin A. Nature. 2014 Mar 27;507(7493):455-61. doi: 10.1038/nature12787. PMID: 24670763 Free PMC Article Two independent transcription initiation codes overlap on vertebrate core promoters. Haberle V, Li N, Hadzhiev Y, Plessy C, Previti C, Nepal C, Gehrig J, Dong X, Akalin A, Suzuki AM, van IJcken WF, Armant O, Ferg M, Strähle U, Carninci P, Müller F, Lenhard B. Nature. 2014 Mar 20;507(7492):381-5. doi: 10.1038/nature12974. Dynamic regulation of the transcription initiation landscape at single nucleotide resolution during vertebrate embryogenesis. Nepal C, Hadzhiev Y, Previti C, Haberle V, Li N, Takahashi H, Suzuki AM, Sheng Y, Abdelhamid RF, Anand S, Gehrig J, Akalin A, Kockx CE, van der Sloot AA, van Ijcken WF, Armant O, Rastegar S, Watson C, Strähle U, Stupka E, Carninci P, Lenhard B, Müller F. Genome Res. 2013 Nov;23(11):1938-50. doi: 10.1101/gr.153692.112.
Start Year 2014
 
Description collaboration with Dr Brad Cairns, HHMI Investigator, University of UTAH 
Organisation University of Utah
Country United States 
Sector Academic/University 
PI Contribution This collaboration was initiated to complement our BBSRC supported work to analyse the chromatin states in early embryos in preparation for genome activation. We have carried out histone modification analysis before and after genome activation in whole zebrafish embryos and in FACS sorted primordial germ cells.
Collaborator Contribution Together with Dr Cairns we have analysed DNA methylome data in whole embryos during early development and in primordial germ cells. The analysis indicates that unlike in mammals, in fish the primordial germ cells do not show remodelling of the DNA methylome during their first stages of specification. The outcomes of this collaboration complement the findings made by the BBSRC funded project and will be presented in a co-authored peer reviewed paper which is currently being prepared. The manuscript acknowledges BBSRC support.
Impact Invited speaker presentation by FM at the forthcoming EMBO Workshop entitled Awakening of the Genome: the maternal to zygotic transition , Prague, 15-18 May. Germ plasm localisation dynamics mark distinct phases of transcriptional and post-transcriptional regulation control in primordial germ cells Fabio M. D'Orazio, Piotr Balwierz, Yixuan Guo, Benjamín Hernández-Rodríguez, Aleksandra Jasiulewicz, Juan M. Vaquerizas, Bradley Cairns, Boris Lenhard, Ferenc Müller bioRxiv 2020.01.12.903336; doi: https://doi.org/10.1101/2020.01.12.903336
Start Year 2016
 
Description School visit (Bewdley, Worcestershire) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Schools
Results and Impact During Science week 1 day was spent at St Anne's primary school, Bewdley, where i introduced embryo development, and use of zebrafish in research to over 90 year 5-6 pupils (9-11 years old). Additionally we carried out experiments with preparing DNA from Strawberries and
Year(s) Of Engagement Activity 2018