Dynamics of global chromatin landscape through the cell cycle and differentiation

Lead Research Organisation: CARDIFF UNIVERSITY
Department Name: School of Biosciences

Abstract

The genomes of all higher organisms are packaged in a form known as chromatin, a combination of the DNA and proteins that are bound to it. This packaging of DNA into chromatin is essential because each individual chromosome consist of a single molecule of DNA up to several centimeters long but which must to be packaged up in a cell nucleus only a few micrometers across. Indeed the total DNA length in a normal human cell is about 2m. At the same time, the DNA must be accessible when required to allow a controlled expression of genes, or the copying of DNA during the cell division cycle. The basic unit of chromatin packaging is known as the nucleosome, which consists of a core of special highly conserved proteins known as histones around which the DNA is wrapped approximately twice. Nucleosomes appear to be located in specific positions, particularly around important features such as genes. It has become clear that many characteristics of organisms, and some diseases, are affected by so-called epigenetic modifications, resulting from changes in the chromatin or modification of the DNA rather than changes in the underlying DNA sequence itself. Hence understanding chromatin is of fundamental importance in many biological processes.

In addition to nucleosomes, there are many other complexes of proteins that bind to DNA for a number of purposes, particularly the controlled expression of genes. The pattern of nucleosomes and other proteins binding to DNA can be visualised by using enzymes that cut DNA only where no proteins are bound. Regions that are protected are not cut, and digesting chromatin with such enzymes generates many millions of DNA fragments. These are predominantly the size of single or multiple nucleosomes.

We have developed a new technology that can simultaneously determine all such protected regions across the entire genome, and hence map every nucleosome and other DNA-bound protein complex to generate a "chromatin landscape". This makes use of a high throughput sequencing to generate DNA sequence from both ends of around 180 million such fragments simultaneously. This data is then used to map the endpoints of the fragments onto the genome sequence, thus locating the positions of the bound proteins through regions with few cut ends. We have demonstrated this technology using the model plant Arabidopsis. In this proposal will use this new approach to understand how chromatin structure changes both through the cell cycle as cells divide and as cells differentiate to adopt new characteristics. This analysis will be carried out in both cell culture and in cells from intact tissues, allowing a detailed understanding of how chromatin changes occur during these processes. Specifically we will study how chromatin structure in Arabidopsis changes in response to light and how it is different in root and shoot cells. We will also investigate how this pattern is changed in a specific mutant in which chromatin structure is altered and in which differentiation is affected. As a result, we will develop a detailed understanding of the organization of chromatin in a higher organism, and how this is dynamically altered as the fate of cells changes during the processes of development. These techniques and approaches will be applicable to all eukaryotic organisms and will be of great significance in progressing our understanding of the genome.

Technical Summary

Packaging of eukaryotic genomes into chromatin and the positioning of nucleosomes affects every process that occurs on DNA, as the nucleosome modifies or occludes access to underlying DNA sequences. We understand little of the changes in chromatin structure and occur in higher eukaryotic cells, for example during the cell cycle or during the processes of differentiation. Almost nothing is known of the chromatin structure of cells in a tissue context in terms either of nucleosome position or of the global changes in DNA-bound transcription factor complexes. We have established, for the 1st time in a higher eukaryote, a technique we call "chromatin landscaping" that allows us to map with base pair resolution the position of all DNA associated protein complexes across the genome. We have developed this for the model higher plant Arabidopsis because of its small genome size and excellent range of genomics tools and data. We will apply this technique first to understanding chromatin changes during the cell cycle in cultured cells. We will correlate this with detailed data on histone modification being obtained by our collaborators in the same cell type. We will then apply it to cells isolated from intact tissues by flow sorting, allowing us to compare mitotic cells and differentiated cells of both roots and shoots. Finally we will compare the chromatin maps between wild-type and a mutant in a transcription factor known to be associated with cell differentiation and to bind to multiple sites across the genome. Chromatin changes will be correlated with chromatin immunoprecipitation data and RNA profiles. This will (1) provide benchmark high resolution maps of nucleosome positioning in key developmental systems in Arabidopsis; (2) provide new understanding of chromatin changes associated with the cell cycle and cell differentiation, (3) establish how changes in differentiation caused by the binding of a specific factor are reflected in chromatin structure.

Planned Impact

This research will enhance our understanding of how the genome functions and how key biological processes are controlled. It will also develop and demonstrate new techniques of genome analysis that can be applied to other organisms, including humans and crops, as well as to basic research in other organsisms.

There are three main groups of beneficiaries: Academic research scientists, industry research and development scientists, the general public and schools.

Academic research scientists will benefit from a greatly increased understanding of genome structure and function. More specifically, a detailed map of chromatin will be developed which will provide the benchmark for all future work in this field. The map will locate the positions of nucleosomes, the basic particle of DNA packaging, as well as other protein complexes bound to the DNA. Importantly as a result of a close collaboration with two groups in the US, this map will be integrated with maps of modifications to the histone subunits of the nucleosomes. In addition to the benchmark map, a detailed understanding of how this varies in different conditions and cell types will be generated. For example, plant cells respond in a profound and general way to light, and the changes that occur in cells in response to light will be defined. When DNA is replicated as cells divide, profound changes may also take place- these are currently unknown and will be defined. Finally as cells take on specialized characteristics and differentiate, it has been known for many years that changes occur in their chromatin, but these have not yet been defined across the whole genome.

Both academic research scientists, and industry based research scientists will benefit from the new techniques developed to analyse chromatin and from easy-to-use software pipelines that can then be applied in other organisms and systems. Hence the technology can be applied in crops or humans to generate a much more detailed understanding of the epigenome- differences that affect how genes are expressed, and which can have profound effects on crop growth or disease responses in humans. Modification of epigenetics has the potential to serve as a new route to the treatment of human diseases and epigenetic modifications are also key mediators of cell fate, with implications for cellular therapies. Understanding chromatin organization is thus of key importance in important applied areas, and chromatin particle spectrum analysis as developed in this grant can make an important contribution. These applications can help improve commercially relevant research in epigenome related areas, in which the UK is a global leader through the research activities of companies such as Pfizer.

More generally this research integrates computational approaches and biology. Significant training potential is foreseen for the two PDRAs. PDRA1 will gain expertise in a broad spectrum state-of-the-art genomics techniques including flow sorting, chromatin analysis, ChIP and RNA Seq with an understanding of the bioinformatics pipeline. PDRA2 will be a computer scientist gaining an understanding and skills for biological application. The development of a skilled workforce with the ability to work collaboratively across discipline boundaries is central to UK Government and European strategies for building the future knowledge based bio-economy, and thus contributes both to UK competitiveness and wealth creation potential.

The general public, and school pupils in particular can benefit from this research. It is easy to understand that since the total DNA content of a human cell is around 2m in length and is enclosed in a nucleus of a few micrometers. Understanding how genes are organized provides an accessible way to discuss the problem of how we can investigate this problem. Engaging with school pupils will provide the staff employed on the project with new didactic and presentation skills.
 
Description All eukaryotic genomes are packaged by proteins as chromatin. The DNA is packaged by regularly patterned nucleosomes around which the DNA is wound at a scale of a few hundred base pairs. In addition there are smaller protein particles associated with the DNA- so-called sub-nucleosomal-sized protein structures such as mobile and labile transcription factors (TF) and initiation complexes. Together these represent a dynamic chromatin landscape that changes as the environmental conditions change. Whilst details of nucleosome position in Arabidopsis have been analysed before, there is less understanding of their relationship to more dynamic sub-nucleosomal particles (subNSPs) defined as protected regions shorter than the ~150bp typical of nucleosomes. The genome-wide profile of these subNSPs has not been previously analysed in plants and this study investigates the relationship of dynamic bound particles with transcriptional control. We combined differential micrococcal nuclease (MNase) digestion and a modified paired-end sequencing protocol to study the chromatin of Arabidopsis cells across a wide size range. We investigated the response of the chromatin landscape to changes in environmental conditions using light and dark growth, given the large transcriptional changes resulting from this simple alteration. The resulting shifts in the suites of expressed and repressed genes show little correspondence to changes in nucleosome positioning, but led to significant alterations in the profile of subNSPs upstream of TSS both globally and locally. We concluded that wide-spectrum analysis of the plant genomes by differential MNase digestion allows detection of sensitive features previously obscured, and the comparisons between genome-wide subNSP profiles reveals dynamic changes in their distribution. The method allows insight into the complex influence of genetic and extrinsic factors in modifying the chromatin in association with transcriptional changes during the responses to environmental changes and challenges.
Exploitation Route This was a basic research project, and provides new methods and insights on the use of wide-spectrum chromatin particle analysis for observing dynamic changes in the genome.
Sectors Agriculture

Food and Drink

URL http://orca.cf.ac.uk/104737/
 
Description The work carried out in this award was a major factor in the career development of early career researchers. One of the postdocs on the grant was appointed to a lectureship on the basis of the work carried out. One of the academics (co-I) was able to apply for promotion based on the work carried out. The work was also used for a successful grant application to the Royal Society.
First Year Of Impact 2016
Sector Education
Impact Types Cultural

 
Title Genome-wide chromatin mapping with size resolution reveals a dynamic sub-nucleosomal landscape in Arabidopsis 
Description Accession PRJNA369530; GEO: GSE94377. Linked to Pass DA et al., "Genome-wide chromatin mapping with size resolution reveals a dynamic sub-nucleosomal landscape in Arabidopsis.", PLoS Genet, 2017 Sep;13(9):e1006988. 16 datasets of paired end Illumina sequence of chromatin mapping as published, 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
Impact Background: Analysis of the effect that chromatin structure has on the expression patterns of eukaryotic genes has recently expanded knowledge of the complex influence genome accessibility has on genome function. Interlaced with regular nucleosomal patterning are other mobile and labile sub-nucleosomal-sized protein structures bound to the genome such as transcription factors (TF), initiation complexes, and modified nucleosomes. Results: We present chromatin structure maps of the A. thaliana Col-0 wild-type (in vitro cell culture) combining differential micrococcal nuclease (MNase) digestion and analysis by paired-end next-generation sequencing with RNASeq data to provide insight into the sensitive genomic regions and variable DNA-bound particle size related to transcriptional activity. The landscape of sub-nucleosomal sized particles (subNSPs) has been investigated, revealing the DNA-binding complex positioning genome-wide. We observe the dynamic changes in the distribution of these complexes surrounding genomic features, particularly at transcription start sites (TSS). Differential digestion reveals the presence of MNase-sensitive smaller particles and a labile -1 nucleosome upstream of the TSS of active genes. These changes correlate with gene expression differences resulting from different environmental conditions of light- or dark growth both globally and locally, and a complex profile of bound particles is directly visualised. Conclusions: Analysis of the A. thaliana genome reveals that resolution of chromatin particle size by differential MNase digestion allows detection of the sensitive features in chromatin structure that have hereto been obfuscated. The concomitant analysis of transcript levels reveals the impact of transient extrinsic factors in modifying the sub- nucleosomal landscape in association with transcriptional changes, giving insight into the binding of transcription-associated factors. Overall design: 2x Light grown, 2x Dark grown Arabidopsis thaliana Col0 under High or Low MNase digestion with 4x associated RNAseq per condition. 
URL https://www.ncbi.nlm.nih.gov/bioproject/PRJNA369530
 
Description Collaboration with North Carolina State University 
Organisation North Carolina State University
Country United States 
Sector Academic/University 
PI Contribution A collaboration with Prof Linda Hanley-Bowdoin and Prof Bill Thompson at NCSU provided access to their expertise in sorting of plant cells at different stages of the cell cycle. A team member went to NC to learn the techniques. I was also involved at NCSU in chairing the International Review Panel of an NSF grant that the US team held using similar technology. This involved annual visits by the PI to NCSU to review progress on their grant, which cemented the relationship and gave an in depth understanding of the technology, its capabilities and limitations.
Collaborator Contribution They provided expertise and training as outlined above.
Impact We have previously published together (PLoS Genet. 2010 Jun 10;6(6):e1000982. doi: 10.1371/journal.pgen.1000982), but no further joint publications during this grant period. However the interaction / collaboration assisted in generating the data for the paper describing the main outputs of this grant (PLoS Genet. 2017 Sep 13;13(9):e1006988. doi: 10.1371/journal.pgen.1006988.)
Start Year 2006