Developing methods and bioinformatics tools for the global analysis of accessible regions in chromatin

Lead Research Organisation: University of Leeds
Department Name: School of Medicine

Abstract

One of the great challenges in modern biology is to understand why genetic information is differentially expressed in different cell types. Once we have a firm grip on this, we should be able to reprogram any cell of the body, and turn differentiated cells into stem cells able to become any cell type. That such reprogramming is indeed possible is exemplified by the finding that certain chromosomal translocations produce aberrant transcription factors which turn normal haematopoietic precursor cells into malignant leukaemic cells that only remotely resemble normal cell types. In addition, it was recently shown that only four transcription factors are sufficient to reprogram human fibroblast cells into stem cells. However, leukaemic cells take a long time to develop and it is known that often secondary events, i.e. the activation of other genes are required to fully turn normal cells into cancer cells. Also fibroblast reprogramming occurs with low efficiency, indicating that more than the targets of these four factors need to be activated. If we had methods where we could precisely identify regions of accessible chromatin, we could target these regions using either sequence specific transcription factors or small DNA-binding molecules. In addition, diagnosing regions of chromatin about to be opened up during cell differentiation will greatly enhance our understanding of which genes are activated in development and how. In this proposal we will use highly sensitive techniques enabling identification of such regions. The development of this technology needs a special effort, because it will require the close collaboration between experimental scientists and experts in handling and analysing large molecular data sets. The reason for this is that the experimental scientists will generate vast amounts of DNA sequence data which will then be puzzled together by the bioinformaticians to home in on those sequences in the mouse or human genome that are accessible. This task is anything but trivial.

Technical Summary

This proposal will use high-throughput parallel DNA sequencing to identify regions in chromatin that show an increased accessibility to cleavage with DNaseI. Work from this and other laboratories has shown that the chromatin of genes can be reorganised long before the onset of gene expression. Moreover, we showed that such regions can overlap with cis-regulatory elements destined to become activated in development. Also, during gene silencing, genes that were once active are only completely epigenetically silenced after many cell generations, and often still display regions of reorganised chromatin that have to be actively repressed. Work by others and us has shown that the fact that the chromatin of lineage/specific genes in apparently differentiated cells of alternate lineages is still accessible and plastic is the most likely reason why certain cell types can be reprogrammed by over-expressing specific normal or aberrant transcription factors. However, these observations were made at individual genes and it is unknown to what extent this applies to all the genes that are important to establish a specific differentiated phenotype. To establish the technology necessary to tackle these questions, we are now seeking funds to establish highly sensitive high-throughput approaches that identify accessible regions in chromatin using Solexa clonal sequencing. This will require the close collaboration between the experimental scientists of the lab of Constanze Bonifer and a bioinformatician from the group of David Westhead. Such a collaboration is necessary because it will require writing specific software and developing new algorithms to annotate these regions in the genome and calculate relative differences in DNaseI cutting frequency.

Publications

10 25 50

publication icon
Lichtinger M (2010) Chromatin regulation by RUNX1. in Blood cells, molecules & diseases

 
Description We developed a novel way of studying gene activity
Exploitation Route The results were not published
Sectors Other

 
Title Multiple ChIP-Seq, DNaseI-Seq and gene expression data-sets 
Description ChIP-Seq data for Sp1 transcription factor 
Type Of Material Database/Collection of data 
Year Produced 2014 
Provided To Others? Yes  
Impact Data were down-loaded by multiple users and placed in public repositories, such as CODEX 
URL http://codex.stemcells.cam.ac.uk/