📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

Single molecule analysis of Human DNA replication

Lead Research Organisation: Earlham Institute
Department Name: Research Faculty

Abstract

All cells contain a complete copy of the organism's DNA, the genetic blueprint of life, packaged into discrete units called chromosomes. Since new cells need a copy of the genetic material, the chromosomes must be completely and accurately replicated before the cell can divide. This requires the process of DNA replication to start at thousands of sites across the chromosomes - called DNA replication initiation sites. Our project aims to determine the location of replication initiation sites in human cells. This is important because the location and distribution of replication initiation sites have been implicated in causing human diseases such as cancer.
It has been challenging to identify DNA replication initiation sites in human cells, because there is a lot of variability between cells. To date, most experiments have used the average from millions of cells, but this hides the variability between cells. We have developed a novel technology that identifies DNA replication initiation sites on thousands of single molecules that each originated in a single cell. Our approach can also identify specific sequences that are challenging to copy. This will allow us to test the hypothesis that some sites, thought to be involved in replication initiation, in fact are sites that impede DNA replication. Distinguishing between sites that initiate versus impede DNA replication will be crucial in understanding the causes of genetic instability that underlie some human diseases.

Technical Summary

Complete, accurate genome replication is essential for life. Our long-term goal is to determine how cells faithfully complete genome replication. Complete genome replication is achieved by initiating DNA replication at multiple discrete sites. Both the characteristics of replication initiation sites and subsequent interruptions to replication have been implicated in human genetic and phenotypic variation. The location and distribution of replication initiation sites have been proposed to underlie genome instability and the biogenesis of tumorigenic translocations. However, widely used methods for detecting replication initiation sites in mammalian cells produce inconsistent results. Therefore, there is no consensus of where in the human genome DNA replication initiates. To tackle this problem, we have developed a single molecule sequencing technology that identifies replication initiation sites. Our preliminary data indicates that ~80% of initiation sites across the human genome are dispersed with only ~20% within zones that could be detected by existing population-level genomic approaches. In this project, we will comprehensively identify the location and nature of replication initiation sites in the human genome. In addition, we will test the hypothesis that some GC-rich sequences reported to direct replication initiation in fact are attributable to fork arrest and replication restart. Distinguishing between replication initiation and replication restart is crucial since such sites have been implicated in the biogenesis of tumorigenic translocations.

Publications

10 25 50
 
Title Most human DNA replication initiation is dispersed throughout the genome with only a minority within previously identified initiation zones 
Description File names indicate: The cell lines used (HeLa-S3 denoted as HeLa; hTert-RPE1 denoted as RPE1) as the first four characters of the file name; TE in the file name indicates that the sample was subjected to nCATS target enrichment sequencing (all other files are from genome-wide sequencing); BAM files include both alignment (to the GRCh38 human genome assembly) and BrdU modification probabilities in the mod.bam format; BED files list the location of left/right replication forks; initiation sites; termination sites as indicated.   Experimental protocol: HeLa-S3 (adherent) and hTERT-RPE1 were maintained in DMEM Glutamax (HeLa-S3) or DMEM/F12 Glutamax (hTERT-RPE1, both Gibco), with the addition of 10% foetal bovine serum (Sigma) and 1% penicillin/streptomycin (Gibco). Cells were maintained at 70% confluency in 5% CO2 at 37 °C. Cells were treated with sequential addition of 0.5 µM BrdU every 2.5 minutes until 12 µM, followed by incubated for a further 1 hr at 12 µM BrdU High molecular weight genomic DNA extraction protocol For the target enrichment (TE; via nCATS) samples genomic DNA was prepared for sequencing with this protocol: dx.doi.org/10.17504/protocols.io.bmi5k4g6 Libraries were prepared with ONT Ligation Sequencing Kit (SQK-LSK109 or SQK-ULK001)    Data processing steps: Basecalling was perform with guppy v6.1.5 Aligned reads to hg38 reference using minimap2 v2.17 Detected BrdU in reads using DNAscent v2.0.2 BrdU probabilities converted to mod.bam format with detect_to_modBAM script BrdU gradients, replication fork directions, replication initiation and replication termination sites identified with a custom Rscript 
Type Of Material Database/Collection of data 
Year Produced 2024 
Provided To Others? Yes  
Impact None. 
URL https://zenodo.org/doi/10.5281/zenodo.10827585
 
Description Earlham-NIG, Japan 
Organisation National Institute of Genetics
Country Japan 
Sector Academic/University 
PI Contribution Single molecule analysis of DNA replication in human cells using cell lines where PrimPol can be depleted.
Collaborator Contribution Generation of human cell line where PrimPol can be depleted.
Impact None
Start Year 2024