Understanding supercoiling-dependent DNA recognition: a combined experimental and computational approach

Lead Research Organisation: John Innes Centre
Department Name: Biological Chemistry

Abstract

It is well known that DNA is the molecule of heredity and that the sequence of bases in DNA encodes the genetic information that defines an organism. DNA itself is generally organised in highly compacted structures, in particular supercoils in which the helical axis of DNA is coiled about itself. Supercoiling has a dramatic effect on the expression of genes and has been shown to act as a global regulator of gene expression. Our understanding of how supercoiling influences the behaviour of DNA is far from complete, particularly at the molecular level. Therefore we have developed a model system involving small DNA circles that we can analyse both by experimental methods and computer modelling. By combining these experimental and theoretical approaches we aim to understand how DNA recognition is affected by supercoiling, and how we can use supercoiling to influence biological processes.

Technical Summary

The strength and specificity of many DNA-protein interactions is sensitive to DNA supercoiling, to the extent that E. coli has been shown to re-programme its transcriptome by modulating the superhelical state of its genome. However, we still have little understanding of how such control mechanisms operate. Despite the numerous DNA and protein-DNA structures available in the Protein Database, which have shown the importance of DNA structure and flexibility in DNA recognition, there is almost no structural information on supercoiled DNA because this is more difficult to study using traditional methods such as crystallography or NMR. We will develop tools that allow supercoiling-dependent DNA recognition to be studied in a highly controllable way. We will then use these tools to understand the physical principles that underlie supercoiling-dependent DNA recognition, so that it becomes possible to predict supercoiling-dependency in genome-level studies. The methodology uses specially designed small DNA circles whose structure and superhelical state is both predictable and controllable. Physical insight will come from integrated molecular dynamics (MD) simulations. We will employ three tractable systems to test our methodology: 1) DNA minicircles - these have controllable sequences and levels of supercoiling, and are sufficiently small to be amenable to atomistic simulation. 2) DNA triplexes - we aim to measure how DNA triplex formation is affected by supercoiling the DNA circle and to use triplexes as agents to probe and perturb DNA structure. 3) Phage 434 repressor - this is a paradigm example of a genetic switch that is sensitive to changes in DNA supercoiling and topology, which we will use as a model system of DNA-protein interaction to study supercoiling-dependent recognition.

Planned Impact

Who will benefit? 1. Academic researchers in the areas of ligand-DNA and protein-DNA interactions, and scientists interested in systems biology, i.e. how does supercoiling influence global gene expression (see Academic Beneficiaries). 2. Pharmaceutical companies involved in antibacterial drug discovery will potentially benefit from a broader understanding of transcriptional control in bacteria. 3. Biotechnology companies who may be able to exploit the properties of small DNA circles revealed by this research. 4. The drug-design community who will benefit from the improved theoretical understanding of molecular recognition. 5. The general public in terms of gaining a better understanding of how DNA functions (through our outreach activities) and of the applications of high performance supercomputing. How will they benefit? a. Improved understanding of ligand-DNA and protein-DNA interactions and of molecular recognition in general. b. New ideas for drug discovery. c. New ideas for bioengineering and synthetic biology What will be done to ensure that they have the opportunity to benefit from this research? i. Publication in high-profile journals and presenting the findings at relevant conferences (such as Topo2011 and the annual Biophysical Society meeting). ii. Media involvement via JIC TOC Comms. iii. Collaborations with UK and overseas partners. iv. Investigation of exploitation potential via PBL. v. Establishment of BBSRC-CASE studentships based on work stimulated by this project. vi. Continued engagement with local schools.

Publications

10 25 50
 
Description We have set up a model system to understand the effect of supercoiling on the interaction of oligonucleotides and proteins with DNA. This has produced new understanding, of, for example, DNA triplex formation in small DNA circles and the effects of topology on these interactions. Essentially the advances have been methodological and will be utilised by us and others for new work.
Exploitation Route Methods for looking at the effects of supercoiling on DNA-associated processes and how to make small DNA circles. Atomistic simulation methods for supercoiled DNA. This work has provided the theoretical and experimental underpinning for looking at interactions of other molecules with DNA. This has led to new collaborations.
Sectors Pharmaceuticals and Medical Biotechnology

 
Description Ideas about making and using small DNA circles have been taken up by two companies: Twister and Inspiralis
First Year Of Impact 2014
Sector Pharmaceuticals and Medical Biotechnology
Impact Types Economic

 
Description Tackling tricky twists - how does DNA gyrase function inside living cells?
Amount £379,684 (GBP)
Funding ID BB/R001235/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 03/2018 
End 02/2021
 
Title Atomic force microscopy and atomistic molecular dynamics simulation data to resolve structures of negatively-supercoiled DNA minicircles at base-pair resolution. 
Description This data record consists 2 zipped folders: Full AFM raw data set.zip, and Source data .zip.The zipped folder Full AFM raw data set.zip contains all raw AFM data including repeats and experiments carried out in alternative conditionsThe primary subfolder names correspond to the method of DNA immobilisation:Nickel - use of 3 mM NiCl2 in Ph7.4 20 mM HEPES bufferPLLNaOAc - use of PLL and pH 5.4 50 mM NaOAc bufferHR images - high resolution images, obtained also using the nickel conditions.The secondary subfolder names correspond to the superhelical density as shown in figure 3 in the article, and these contain the raw AFM images as .spm isles, the sub folders within those are created by the program TopoStats, and are processed data from the raw AFM images. File formats included in the zipped folder: .spm, .tiff, .json, .txt and .pdf.The zipped folder Source data .zip comprises all relevant data, pdbs of all the structures depicted in the paper obtained from simulations and AFM. See below for details on each sub folder within Source data 2.zip. Each folder contains the data used to generate each figure ad supplementary figure in the article. Figure 1: AFM data: the AFM raw files for the high-resolution images shown in figure 1, and calculations of their aspect ratios as aspectratiomanual.xlsxAFM movie: the AFM raw files for the time-lapse images shown in figure 1.MD data: the MD images used for the high-resolution images shown in figure 1 and .tar files - the MD files used to generate the snapshots MD movie: the MD snapshots files for the time-lapse images shown in figure 1 and .tar files - the MD files used to generate the snapshotsFile formats included in the Figure 1 sub folder: 0## files where ## represent numbers, .gwy, .txt, .eps, .mpg and .xlsx.Figure 2: Kink and defect measurements - the measured bend angles shown in Fig 2 and an AFM image showing how the FAM bends were measuredMD Radgyr Writhe - measurements of radius of gyrations and writhe for each topoisomer.tar files - the MD files used to generate the snapshots in 2a.txt file - the profile shown in fig 2bFile formats included in the Figure 2 sub folder: .tiff, .txt and .datFigure 3: The subfolder names correspond to the superhelical density as shown in figure 3, and these contain the raw AFM images as '.spm' isles, the sub folders within those are created by the program TopoStats, and are processed data from the raw AFM images. The '.json' file contains the data used to make the plots shown in Figure 3File formats included in the Figure 3 sub folder: .spm, .tiff, .txt, .json and .pdfFigure 4: '.dat' files contain information from MD simulations used to create the subfigure they are labelled with.The '.spm' and '.037' files are the raw AFM images used in this figure.The .tar files are MD simulations data used to generate the snapshots shown in figure 4.File formats included in the Figure 4 sub folder: .spm, .txt, .pdf and .datFigure S1: Simulations data generated using the SerraLine program, showing the average and maximum deviations from planarity in relative and absolute numbers.Data were plotted suing the distributions_plot.py script.File formats included in the Figure S1 sub folder: .csv, .pdf, and .txtFigure S2a: MD measurement of the writhe over time as a '.dat file' and snapshots as '.pdb' files. File formats included in the Figure S2a sub folder: .pdb and .dat.Figure S2b: MD measurement of the writhe over time as a '.dat file' and snapshots as '.pdb' files.File formats included in the Figure S2b sub folder: .pdb and .dat.Figure S3: The AFM and MD measurements of bending angles including all profiles for MD simulations, generated using Serraline A, FM images and measurements in the form '251angles' '339 angles'.File formats included in the Figure S3 sub folder: .tiff, .txt and .pdb.Figure S4: AFM length analysis of the position of the triplex on linearised minicircles. 'Csv' file contains the length data measured by hand using the IMOD software.Plots: plots of the data raw AFM data: AFM data files used in the analysisFile formats included in the Figure S4 sub folder: .csv, .xlsx, .pdf and 0## files where ## represent numbers.Figure S5: Surface plasmon resonance (SPR) data show the effect of ions on the affinity of the triplex for varying superhelical densities of DNA minicircles, plotted using the script 'sprplot'. '.pdf' files are the plots of the various excel files.File formats included in the Figure S5 sub folder: .json, .pdf and .xlsx.Figure S6: SPR data in showing the affinity of the triplex for varying superhelical densities of DNA minicircles, plotted using the script 'sprplot'. '.pdf' files are the plots of the various excel files.File formats included in the Figure S6 sub folder: .json, .pdf, .xlsx and .pdf.Figure S7: An MS '.tar' file containing the snapshots shown in figure S7File formats included in the Figure S7 sub folder: .pdbFigure S8: AFM data used in figure s8, the '.gwy' files are AFM images of the wide view, and each of the time-lapse images. The '.txt' files are the profiles taken in those images and plotted in the figure.File formats included in the Figure S8 sub folder: .gwy and .txt.Figure S9: Simulations data showing the difference between the OL$ and BSC1 forcefields.File formats included in the Figure S9 sub folder: .datSimulations: The simulations data File formats included in the Simulations sub folder: .gro and .xtcSupp videos: The supplementary videosFile formats included in the SuppVideods sub folder: .pdb and .mpgSoftware needed to access data: 20151103_251_NAT_17ng_Ni_20mm_052DX.058 or AFM_339_TFO_HR_cs.037, spm files & all files included in the "Raw AFM data" sub folder - Gwyddion, Nanoscope Analysiseps files - illustrator/ pdf software.mpg - any movie player.gro - gromacs files- GRO files may be viewed on a computer using a supporting HP calculator emulator, such as Emu48.xtc files - gromacs files- a suitable software like XTrkCADsee http://manual.gromacs.org/documentation/2018/user-guide/file-formats.html for more information on gromacs files.Study aims and methodology: In the cell, DNA is arranged into highly-organised and topologically-constrained (supercoiled) structures. It remains unclear how this supercoiling affects the detailed double-helical structure of DNA, largely because of limitations in spatial resolution of the available biophysical tools. In this study, the authors combined high-resolution atomic force microscopy (AFM) with molecular dynamics (MD) simulations to reveal how supercoiling affects global and local DNA conformation, structure and dynamics in DNA minicircles of length 250-340 bp. The following procedures are described in more detail in the related article: generation and purification of small DNA circles, preparation and analysis of different topological species of minicircles, S1 nuclease digestions, atomic force microscopy, atomistic simulations and surface plasmon resonance. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Atomic_force_microscopy_and_atomistic_molecular...
 
Title Atomic force microscopy and atomistic molecular dynamics simulation data to resolve structures of negatively-supercoiled DNA minicircles at base-pair resolution. 
Description This data record consists 2 zipped folders: Full AFM raw data set.zip, and Source data .zip.The zipped folder Full AFM raw data set.zip contains all raw AFM data including repeats and experiments carried out in alternative conditionsThe primary subfolder names correspond to the method of DNA immobilisation:Nickel - use of 3 mM NiCl2 in Ph7.4 20 mM HEPES bufferPLLNaOAc - use of PLL and pH 5.4 50 mM NaOAc bufferHR images - high resolution images, obtained also using the nickel conditions.The secondary subfolder names correspond to the superhelical density as shown in figure 3 in the article, and these contain the raw AFM images as .spm isles, the sub folders within those are created by the program TopoStats, and are processed data from the raw AFM images. File formats included in the zipped folder: .spm, .tiff, .json, .txt and .pdf.The zipped folder Source data .zip comprises all relevant data, pdbs of all the structures depicted in the paper obtained from simulations and AFM. See below for details on each sub folder within Source data 2.zip. Each folder contains the data used to generate each figure ad supplementary figure in the article. Figure 1: AFM data: the AFM raw files for the high-resolution images shown in figure 1, and calculations of their aspect ratios as aspectratiomanual.xlsxAFM movie: the AFM raw files for the time-lapse images shown in figure 1.MD data: the MD images used for the high-resolution images shown in figure 1 and .tar files - the MD files used to generate the snapshots MD movie: the MD snapshots files for the time-lapse images shown in figure 1 and .tar files - the MD files used to generate the snapshotsFile formats included in the Figure 1 sub folder: 0## files where ## represent numbers, .gwy, .txt, .eps, .mpg and .xlsx.Figure 2: Kink and defect measurements - the measured bend angles shown in Fig 2 and an AFM image showing how the FAM bends were measuredMD Radgyr Writhe - measurements of radius of gyrations and writhe for each topoisomer.tar files - the MD files used to generate the snapshots in 2a.txt file - the profile shown in fig 2bFile formats included in the Figure 2 sub folder: .tiff, .txt and .datFigure 3: The subfolder names correspond to the superhelical density as shown in figure 3, and these contain the raw AFM images as '.spm' isles, the sub folders within those are created by the program TopoStats, and are processed data from the raw AFM images. The '.json' file contains the data used to make the plots shown in Figure 3File formats included in the Figure 3 sub folder: .spm, .tiff, .txt, .json and .pdfFigure 4: '.dat' files contain information from MD simulations used to create the subfigure they are labelled with.The '.spm' and '.037' files are the raw AFM images used in this figure.The .tar files are MD simulations data used to generate the snapshots shown in figure 4.File formats included in the Figure 4 sub folder: .spm, .txt, .pdf and .datFigure S1: Simulations data generated using the SerraLine program, showing the average and maximum deviations from planarity in relative and absolute numbers.Data were plotted suing the distributions_plot.py script.File formats included in the Figure S1 sub folder: .csv, .pdf, and .txtFigure S2a: MD measurement of the writhe over time as a '.dat file' and snapshots as '.pdb' files. File formats included in the Figure S2a sub folder: .pdb and .dat.Figure S2b: MD measurement of the writhe over time as a '.dat file' and snapshots as '.pdb' files.File formats included in the Figure S2b sub folder: .pdb and .dat.Figure S3: The AFM and MD measurements of bending angles including all profiles for MD simulations, generated using Serraline A, FM images and measurements in the form '251angles' '339 angles'.File formats included in the Figure S3 sub folder: .tiff, .txt and .pdb.Figure S4: AFM length analysis of the position of the triplex on linearised minicircles. 'Csv' file contains the length data measured by hand using the IMOD software.Plots: plots of the data raw AFM data: AFM data files used in the analysisFile formats included in the Figure S4 sub folder: .csv, .xlsx, .pdf and 0## files where ## represent numbers.Figure S5: Surface plasmon resonance (SPR) data show the effect of ions on the affinity of the triplex for varying superhelical densities of DNA minicircles, plotted using the script 'sprplot'. '.pdf' files are the plots of the various excel files.File formats included in the Figure S5 sub folder: .json, .pdf and .xlsx.Figure S6: SPR data in showing the affinity of the triplex for varying superhelical densities of DNA minicircles, plotted using the script 'sprplot'. '.pdf' files are the plots of the various excel files.File formats included in the Figure S6 sub folder: .json, .pdf, .xlsx and .pdf.Figure S7: An MS '.tar' file containing the snapshots shown in figure S7File formats included in the Figure S7 sub folder: .pdbFigure S8: AFM data used in figure s8, the '.gwy' files are AFM images of the wide view, and each of the time-lapse images. The '.txt' files are the profiles taken in those images and plotted in the figure.File formats included in the Figure S8 sub folder: .gwy and .txt.Figure S9: Simulations data showing the difference between the OL$ and BSC1 forcefields.File formats included in the Figure S9 sub folder: .datSimulations: The simulations data File formats included in the Simulations sub folder: .gro and .xtcSupp videos: The supplementary videosFile formats included in the SuppVideods sub folder: .pdb and .mpgSoftware needed to access data: 20151103_251_NAT_17ng_Ni_20mm_052DX.058 or AFM_339_TFO_HR_cs.037, spm files & all files included in the "Raw AFM data" sub folder - Gwyddion, Nanoscope Analysiseps files - illustrator/ pdf software.mpg - any movie player.gro - gromacs files- GRO files may be viewed on a computer using a supporting HP calculator emulator, such as Emu48.xtc files - gromacs files- a suitable software like XTrkCADsee http://manual.gromacs.org/documentation/2018/user-guide/file-formats.html for more information on gromacs files.Study aims and methodology: In the cell, DNA is arranged into highly-organised and topologically-constrained (supercoiled) structures. It remains unclear how this supercoiling affects the detailed double-helical structure of DNA, largely because of limitations in spatial resolution of the available biophysical tools. In this study, the authors combined high-resolution atomic force microscopy (AFM) with molecular dynamics (MD) simulations to reveal how supercoiling affects global and local DNA conformation, structure and dynamics in DNA minicircles of length 250-340 bp. The following procedures are described in more detail in the related article: generation and purification of small DNA circles, preparation and analysis of different topological species of minicircles, S1 nuclease digestions, atomic force microscopy, atomistic simulations and surface plasmon resonance. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Atomic_force_microscopy_and_atomistic_molecular...
 
Description Agnes Noy 
Organisation University of York
Country United Kingdom 
Sector Academic/University 
PI Contribution Synthesis of small DNA circles
Collaborator Contribution Simulations
Impact 3papers published so far: 1. A. D. Bates, A. Noy, M. M. Piperakis, S. A. Harris, A. Maxwell, Small DNA circles as probes of DNA topology. Biochem Soc Trans 41, 565-570 (2013). 2. A. Noy, A. Maxwell, S. A. Harris, Interference between Triplex and Protein Binding to Distal Sites on Supercoiled DNA. Biophys J 112, 523-531 (2017). 3. A. L. B. Pyne et al., Base-pair resolution analysis of the effect of supercoiling on DNA flexibility and major groove recognition by triplex-forming oligonucleotides. Nature communications 12, 1053 (2021). Multidisciplinary: Maxwell - biochemistry; Noy - simulations
Start Year 2011
 
Description Leake 
Organisation University of York
Country United Kingdom 
Sector Academic/University 
PI Contribution Provision of materials and expertise
Collaborator Contribution They are carrying out high-resolution microscopy experiments using material generated in my lab.
Impact One published paper
Start Year 2017
 
Description Pyne 
Organisation University College London
Country United Kingdom 
Sector Academic/University 
PI Contribution We have provided DNA minicircles for analysis
Collaborator Contribution She has been analysing DNA sample using atomic force microscopy.
Impact Paper published: 1. A. L. B. Pyne et al., Base-pair resolution analysis of the effect of supercoiling on DNA flexibility and major groove recognition by triplex-forming oligonucleotides. Nature communications 12, 1053 (2021). Multidisciplinary: biochemistry, AFM
Start Year 2015
 
Description Sarah Harris 
Organisation University of Leeds
Country United Kingdom 
Sector Academic/University 
PI Contribution We have been designing and making small DNA circle substrates
Collaborator Contribution Sarah's group have carried out atomistic simulations
Impact 3 published papers so far: 1. A. D. Bates, A. Noy, M. M. Piperakis, S. A. Harris, A. Maxwell, Small DNA circles as probes of DNA topology. Biochem Soc Trans 41, 565-570 (2013). 2. A. Noy, A. Maxwell, S. A. Harris, Interference between Triplex and Protein Binding to Distal Sites on Supercoiled DNA. Biophys J 112, 523-531 (2017). 3. A. L. B. Pyne et al., Base-pair resolution analysis of the effect of supercoiling on DNA flexibility and major groove recognition by triplex-forming oligonucleotides. Nature communications 12, 1053 (2021). Multidisciplinary: Maxwell - biochemistry; Harris - simulations
Start Year 2011