Detecting cytosine methylation at the single DNA molecule level

Lead Research Organisation: University of Southampton
Department Name: Optoelectronics Research Ctr (closed)

Abstract

Although the genomes of many organisms (humans, plants, invertebrates and vertebrates) have been sequenced and many of the genes identified, our understanding of the regulation of the genes is limited due to lack of analysis technology. DNA is composed of four nucleic acid bases, adenine, guanine, cytosine and thymine. Some of these nucleic acid bases can be modified by enzymes and as a result have an additional methyl group; here we will investigate new technologies for the detection of methylcytosine and unmethylated cytosine within single DNA molecules. The methylation of cytosine nucleic acid bases is associated with gene silencing. In humans DNA methylation is considered to play a critical role in development and is aberrant in many diseases, but as yet the complete role remains unclear. There are numerous techniques for the detection of methylated cytosine in DNA, but the current methodologies do not yet provide a simple, fast, reliable cheap approach. A major problem is the need to evaluate DNA from cell samples that will contain the same DNA sequence but which are heterogeneous with respect to the cytosine residues that are methylated. So an average is often obtained. Those techniques that do allow single DNA strands to be evaluated are highly laborious and limited. Here we will develop a new approach for detecting sequences containing methylated cytosines at the single molecule level. There are currently other groups working in the field of DNA sequencing of single molecules, but these methods are slow and the DNA is investigated as a single strand. We will interrogate double-stranded DNA and this will allow us to detect methylated or unmethylated cytosine molecules on each strand, called hemi-methylation. Our approach is to create an artificial form of DNA, an oligonucleotide, that associates and wraps within the major groove of double-stranded DNA molecule at specific sequences. This artificial form of DNA when associated is called a triplex and the molecules synthesised will also contain a fluorophore. When the DNA sample has been treated with these triplex forming oligonucleotides the helix will contain fluorophores at different points along it. We will inject the DNA sample into a small channel that will result in unravelling and straightening of the strand so that it is then threaded into an optical interrogation channel. The fluorophores will be excited with light, which in the presence of nanostructures within the nanochannel will result in fluorescence intensity changes. The change in intensity will provide a code that indicates the methylation status of the different cytosine containing sequences (unmethylated, hemi-methylated, doubly methylated). A simple technique to detect the methylated and unmethylated cytosines within DNA sequences will be important for a wide academic, clinical and industrial research community, since this will allow a greater understanding of gene regulation. There are many research areas where cytosine methylation is considered to play a significant role in humans, such as diet related disease, inflammatory diseases, embryonic development to name a few, or in plants for understanding the effect of environmental stress. But as noted above, cytosine methylation is important for many organisms, and a technique that allows for the analysis of the patterns of methylation within genes has the potential to be commercially valuable in the longer-term. First a better understanding of DNA methylation is required, but it is possible that a form of the approach proposed here will yield a diagnostic tool.

Technical Summary

There are a number of approaches that have been tried, tested, dropped and adopted, but it is perhaps true to say that of the current assays at the disposal to researchers in the field no single assay fits all needs. To map methylcytosines in DNA, the gold standard was considered to be treatment with bisulphite which results in the conversion of unmethylated cytosine to uracil and thus the resulting DNA strand can be amplified and cloned. The resulting strands can thus be detected by using microarrays, but this has limited the sequences that have been interrogated. Recently the combination of bisulphite treatment with high-throughput sequencing has allowed for the mapping of methylated cytosines in the genome of the plant arabidopsis thaliana. But the use of such a method for organisms with larger genomes is impractical because so many samples must be sequenced to study tissue and disease specific variability. The problem has been even more complex with the recent finding that 5-hydroxymethylcytosine is present and the bisulphite followed by sequencing methods does not resolve this. Single DNA molecule analysis is considered by many as the potential way forward. Here we will investigate a new way to interrogate individual double-stranded DNA sequences. Novel oligonucleotide probes that associate with specific DNA sequences as triplexes will be synthesised; these probes will be designed to associate to sequences containing methylated, hemi-methylated and unmethylated cytosine. . Probes will also be developed during the later stages of the programme for the detection of sequences containing 5-hydroxymethylcytosine.. These probes will be labelled with different fluorophores. The fluorescently labelled probes will be interrogated in fluidic channels using optical methods.

Planned Impact

Research in the field of epigenetics, notably DNA methylation, is severely hindered by a lack of simple, cheap, fast technologies for reliable analysis. There is enormous diversity of DNA methylation analysis techniques, and many of these techniques yield results that cannot be compared easily. Because DNA is collected from a number of cells either the result is obtained as an average methylation level across many DNA molecules or as a pattern along short sequences of individual DNA molecules. The level of information obtained is severely hindered by speed, cost and effort needed. In addition the information is also compromised by the potential flaws in some of the techniques used including polymerase bias, frequency of mutations of methylcytosine to thymine (and hence masking of information by the bisulphite conversion when used in combination with sequencing methods), and the more recent finding of 5-hydroxymethylcytosine, which is not detected by the bisulphite-sequencing approach. New techniques for the analysis of DNA methylation will significantly 'openup' the opportunities for researchers working in the field of epigenetics, as well as opening up opportunities for companies supplying these research tools. Eventually, an understanding of DNA methylation will provide significant opportunities in the fields of diagnostics, an area where the UK has current industrial growth. The UK has an ageing population and a society with a poor diet; many of the diseases associated with individuals with these profiles are considered to be a result of aberrant DNA methylation profiles. Diagnostics in the future are likely to contain assays for DNA methylation analysis. DNA methylation is also important for plants, invertebrates and other vertebrates. Understanding the impact of the environment on these organisms on DNA methylation will be important for a wide range of agricultural activities. This will be particularly important in understanding the role of stress, the environment, the length of a day, on various crops. But it is also important for understanding the interplay of temperature, nutritional, brain and reproductive networks in mammals, insects and fish. An understanding of these factors will be important for efficient food production in the future.
 
Description Our programme is to achieve a new approach for evaluating single genomic strands of DNA through optical, microfluidic and chemical methods. We have developed probes which distinguish whether the cytosine in the DNA is methylated or not for sequences such as CpA. We have developed fluidic channels for unravelling long DNA strands.
In the later part of the funding we have developed both theoretical and experimental data that fits our hypothesis for unravelling long genomic DNA strains and delivering these into optical interrogation channels (essentially using the fluid flow within the structured channels to unravel the strand, orient the end and thread the end into an interrogation channel). As an avenue to investigate alternatives for the detection of methylated CpG sequences (rather than only CpA sequences that were possible to detect using our probe approach developed earlier on in the funding) some investigations using various fluorescently labelled proteins and Intercalators were performed and we have obtained evidence for increased intercalation of a fluorescent dye at the methylated CpG site.
Exploitation Route The work is published in journals and some of the data is to be submitted for a patent. We are in discussion with industry, notably those companies involved in DNA sequencing with a view to adapt some of our findings for application. Due to the infrastructure failures publications will be published later than planned.
Sectors Electronics,Healthcare,Pharmaceuticals and Medical Biotechnology

 
Description 1) Discussions have been on-going with Oxford Nanopore with respect to the data we have obtained so far. Our studies concerning microfluidic manipulation of DNA strands is being considered for a patent, but the practical work has been stopped (grant suspended) due to a flood so we are holding back on pursuing this at the moment. 2) The grant supported a part-time technician who had not been involved in research since she left a career previously to bring up her children. Unfortunately due to problems out of her control she remained at home supporting her family for 13 years. The part-time post involved a fair amount of training, notably in computational skills and laboratory skills as these had changed significantly over the time. She had found it very difficult to get back into an academic environment prior to taking up the post. She has secured a permanent post immediately upon leaving this post when her contract ended; the post is appropriate to her qualifications and experience (PhD). 3) A further female returner-to-work was recruited in 2015 following restarting of the grant (following suspension following the catastrophic failure of a pressurised cooling system and a flood of 1000 gallons of solution over our optical system that had been built at the start of the funding). The part-time post involved a fair amount of training as she had not worked in a laboratory for >10 years. She was very motivated and made a significant contribution to the group and as a result not only regenerated her molecular biology/genetics skills but also developed skills in microscopy.
First Year Of Impact 2014
Sector Electronics,Pharmaceuticals and Medical Biotechnology,Other
Impact Types Societal,Economic

 
Description EU - Participation in misconduct assessment panel associated with large amount of research funding inappropriately used (in a research area closely related to my own RCUK funded grant)
Geographic Reach Europe 
Policy Influence Type Participation in a guidance/advisory committee
 
Title Fabrication of 3D structure gold films with thickness of the nanometer scale with precise geometries and high smoothness for DNA sequencing applications 
Description Based upon theoretical designs 3D gold films of specific geometries and high smoothness were fabricated. The method involved a templated method whereby a thin Teflon coating with a subsequent gold film evaporation provided a film which conformed to the arrays of pyramid structured surface. Following lift-off of the film, nanopores were milled into the structures at locations for DNA translocation. 
Type Of Material Technology assay or reagent 
Year Produced 2015 
Provided To Others? Yes  
Impact First published example of method based upon micro and nanoengineering of 3D structure film of designed geometries. This has only just been disseminated, but attracting interest of other research groups. 
 
Title Microfluidic channels for unravelling long genomic DNA strands 
Description Our aim was to unravel long genomic DNA strands and thread this into a confining channel for interrogation. Theoretical modelling was used to both design and optimise the channel dimensions. These were fabricated and shown to be successful in unravelling long genomic double stranded DNA. These strands were then possible to interrogate optically using single molecule fluorescence methods. 
Type Of Material Technology assay or reagent 
Year Produced 2017 
Provided To Others? No  
Impact This has only recently been completed, so publication preparation is on-going. 
 
Title Theoretical approaches for predicting the movement of long DNA fragments through shaped microfluidic channels 
Description A model was built based upon low Reynolds number flow of long polymer chains with physical parameters consistent with the persistence length, extension and dimensions of long genomic DNA fragments of 50kb. This model, created from first principles was then applied within the COMSOL software framework. The method allowed us to establish the mechanisms for DNA 'unravelling' and extension within shaped channels. 
Type Of Material Computer model/algorithm 
Provided To Others? No  
Impact So far some aspects have been disclosed at meetings, but the work is being written up for publication along with the experimental studies. 
 
Title Theoretical model of electric field enhancement in nanopores within 3D structures 
Description DiffractMOD employs the Rigorous Coupled Wave Analysis method to obtain the backward diffraction efficiency of the surface when illuminated by a plane wave from above (normal to the upper face of the pyramid cavities). A transmission line analysis is applied to an expansion of the refractive index and Electromagnetic field vectors in Fourier space on a cell-by-cell basis, across a fine mesh superimposed on the three-dimensional representation of the refractive index. In our simulation, periodic boundary conditions are used at the sides and number of harmonics is set to '7' in x and y coordinate transverse to the primary direction, z. The number of harmonics is used to expand the refractive index and field in Fourier space resulting in accurate simulation. Simulations are performed for a wide spectral range (400 nm to 1500 nm with 5 nm steps), typically with TM-polarisation. The designs were optimised to provide electric field enhancement within the nanopore to facilitate fluorescence 'read-out' of fluorophores as they transit the pore. 
Type Of Material Computer model/algorithm 
Year Produced 2015 
Provided To Others? Yes  
Impact This information was fully disclosed in the supplementary information of two papers published in 2015 and 2016. In addition the data was provided in movie format to facilitate full disclosure of the data. 
URL http://iopscience.iop.org/0957-4484/27/6/065302/media/video1.mp4
 
Description Cafe Scientifique 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Public/other audiences
Results and Impact Cafe Scientifique at the Isle of Wright. There are significant problems for residents on the island to get any sort of opportunity to attend any sort of public dissemination activity associated with science. It has always been disappointing to me that the educational results on the island are lower than on the mainland of Hampshire. A trip to the Isle of Wright from my department in the University is challenging and required approximately 3 hrs of travel one-way. Thus although this was a time-consuming activity the audience was especially enthusiastic. It was clear that the organisers struggled to attract speakers. The presentation that I delivered was appropriate to an audience with a very limited knowledge of genetics (and epigenetics) and the application of sequencing for stratified medicine. This resulted in a discussion of 1.5 hrs and only stopped because it was necessary to get a boat back to Southampton.
Year(s) Of Engagement Activity 2017
 
Description Chemistry and Biology for DNA and RNA synthesis. 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? Yes
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Industry visit Presentation

no actual impacts realised to date
Year(s) Of Engagement Activity 2013
 
Description Click nucleic acid ligation: Chemistry, Biology and Applications 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? Yes
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Industrial visit Presentation and discussions

no actual impacts realised to date
Year(s) Of Engagement Activity 2013
 
Description Click nucleic acid ligation: Chemistry, Biology and Applications 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? Yes
Geographic Reach International
Primary Audience Participants in your research or patient groups
Results and Impact Presentation Presentation

no actual impacts realised to date
Year(s) Of Engagement Activity 2013
 
Description Click nucleic acid ligation: Chemistry, Biology and Applications 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? Yes
Geographic Reach International
Primary Audience Participants in your research or patient groups
Results and Impact Departmental seminar Presentation

no actual impacts realised to date
Year(s) Of Engagement Activity 2013
 
Description Exploring the limits of Nucleic Acid Chemistry 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? Yes
Geographic Reach International
Primary Audience Participants in your research or patient groups
Results and Impact Departmental seminar Presentation

no actual impacts realised to date
Year(s) Of Engagement Activity 2013
 
Description Exploring the limits of Nucleic Acid Chemistry 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? Yes
Geographic Reach local
Primary Audience Participants in your research or patient groups
Results and Impact Presentation Oxford University Department of Chemistry December 2013:

Exploring the limits of Nucleic Acid Chemistry
Presentation

no actual impacts realised to date
Year(s) Of Engagement Activity 2013
 
Description From DNA Chemistry to Biological Applications; Current Research and Future Aspirations. 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? Yes
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact Presentation to Oxford University Department of Oncology Presentation

no actual impacts realised to date
Year(s) Of Engagement Activity 2014
 
Description Meet The Scientist 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? Yes
Geographic Reach Local
Primary Audience Schools
Results and Impact Public dissemination The different ways DNA can be used for medicine and diagnostics, including super-fast DNA testing Presentation to 13/14yr old school children.

no actual impacts realised to date
Year(s) Of Engagement Activity 2013
 
Description Participation in Oxfordshire Science Festival 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? Yes
Geographic Reach International
Primary Audience Schools
Results and Impact Public dissemination at the Science Museum in Oxford to general public/children The Stand

Solving the Energy Crisis: From Ancient to Future Solar Fuels

What do giant rubber gloves, gut bacteria, crystal structures, and photosynthesis have in common? We need all four to learn how to produce sustainable fuels using energy from the sun. Visit our Solar Fuels stand to learn about green energy, extract chlorophyll from plants in a hands-on science experiment, touch incredible 3D printed models, chat with the scientists carrying out the research.
Stand

no actual impacts realised to date
Year(s) Of Engagement Activity 2014
URL http://www.bbsrc.ac.uk/news/food-security/2011/110708-pr-devastating-pig-disease.aspx
 
Description Science Cafe, at local pub 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? Yes
Geographic Reach Local
Primary Audience Public/other audiences
Results and Impact Presentation and questions and Answer session: The different ways DNA can

be used for medicine and diagnostics, including super fast DNA testing.

23 people, 1.5 hours interspersed with Q&A Presentation suitable for pub

request to participate in further dissemination activities to public
Year(s) Of Engagement Activity 2012