Bilateral NSF/BIO-BBSRC: Engineering Tunable Portal Hybrid Nanopores for High-Resolution Sequence Mapping

Lead Research Organisation: University of York
Department Name: Chemistry

Abstract

Genome technology is an important part of modern life and is used routinely in medicine, forensic and crop science. However, despite rapid advances in DNA sequencing technology, large regions of genomes, including the human genome, are poorly characterised. In this proposal we aim to develop a new hybrid nanopore technology for DNA analysis and explore the possibility of using this method for improved analysis of these poorly defined or 'dark' genomic regions.

These 'dark' regions are poorly defined largely due to the presence of multiple repeats. These 'repetitive elements' are difficult to analyse with currently available technology. This is because current methods rely on DNA polymerase enzyme activity which is prone to 'stuttering' and 'slipping' on AT rich repeats and 'stalling' on GC rich repeats.

Nanopore technology is a polymerase-independent method for mapping at near-basepair resolution of very long DNA fragments (>100 Kb). Indeed it is hoped that this technology will be advanced towards the de novo sequencing of whole genomes. However, despite great promise, these technologies are still in development and currently are between 60-90% accurate. While protein nanopores, such as the alpha-haemolysin protein, are easily reproducible and tunable, they are generally derivatives of membrane pores. Thus they are supported in relatively fragile lipid-like membranes, requiring detergent for handling and high (micromolar) DNA concentration for signal detection. Conversely, ultra-thin layer solid-state (SS) nanopores are more robust and require only nanomolar quantities of DNA. However, they are difficult to fabricate routinely with the tiny diameter (<4nm) required for high-resolution DNA mapping.

Hybrid nanopores can combine the advantages of both systems: namely (i) the reproducible production of tunable nanopores with diameters of 1-3 nm with (ii) robust properties that results in low (nanomolar) DNA concentrations required for signal detection. However, current protein nanopores require detergent and substantial chemical modification for integration into SS nanopores.
We propose to investigate the suitability of a natural DNA nanopore, the portal protein from a thermophilic virus, for hybrid nanopore production. This protein is the nanopore through which DNA passes during packaging of the viral genome and so naturally processes the characteristics designed for capture and directional transition of dsDNA. Additionally this bionanopore is thermostable, highly soluble, tractable for bioengineering purposes and easy to produce in large highly pure quantities. Furthermore, the available high-resolution X-ray structure of the portal protein allows the design of portal variants with modified DNA transition properties.

In this proposal, we specifically aim to explore and optimise the integration of the portal protein into SS nanopores and define the DNA transition dynamics. Bioengineering methods will be used to optimise the portal protein for 3 microseconds/basepair transition speeds. The hybrid nanopore will be calibrated for accurate analysis of DNA sequences containing multiple repeats.

Technical Summary

Nanopore based sequencing is proving useful in sequencing long genomic regions (>100 Kb) and may provide a method to accurately map regions containing repetitive DNA sequences. However, this technology is currently in development with several challenges to overcome. Hybrid nanopores may combine the advantages of both protein and solid-state (SS) nanopores to form a reproducible, robust nanopore that requires nanomolar concentrations of DNA for detection. However, to date the only successful hybrid nanopore reported was created using the membrane-associated alpha-haemolysin protein in a complex process that required detergents.

We propose to exploit the stable and soluble portal protein from the thermophilic bacteriophage G20C, as an alternative protein component in the production of hybrid nanopores. This protein is a natural DNA nanopore, serving as the gate through which DNA passes during the viral packaging process. Indeed preliminary experiments using voltage differentials and sensitive electronic detection have proven very promising and we aim to capitalise on this work by defining the method for efficient and stable insertion of portal in solid-state nanopores. Time resolved ion current signals will be used to characterise the DNA transition dynamics. We will use bioengineering and chemical biology approaches to tune the tunnel loops of the portal protein to acheive DNA transition rates of 3 microseconds/basepair and to create a stable portal SS-nanopre interface. Engineered proteins will be characterised using biophysical techniques, prior to characterisation of the DNA transition dynamics. Portal variants with the most suitable properties will be further characterised by X-ray crystallography. Finally, hybrid nanopores will be used to map the position and size of protein-bound DNA regions in selected viral genomes and to define the characteristic ion current signals for transition of homopolymeric ssDNA.

Planned Impact

Beneficiaries and interested parties:
Inspired by the powerful DNA motors present in viruses, we aim to exploit a synthetic molecular nanomachine for creation of a novel tool for DNA analysis. Specifically, we aim to create a hybrid nanopore that combines the advantages of both protein and solid-state nanopores as a DNA mapping tool. Such a tool would be used to define the repetitive elements in DNA that are not well characterised with currently available methods. To this end, we propose to combine a recently characterised natural DNA pore found in thermostable bacterial viruses with a synthetic nanopore drilled into an ultrathin metal membrane fabricated with the latest photolithographic technology. Such a hybrid nanopore is expected to have superior stability, sensitivity and resolution for DNA mapping uses compared with either protein or synthetic nanopores alone. It is hoped that this would lead to the development of rapid and accurate methods for mapping multiple repeat regions within genomes. In the future this technology could potentially lead to devices for sequencing whole genomes.

(1) The immediate beneficiaries include those researchers in academia (national and international) and in the private commercial sector (pharmaceutical companies), who are developing new hybrid biological and synthetic molecular nanotechnology approaches, in particular novel DNA and protein sequencing tools. These researchers will benefit from the research outcomes that we plan to publish in months 18, 28 and 36. They will also benefit from new structural data on modified viral portal proteins which will be deposited with the Protein Data Bank and made publicly accessible upon publication. Engagement with industry will involve participation in the BBSRC Knowledge Transfer Network for Nanotechnology events and the provision of information for both web-based and print media articles.

(2) Long-term direct and indirect beneficiaries would include: (i) Researchers in pharmaceutical companies targeting rapid, cost-effective genome sequencing technology for personalised medicine approaches and therapeutic gain; (ii) Crop scientists who rely on genome sequencing technologies for improvement of yield, drought and disease tolerance; (iii) Researchers in academia and industry that seek to understand the genetic regulation of potentially useful natural products; (iv) Industrial researchers interested in understanding the composition and function of difficult-to-sequence and non-protein-coding genomic regions how this relates to human disease; (v) Forensic scientists that require superior accuracy for defining repeat regions used to identify individuals; and (vi) The wider population who will benefit from improved health, nutrition and justice that would accompany accurate definition of these previously poorly characterised repetitive regions.

Engagement with the wider community will be facilitated by participation in Forum for the Future, a non-profit sustainability network that connects government, business and community groups. Finally, participation in general public engagement events, such as the Cambridge Science Fair and YorNight events will provide opportunities for communication with the general public about the potential contribution of nanopore sequencing technology to improvements in everyday life.

Wider Social and Economic Benefits:
Development of a sequencing technology capable of accurately mapping whole genomes, including the regions that are currently poorly defined, would have broad economic and social impact. The life sciences (including health, pharmaceutical and forensic industries) and agri-food industries would benefit from the sale of new genetic tests and treatments for disease, and from superior crop species, leading to a healthier and well-nourished society.
 
Description We developed a procedure for insertion of polar proteins with an inner pore into lipid membranes, and have applied this procedure for the thermophage G20c portal protein. A key step of this methodology is porphyrin conjugation to a cysteine residue introduced at the outer rim of the portal protein. Following protein insertion into the lipid membrane, we characterised molecular transport through the central pore of the protein.
We have also introduced a procedure for creation of a hybrid nanopore, by inserting the thermophage protein into a solid state membrane. This allowed to combine the robust nature of solid state membrane with the simplicity of engineering the protein pore through introducing mutations within the protein tunnel. We further showed that the hybrid pore allows sensing and discrimination between different types of biomolecules.
Exploitation Route The established procedures can be used for bio-nanotechnological applications, in particular for molecular sensing using the portal protein pore inserted into a lipid membrane or a solid state pore.
Sectors Healthcare,Manufacturing, including Industrial Biotechology,Pharmaceuticals and Medical Biotechnology

 
Title PDB code 5NGD 
Description Atomic model of bacteriophage G20c portal protein V325G mutant and experimental data. 
Type Of Material Computer model/algorithm 
Year Produced 2017 
Provided To Others? Yes  
Impact This mutant form of the portal protein has an internal pore with reduced diameter and modified properties. 
URL http://www.rcsb.org/structure/5NGD
 
Description Bilateral application with Meni Wanunu at Northeastern University as described in the Proposal 
Organisation Northeastern University - Boston
Department Department of Physics
Country United States 
Sector Academic/University 
PI Contribution As described in the grant proposal, we have engineered variants of the viral portal protein with defined traits , determned the structure and physicochemical characteristics of these variant proteins by Xray crystallography, negative stained TEM, guanidium and thermal stability assays. One such variant was engineered for chemical modification via surface cysteines. This has been used successfully for insertion of porphyrin coupled portals into lipid bilayers for electrical sensing studies. Further variants have altered the dimensions and the surface charge of the internal tunnel.
Collaborator Contribution As described in the joint proposal for this award, Meni Wanunu's lab has expertise in the generation of solid state nanopores and their use in electrical sensing experiments. The proteins generated and modified by the Antson group were sent to the Wanunu laboratory for insertion into lipid bilayers and solid state nanopores for electrical sensing experiments.
Impact This is a multi-disciplinary project that combines the molecular and structural biology expertise of the Antson Group with the nanopore and electrical sensing expertise of the Wanunu group has been made possible by this bilateral award. As of March 2018 outputs include one publication (ACS nano) and two conference poster presentations.
Start Year 2015
 
Title Lipid-Free Anchoring of Thermophilic Bacteriophage G20C Portal Adapter Into Solid-State Nanopores 
Description We have developed a stable and water-soluble portal protein channel based on the thermophilic bacteriophage G20C, which we have demonstrated to be a robust, chemically programmable channel that either voltage- or pressure-inserts into a solid-state nanopore matrix to form a hybrid nanopore sensor device. The signal for sensing using this device can be either electrical or optical, the latter offering high-density parallelized readout from multiple adjacent pores. Key elements of the innovation include mechanisms to obtain the hybrid structure, to stabilize it, and to modify it so that different types of biomolecules can be sensed. 
IP Reference 62/673,118 
Protection Patent application published
Year Protection Granted 2018
Licensed No
Impact Further funding for collaborating laboratory (Meni Wanunu at Northeastern University, Boston, USA) through Oxford Nanopore Technology is currently under discussion. Full patent applications in EU, UK and USA to be made in 2019. Licence requests for academic research made by Benjamin Cressiot now at Evry University in Paris.
 
Title Lipid-Free Anchoring of Thermophilic Bacteriophage G20c Portal Adapter into Solid-State Nanopores 
Description Hybrid nanopores, comprising a protein pore supported within a solid-state membrane, which combine the robust nature of solid-state membranes with the easily tunable and precise engineering of protein nanopores. In an embodiment, a lipid-free hybrid nanopore comprises a water soluble and stable, modified portal protein of the Thermus thermophilus bacteriophage G20c, electrokinetically inserted into a larger nanopore in a solid-state membrane. The hybrid pore is stable and easy to fabricate, and exhibits low peripheral leakage, allowing sensing and discrimination among different types of biomolecules. 
IP Reference US2019360998 
Protection Patent application published
Year Protection Granted 2019
Licensed Yes
Impact This IP is in the process of being licensed to Oxford Nanopore Technologies.
 
Description Online engagement with high school students. 
Form Of Engagement Activity Engagement focused website, blog or social media channel
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Schools
Results and Impact Online chats with high school science classes and web based question answer sessions.
Year(s) Of Engagement Activity 2017
URL https://imascientist.org.uk
 
Description York Festival of Ideas - Tour of York Structural Biology Laboratory 
Form Of Engagement Activity Participation in an open day or visit at my research institution
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Public/other audiences
Results and Impact Two 1 hour tours were organised in which three groups of up to seven people were rotate through three different interactive engagement stations. These included the general wetlab tour where a PhD student, Dorothy Hawkins, described the work we carry out in the wet lab and demonstrated an interactive experiment running a DNA agarose gel where participants were able to load a sample. The second station was a crystallisation station run by Sandra Greive and Olga Moroz, where participants where given a tour of the crystallisation facility and participated in preparing hanging drop crystallisation experiments using lysozyme. Participants then viewed their crystals using light microscopy. The final part of the tour was conducted by Professor Fred Antson - providing the participants with a behind the scenes tour of the X-ray labs and the opportunity to view structural models using 3D glasses. A total of 30 places were available with 21 places booked and around 15 people attending.
Year(s) Of Engagement Activity 2018