Building long-read high-fidelity sequencing resources to support Bioscience Research in the UK

Lead Research Organisation: University of Edinburgh
Department Name: Sch of Biological Sciences

Abstract

The genomics revolution is entering a new phase, with new sequencing technologies that can read long DNA molecules with unprecedented accuracy and low costs. These high fidelity long DNA sequence reads can be used to generate high-quality gapless reference genomes, to fully characterise transcriptomes, to describe methylomes, and to characterize microorganisms in complex microbial mixtures such as in the gut microbiome or in environmental samples of soil or water. The PacBio Revio System is the latest long-read DNA sequencing platform that can generate extremely large amounts of long read data at an affordable cost. A key feature of the Revio is that it can generate highly accurate long reads, by reading a DNA molecule multiple times to correct errors. This grant will acquire the Revio platform, the first in Scotland, embed this within Edinburgh Genomics, and allow us to offer low-cost, high quality long read sequencing in collaboration with researchers in the biological and biotechnology research community. We will use the investment to support researchers who want to generate genomic data for diverse topics such as agriculture, biotechnology, genetics, immunology and synthetic biology. We will also offer advanced genomics data training to support the uptake of the Revio within the wider user community.

Technical Summary

The Revio system represents the evolution of PacBio single molecule sequencing technology, which has much improved single-molecule real-time (SMRT) cell capacity, shorter sequencing times and higher read quality than its predecessor the Sequel IIe. This platform will allow Edinburgh Genomics to generate larger numbers of very high accuracy reads of long DNA and RNA molecules. Long-read technology has benefits over short-read technologies (e.g. Illumina) where assembly or mapping often result in ambiguities or errors. High complexity regions of genomes (DNA) and transcriptomes (RNA) are best studied with longer reads. When assembling genomes at the chromosome level, long reads help to resolve issues like repetitive regions (e.g. centromeres) and large structural variants. The advantages of the Revio are its greatly increased throughput, due to having 25 million Zero Mode Wave guide (ZMW) per SMRT cell (vs 8 million in the Sequel II), and its much improved read quality due to the implementation of deep learning algorithms. The increased throughput and cheaper reagents results in a much lower cost per base (3-4 fold less than the Sequel II). This means that research is more cost effective, allowing greater sequencing depth or a larger number of samples. For example, a single SMRT cell will yield the equivalent of a 30X coverage for a genome of 3 Gbases (e.g. cattle genome) which can produce a full de novo genome assembly. In addition, the high fidelity (or 'HiFi') reads that are generated by PacBio, have an unprecedented level of quality on the Revio (>90% of reads above Q30), thanks to the implementation of deep learning algorithms (i.e. DeepConsensus), quality comparable to the read qualities only known by short-read sequencers. This improved accuracy of the Revio will allow the detection of single nucleotide variants, resolve fully phased genome and transcriptome assemblies and will allow hard to sequence regions to be accurately sequenced at scale.

Publications

10 25 50
 
Description The PacBio Revio system was installed at Edinburgh Genomics, the University of Edinburgh, by the end of August 2023. A week after installation, PacBio's Field Applications Scientists trained all team members on performing library preparations, running the instrument, and SMRT Link data analysis. Throughout September 2023, the laboratory and bioinformatics teams validated library preparation workflows and data generation procedures, respectively.

The instrument has been fully operational since October 2023. So far, it has sequenced whole genomes for 32 samples of animals (cattle, chicken, wild pig), insects (mealybugs), and plants (cape primrose), allowing the generation of high-quality genomes at chromosome-level resolution. Currently, we have 42 samples (animals, plants, insects, and yeast) queued for sequencing on the Revio.

Our Revio system was upgraded to the latest software version (SMRT Link V13) in January 2024. This software upgrade expands its capabilities to sequence smaller DNA fragments (up to 1 Kbp) and can produce increased yield with the new Kinnex chemistry for full 16S rRNA gene sequencing, single-cell full-length RNA sequencing, and Iso-Seq RNA sequencing. Since this last upgrade, we have observed an increased yield and higher quality on the sequencing runs, with many SMRT cells generating more than 100 Gbases of HiFi data (PacBio's expected yield is 90 Gbases), with more than 90% of bases above Q30.

This new sequencer has considerably increased Edinburgh Genomics' capabilities to support bioscience research groups that require high-fidelity de novo genomes at a chromosome level and full-length characterization of genes and transcripts.
Exploitation Route The PacBio Revio's long-read high-fidelity sequencing technology offers exciting possibilities in various fields, and some expected outcomes taking forward the usage of this technology are:

- Improved livestock breeding: Dr. Emily Clark will use the technology to create a comprehensive library of full-length messenger RNA for various farm animals. This information can lead to improvements in health and breeding.

- Understanding genetic traits in cattle: Prof. James Prendergast will generate long-read sequence data to understand the genetic factors behind economically important traits in cattle, such as meat quality, milk production, or disease resistance. This knowledge can be used to improve livestock performance.

- Comprehensive reference genomes: Dr. Alex Twyford will use this technology to create comprehensive reference genomes for a wide range of organisms across the Tree of Life. This research will be valuable for biodiversity conservation, evolutionary biology, and understanding the genetic basis of traits across different species.

- Enhanced disease resistance in poultry: Prof. Kaufman's research will use this high-fidelity long-read technology to investigate the genetic variations of the major histocompatibility complex (MHC) in chickens. This research can help develop strategies to enhance immune responses and combat disease more effectively in poultry, and potentially other species.
Sectors Agriculture

Food and Drink

Healthcare

Pharmaceuticals and Medical Biotechnology

 
Description PacBio Revio Launch Event 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Professional Practitioners
Results and Impact This was an in-person, free event organised in Edinburgh to mark the launch of the new PacBio Revio sequencer. The event featured presentations from six scientists across different research institutes in Edinburgh who have used Edinburgh Genomics' PacBio sequencing services. The keynote speaker was the Head of Production Genomics at the Darwin Tree of Life - Wellcome Sanger Institute, Hinxton, Cambridge. These talks showcased how PacBio HiFi technology is used across applications such as genome sequencing for plants and animals, genome architecture, non-Mendelian genetics, giant genome evolution, and high-quality reference genome assemblies at scale.
The event had 118 registered attendees (PIs, scientists, postdocs, and doctoral students), mainly from Edinburgh, but also from Glasgow and Dundee. It aimed to raise awareness of the new PacBio long-read technology capabilities, showcase the services provided by Edinburgh Genomics' Revio system, facilitate networking between scientists, and gather interest and feedback on the new technology.
The event included Q&A sections, and there were also networking opportunities with speakers and sequencing experts. The event sparked discussions about the use of this technology in other applications, and we received a larger number of inquiries interested in the technology afterwards.
Year(s) Of Engagement Activity 2023
URL https://programs.pacb.com/l/1652/2023-09-14/445j6k