WormBase: expanding the reference resource for helminth research

Lead Research Organisation: Wellcome Sanger Institute
Department Name: Wellcome Trust Genome Campus

Abstract

Many developments in modern medicine are based on the enormous progress made over the last 50 years in our understanding of how our genes work. Many diseases are caused by genes malfunctioning in some way, and understanding the function of genes allows to design diagnostic tools and treatments against specific malfunctions. Much of the research that has led to this progress is difficult to do in humans. Because all animals are related by evolution, study of simple animals helps to understand human biology. Certain key model organisms have been the focus of intensive study. One of these is the tiny roundworm, or nematode, Caenorhabditis elegans. C. elegans research has been supported by the MRC for over 50 years, leading to two Nobel prizes, and is a key contributor to our understanding of basic processes such as cell death and ageing. In addition to its relevance for understanding human biology, research on C. elegans is particularly important to our understanding of the biology of parasitic worms. Globally, more than one billion people are infected with parasitic nematodes, including filarial nematodes, such as Onchocerca that causes river blindness, and Brugia that causes elephantiasis; intestinal nematodes such as hookworms, whipworms and giant roundworms. Typically infections are associated with and promote poverty. Genetic research has been revolutionized by obtaining the complete DNA sequence (the genome) of the organism being studied, which contains all the gene sequences. C. elegans was the first animal to have its genome sequenced and now major programmes are collecting genome sequences for parasitic species. To use genome sequences and all the new information about genes requires information resources that tie together DNA information with experimental data, and relate corresponding genes across animals. This application will support the development of WormBase, the reference information resource for all genomic and biological data for C. elegans that is essential for fundamental research using this model organism. The application will also support the expansion of WormBase to include the curation of genomic and experimental information for increasing numbers of parasitic nematodes, and an expansion in scope to include Schistosoma mansoni, a parasitic flatworm. Research into both types of parasitic worms (flatworms and nematodes) has much in common despite their evolutionary distance and WormBase will now support fundamental research in both these areas. By relating parasitology and basic science research, WormBase is uniquely placed to enable community-wide exploitation of data emerging for parasites by applying information, technological infrastructure, and curatorial expertise developed for C. elegans.

Technical Summary

We will integrate, curate, analyse and disseminate (through the community website wormbase.org) molecular and other research data from C. elegans - a widely used model organism - and eight important species of parasitic worms. Our approach is to import (especially sequence-based) data from relevant public archives, analyse it automatically using state-of-the-art tools (particularly for the structural and functional annotation of reference genome sequences), to revise these data by manual review and to supplement them by abstraction of functional information and experimental metadata from the scientific literature into ontological vocabularies that can be used for semantic querying. We will support this process using a mixture of community-standard and in-house bioinformatics tools and technologies, including the WormBase database infrastructure (which is in the process of migrating to a new backend using the Datomic database); the Apollo annotation tool; InterProScan for annotation of protein domains; the Gene Ontology and other, worm-specific ontologies; in-house software for literature-identification and triage; state-of-the-art tools (e.g. Kallisto) for RNA alignment. We will use common formats (e.g. FASTA, GFF, BAM/CRAM, VCF, track and assembly hubs) to collect, share and disseminate data according to type. We will scan the molecular archives to automatically identify important new data sets, and additionally keep in regular communication with members of the research community to ensure that no recent or forthcoming data is overlooked. Where requested, we will help community members prepare new data for submission to the public archives.

We will coordinate these efforts with other projects of the UK WormBase team, and with our international partners on the WormBase project, managing a series of regular data releases in an efficient fashion and making a significant contribution from the UK to the global development of the WormBase resource.

Planned Impact

The potential beneficiaries of this research are, in the short term, academic researchers interested in helminth-mediated disease in humans (and farm animals), and more generally researchers interested in human biology using C. elegans as a model. Basic molecular insights in C. elegans have driven major fundamental advances in modern medicine, illustrated by 2 Nobel Prizes: the 2002 prize to Brenner, Sulston and Horvitz for identifying genes that regulate organ development and apoptosis in C. elegans, along with the equivalent genes in man; and the 2006 prize to Fire & Mello for the discovery of RNA interference.

Parasitic worms are studied with the aim of killing or controlling them. WormBase will significantly facilitate the direct exploitation and application of large-scale data from sequence based experiments towards this aim, through the provision of accessible, accurate, well-described reference data. Furthermore, the proposed resource will leverage the in-depth and highly organised knowledge-base of C. elegans into the poorer-resourced areas of parasitic nematode research, and from the reference data produced in this project to even less-well studied diseases.

In the longer term, potential beneficiaries include drug companies, farmers, and most strikingly the millions of individuals whose lives are currently blighted and or curtailed but which could be remedied by appropriate programs of treatment or remediation. The application of genomic science towards medical improvements is still in its infancy, but for pathogens with smaller and simpler genomes such as viruses and bacteria, genomic insights are already being translated into tangible benefits e.g. tracking pathogen transmission and the monitoring of drug resistance. For parasitic nematodes, genomic research is expected to deliver significant benefits in the medium term, for which ready access to the data is a clear requirement. The primary beneficiaries will be the people directly suffering from nematode infections, which includes about 2 billion people infected with soil-transmitted helminths alone (WHO 2012). Advances in drug treatment, transmission reduction or vaccination could improve the lives of many people who may otherwise suffer from serious gastrointestinal disease, stunted growth and mental development, malnutrition and fatigue, disfigurement, blindness, or liver and bladder pathologies. Although some effective anthelmintics exist, the available arsenal of drugs is limited and the spread of drug resistance a real danger. Furthermore, large-scale improvements in the treatment and control of helminthiases are likely to bring huge socio-economic benefits to some of the least developed countries.

In addition to the direct health improvements from a reduction in helminth infections, people in endemic areas could also benefit indirectly e.g. by an improved response to vaccinations, by reduced transmission or by an improved disease outcome for other diseases such as tuberculosis, malaria, and HIV/AIDS, as co-infections with helminths have been shown to have potentially adverse effects (see Elliott and Yazdanbakhsh, 2012, and other articles in the same issue of this journal for recent reviews).

Looking further into the future, a thorough understanding of nematodes and their interactions with the human immune system may lead to fundamental new insights that will allow a much more sophisticated manipulation of the human immune system for medical purposes. Such knowledge may ultimately be exploited for and benefit the effective treatment of allergies and other (autoimmune) diseases.

Secondary beneficiaries include UK school children (through their participation in the IRIS project, which we will support), and the UK science community more generally, which benefits from the involvement of young people in actual scientific work, which is likely to kindle interest in scientific careers.

Publications

10 25 50

publication icon
International Helminth Genomes Consortium (2019) Comparative genomics of the major parasitic worms. in Nature genetics

publication icon
Howe KL (2020) Ensembl Genomes 2020-enabling non-vertebrate genomic research. in Nucleic acids research

publication icon
Harrison PW (2023) Ensembl 2024. in Nucleic acids research

publication icon
Harris TW (2020) WormBase: a modern Model Organism Information Resource. in Nucleic acids research

publication icon
Grove C (2018) Using WormBase: A Genome Biology Resource for Caenorhabditis elegans and Related Nematodes. in Methods in molecular biology (Clifton, N.J.)

publication icon
Gene Ontology Consortium (2021) The Gene Ontology resource: enriching a GOld mine. in Nucleic acids research

 
Description Bioinformatics and Biological Resources Fund
Amount £500,000 (GBP)
Funding ID BB/P024610/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 07/2017 
End 01/2021
 
Title WormBase ParaSite Release 17 
Description An update to the open access database WormBase ParaSite. Contain the following: Integration of AlphaFold 3D protein structures for 8 species. Addition of 11 new genome assemblies of which 6 are new species. Annotation updates for 2 genomes. Gene-phenotype associations 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
Impact Usage stats not yet analysed 
URL https://parasite.wormbase.org
 
Title WormBase- ongoing releases 
Description WormBase is primarily a model organism resource which provides deeply curated genomic and functional genomic data to enable C elegans research. Six WormBase data releases are produced per year, reflecting ongoing curation and data integration. 
Type Of Material Database/Collection of data 
Provided To Others? Yes  
Impact Four helminth species (Brugia malayi, Onchocerca volvulus, Trichuris muris and Strongyloides ratti) are now also integrated into WormBase. These genomes can benefit from the deeper level of curation that the WormBase infrastructure supports, including gene model curation (with full history tracking) and literature curation. 
URL https://wormbase.org/
 
Description WormBase consortium 
Organisation WormBase (Biology and Genome of C.Elegans)
Country United States 
Sector Charity/Non Profit 
PI Contribution WormBase Consortium is led by Paul Sternberg of CalTech, Kevin Howe of the EBI, Matt Berriman of the Wellcome Sanger Institute, and Lincoln Stein of the Ontario Institute for Cancer Research. The consortium runs a model organism database containing data from research on C. elegans and other nematodes. WormBase Parasite provides searching and data access capabilities that are not available through the WormBase website
Collaborator Contribution WormBase curates reference genomes which are then imported into WormBase Parasite and provide important functional information for understanding the genomes of comparator species.
Impact Provision of annotated genomes for C. elegans and Brugia malayi
Start Year 2014
 
Description Advanced Course in Helminth Bioinformatics 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Three members of the WormBase ParaSite team acted as instructors in a 5 day workshop on Helminth Bioinformatics, hosted by the West African Centre for Cell Biology of Infectious Pathogens, Accra, Ghana. The workshop had approximately 20 participants, all postgraduate or post-doctoral researchers based in institutions across Africa. Material covered included use of the WormBase ParaSite database, an introduction to the Linux command line, transcriptomics, genome assembly, and population genetics. The main goal was to increase the genomics/bioinformatics capacity of helminth researchers in Africa. The course received excellent formal feedback, with WormBase Parasite being voted one of the most useful sections by many participants.
Year(s) Of Engagement Activity 2019
URL https://coursesandconferences.wellcomegenomecampus.org/our-events/helminth-bioinformatics-ghana-2019...
 
Description BSP drop in helpdesk 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Postgraduate students
Results and Impact We ran a drop in desk at the annual British Society of Parasitology Spring meeting. We answered questions and demonstrated features from the user community.
Year(s) Of Engagement Activity 2019
 
Description BioMart webinar 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact A webinar (comprising presentation and question and answer session) aimed at worm researchers, covering data mining strategies in BioMart. The webinar remains available on the WormBase YouTube channel.
Year(s) Of Engagement Activity 2021
URL https://www.youtube.com/watch?v=IM2j7-OPmtQ
 
Description Helminth Bioinformatics (Asia) (Virtual) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Three member of the team participated as trainers trainers in a bioinformatics course that targeted helminth researchers in Asia. The course curriculum included extensive use of WormBase Parasite.
Year(s) Of Engagement Activity 2021
URL https://coursesandconferences.wellcomeconnectingscience.org/event/helminth-bioinformatics-asia-virtu...
 
Description Helminth genomics workshop, Shanghai 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact A WormBase ParaSite staff member acted as lead instructor on a 2 day course in Helminth Bioinformatics in Shanghai, China. The course had approximately 40 participants, largely postgraduate students, from across China. Material covered included basic concepts in bioinformatics and use of the WormBase ParaSite database. Formal feedback was very positive.
Year(s) Of Engagement Activity 2019
 
Description KS2 seminar/workshop at local primary school 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Schools
Results and Impact Gave an assembly to ~120 pupils from Key stage 2 at local primary school. Included live microscopy and a science talk.
Year(s) Of Engagement Activity 2019
 
Description Poster presentation at the Parasitic Helminths: New Perspectives in Biology and Infection meeting 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact A WormBase ParaSite staff member presented a poster on the WormBase ParaSite resource at the Parasitic Helminths: New Perspectives in Biology and Infection conference (Hydra, Greece). Users were informed of new features and given the chance to ask questions on use of the resource.
Year(s) Of Engagement Activity 2019
 
Description Short talk in "Bridging the Divide" workshop, International C elegans meeting, UCLA 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact A WormBase ParaSite staff member gave a short talk on the resource in a parasitology session at the International C elegans Meeting. The audience included both parasitologists and C elegans researchers.
Year(s) Of Engagement Activity 2019
 
Description Training workshop at Schistosomiasis conference in Brazil 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Postgraduate students
Results and Impact A training workshop was held at the Schistosomiasis symposium in Brazil. The workshop introduced participants to WormBase ParaSite and showed them how to perform queries to extract research data
Year(s) Of Engagement Activity 2022
 
Description Virtual workshop at British Society for Parasitology online confernce 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Operated a virtual booth at a three-day conference, which included a 2-hr open workshop session on the last day
Year(s) Of Engagement Activity 2021
URL https://bsp.uk.net/2020/07/24/parasites-online-2021/
 
Description WormBase ParaSite webinar 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Two WormBase ParaSite staff members ran a webinar on the use of the resource, aimed at all helminth researchers. The webinar received positive feedback, and remains available online as training material.
Year(s) Of Engagement Activity 2019
URL https://www.ebi.ac.uk/training/online/course/introduction-wormbase-parasite-resources