WormBase: expanding the reference resource for helminth research
Lead Research Organisation:
Wellcome Sanger Institute
Department Name: Wellcome Trust Genome Campus
Abstract
Many developments in modern medicine are based on the enormous progress made over the last 50 years in our understanding of how our genes work. Many diseases are caused by genes malfunctioning in some way, and understanding the function of genes allows to design diagnostic tools and treatments against specific malfunctions. Much of the research that has led to this progress is difficult to do in humans. Because all animals are related by evolution, study of simple animals helps to understand human biology. Certain key model organisms have been the focus of intensive study. One of these is the tiny roundworm, or nematode, Caenorhabditis elegans. C. elegans research has been supported by the MRC for over 50 years, leading to two Nobel prizes, and is a key contributor to our understanding of basic processes such as cell death and ageing. In addition to its relevance for understanding human biology, research on C. elegans is particularly important to our understanding of the biology of parasitic worms. Globally, more than one billion people are infected with parasitic nematodes, including filarial nematodes, such as Onchocerca that causes river blindness, and Brugia that causes elephantiasis; intestinal nematodes such as hookworms, whipworms and giant roundworms. Typically infections are associated with and promote poverty. Genetic research has been revolutionized by obtaining the complete DNA sequence (the genome) of the organism being studied, which contains all the gene sequences. C. elegans was the first animal to have its genome sequenced and now major programmes are collecting genome sequences for parasitic species. To use genome sequences and all the new information about genes requires information resources that tie together DNA information with experimental data, and relate corresponding genes across animals. This application will support the development of WormBase, the reference information resource for all genomic and biological data for C. elegans that is essential for fundamental research using this model organism. The application will also support the expansion of WormBase to include the curation of genomic and experimental information for increasing numbers of parasitic nematodes, and an expansion in scope to include Schistosoma mansoni, a parasitic flatworm. Research into both types of parasitic worms (flatworms and nematodes) has much in common despite their evolutionary distance and WormBase will now support fundamental research in both these areas. By relating parasitology and basic science research, WormBase is uniquely placed to enable community-wide exploitation of data emerging for parasites by applying information, technological infrastructure, and curatorial expertise developed for C. elegans.
Technical Summary
We will integrate, curate, analyse and disseminate (through the community website wormbase.org) molecular and other research data from C. elegans - a widely used model organism - and eight important species of parasitic worms. Our approach is to import (especially sequence-based) data from relevant public archives, analyse it automatically using state-of-the-art tools (particularly for the structural and functional annotation of reference genome sequences), to revise these data by manual review and to supplement them by abstraction of functional information and experimental metadata from the scientific literature into ontological vocabularies that can be used for semantic querying. We will support this process using a mixture of community-standard and in-house bioinformatics tools and technologies, including the WormBase database infrastructure (which is in the process of migrating to a new backend using the Datomic database); the Apollo annotation tool; InterProScan for annotation of protein domains; the Gene Ontology and other, worm-specific ontologies; in-house software for literature-identification and triage; state-of-the-art tools (e.g. Kallisto) for RNA alignment. We will use common formats (e.g. FASTA, GFF, BAM/CRAM, VCF, track and assembly hubs) to collect, share and disseminate data according to type. We will scan the molecular archives to automatically identify important new data sets, and additionally keep in regular communication with members of the research community to ensure that no recent or forthcoming data is overlooked. Where requested, we will help community members prepare new data for submission to the public archives.
We will coordinate these efforts with other projects of the UK WormBase team, and with our international partners on the WormBase project, managing a series of regular data releases in an efficient fashion and making a significant contribution from the UK to the global development of the WormBase resource.
We will coordinate these efforts with other projects of the UK WormBase team, and with our international partners on the WormBase project, managing a series of regular data releases in an efficient fashion and making a significant contribution from the UK to the global development of the WormBase resource.
Planned Impact
The potential beneficiaries of this research are, in the short term, academic researchers interested in helminth-mediated disease in humans (and farm animals), and more generally researchers interested in human biology using C. elegans as a model. Basic molecular insights in C. elegans have driven major fundamental advances in modern medicine, illustrated by 2 Nobel Prizes: the 2002 prize to Brenner, Sulston and Horvitz for identifying genes that regulate organ development and apoptosis in C. elegans, along with the equivalent genes in man; and the 2006 prize to Fire & Mello for the discovery of RNA interference.
Parasitic worms are studied with the aim of killing or controlling them. WormBase will significantly facilitate the direct exploitation and application of large-scale data from sequence based experiments towards this aim, through the provision of accessible, accurate, well-described reference data. Furthermore, the proposed resource will leverage the in-depth and highly organised knowledge-base of C. elegans into the poorer-resourced areas of parasitic nematode research, and from the reference data produced in this project to even less-well studied diseases.
In the longer term, potential beneficiaries include drug companies, farmers, and most strikingly the millions of individuals whose lives are currently blighted and or curtailed but which could be remedied by appropriate programs of treatment or remediation. The application of genomic science towards medical improvements is still in its infancy, but for pathogens with smaller and simpler genomes such as viruses and bacteria, genomic insights are already being translated into tangible benefits e.g. tracking pathogen transmission and the monitoring of drug resistance. For parasitic nematodes, genomic research is expected to deliver significant benefits in the medium term, for which ready access to the data is a clear requirement. The primary beneficiaries will be the people directly suffering from nematode infections, which includes about 2 billion people infected with soil-transmitted helminths alone (WHO 2012). Advances in drug treatment, transmission reduction or vaccination could improve the lives of many people who may otherwise suffer from serious gastrointestinal disease, stunted growth and mental development, malnutrition and fatigue, disfigurement, blindness, or liver and bladder pathologies. Although some effective anthelmintics exist, the available arsenal of drugs is limited and the spread of drug resistance a real danger. Furthermore, large-scale improvements in the treatment and control of helminthiases are likely to bring huge socio-economic benefits to some of the least developed countries.
In addition to the direct health improvements from a reduction in helminth infections, people in endemic areas could also benefit indirectly e.g. by an improved response to vaccinations, by reduced transmission or by an improved disease outcome for other diseases such as tuberculosis, malaria, and HIV/AIDS, as co-infections with helminths have been shown to have potentially adverse effects (see Elliott and Yazdanbakhsh, 2012, and other articles in the same issue of this journal for recent reviews).
Looking further into the future, a thorough understanding of nematodes and their interactions with the human immune system may lead to fundamental new insights that will allow a much more sophisticated manipulation of the human immune system for medical purposes. Such knowledge may ultimately be exploited for and benefit the effective treatment of allergies and other (autoimmune) diseases.
Secondary beneficiaries include UK school children (through their participation in the IRIS project, which we will support), and the UK science community more generally, which benefits from the involvement of young people in actual scientific work, which is likely to kindle interest in scientific careers.
Parasitic worms are studied with the aim of killing or controlling them. WormBase will significantly facilitate the direct exploitation and application of large-scale data from sequence based experiments towards this aim, through the provision of accessible, accurate, well-described reference data. Furthermore, the proposed resource will leverage the in-depth and highly organised knowledge-base of C. elegans into the poorer-resourced areas of parasitic nematode research, and from the reference data produced in this project to even less-well studied diseases.
In the longer term, potential beneficiaries include drug companies, farmers, and most strikingly the millions of individuals whose lives are currently blighted and or curtailed but which could be remedied by appropriate programs of treatment or remediation. The application of genomic science towards medical improvements is still in its infancy, but for pathogens with smaller and simpler genomes such as viruses and bacteria, genomic insights are already being translated into tangible benefits e.g. tracking pathogen transmission and the monitoring of drug resistance. For parasitic nematodes, genomic research is expected to deliver significant benefits in the medium term, for which ready access to the data is a clear requirement. The primary beneficiaries will be the people directly suffering from nematode infections, which includes about 2 billion people infected with soil-transmitted helminths alone (WHO 2012). Advances in drug treatment, transmission reduction or vaccination could improve the lives of many people who may otherwise suffer from serious gastrointestinal disease, stunted growth and mental development, malnutrition and fatigue, disfigurement, blindness, or liver and bladder pathologies. Although some effective anthelmintics exist, the available arsenal of drugs is limited and the spread of drug resistance a real danger. Furthermore, large-scale improvements in the treatment and control of helminthiases are likely to bring huge socio-economic benefits to some of the least developed countries.
In addition to the direct health improvements from a reduction in helminth infections, people in endemic areas could also benefit indirectly e.g. by an improved response to vaccinations, by reduced transmission or by an improved disease outcome for other diseases such as tuberculosis, malaria, and HIV/AIDS, as co-infections with helminths have been shown to have potentially adverse effects (see Elliott and Yazdanbakhsh, 2012, and other articles in the same issue of this journal for recent reviews).
Looking further into the future, a thorough understanding of nematodes and their interactions with the human immune system may lead to fundamental new insights that will allow a much more sophisticated manipulation of the human immune system for medical purposes. Such knowledge may ultimately be exploited for and benefit the effective treatment of allergies and other (autoimmune) diseases.
Secondary beneficiaries include UK school children (through their participation in the IRIS project, which we will support), and the UK science community more generally, which benefits from the involvement of young people in actual scientific work, which is likely to kindle interest in scientific careers.
Publications
International Helminth Genomes Consortium
(2019)
Comparative genomics of the major parasitic worms.
in Nature genetics
Harrison P
(2024)
Ensembl 2024
in Nucleic Acids Research
Howe KL
(2020)
Ensembl Genomes 2020-enabling non-vertebrate genomic research.
in Nucleic acids research
Yates AD
(2022)
Ensembl Genomes 2022: an expanding genome resource for non-vertebrates.
in Nucleic acids research
Wood V
(2020)
Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns.
in Open biology
Gene Ontology Consortium
(2021)
The Gene Ontology resource: enriching a GOld mine.
in Nucleic acids research
Grove C
(2018)
Using WormBase: A Genome Biology Resource for Caenorhabditis elegans and Related Nematodes.
in Methods in molecular biology (Clifton, N.J.)
Davis P
(2022)
WormBase in 2022-data, processes, and tools for analyzing Caenorhabditis elegans.
in Genetics
Harris TW
(2020)
WormBase: a modern Model Organism Information Resource.
in Nucleic acids research
Description | Bioinformatics and Biological Resources Fund |
Amount | £500,000 (GBP) |
Funding ID | BB/P024610/1 |
Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
Sector | Public |
Country | United Kingdom |
Start | 06/2017 |
End | 01/2021 |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S1 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S1 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S2 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S2 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S3 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S3 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S4 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S4 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S5 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S5 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S6 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | Supplementary material from "Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns" |
Description | Table S6 |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | https://rs.figshare.com/articles/dataset/Supplementary_material_from_Term_Matrix_a_novel_Gene_Ontolo... |
Title | WormBase ParaSite Release 17 |
Description | An update to the open access database WormBase ParaSite. Contain the following: Integration of AlphaFold 3D protein structures for 8 species. Addition of 11 new genome assemblies of which 6 are new species. Annotation updates for 2 genomes. Gene-phenotype associations |
Type Of Material | Database/Collection of data |
Year Produced | 2022 |
Provided To Others? | Yes |
Impact | Usage stats not yet analysed |
URL | https://parasite.wormbase.org |
Title | WormBase- ongoing releases |
Description | WormBase is primarily a model organism resource which provides deeply curated genomic and functional genomic data to enable C elegans research. Six WormBase data releases are produced per year, reflecting ongoing curation and data integration. |
Type Of Material | Database/Collection of data |
Provided To Others? | Yes |
Impact | Four helminth species (Brugia malayi, Onchocerca volvulus, Trichuris muris and Strongyloides ratti) are now also integrated into WormBase. These genomes can benefit from the deeper level of curation that the WormBase infrastructure supports, including gene model curation (with full history tracking) and literature curation. |
URL | https://wormbase.org/ |
Description | WormBase consortium |
Organisation | WormBase (Biology and Genome of C.Elegans) |
Country | United States |
Sector | Charity/Non Profit |
PI Contribution | WormBase Consortium is led by Paul Sternberg of CalTech, Kevin Howe of the EBI, Matt Berriman of the Wellcome Sanger Institute, and Lincoln Stein of the Ontario Institute for Cancer Research. The consortium runs a model organism database containing data from research on C. elegans and other nematodes. WormBase Parasite provides searching and data access capabilities that are not available through the WormBase website |
Collaborator Contribution | WormBase curates reference genomes which are then imported into WormBase Parasite and provide important functional information for understanding the genomes of comparator species. |
Impact | Provision of annotated genomes for C. elegans and Brugia malayi |
Start Year | 2014 |
Description | Advanced Course in Helminth Bioinformatics |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | Three members of the WormBase ParaSite team acted as instructors in a 5 day workshop on Helminth Bioinformatics, hosted by the West African Centre for Cell Biology of Infectious Pathogens, Accra, Ghana. The workshop had approximately 20 participants, all postgraduate or post-doctoral researchers based in institutions across Africa. Material covered included use of the WormBase ParaSite database, an introduction to the Linux command line, transcriptomics, genome assembly, and population genetics. The main goal was to increase the genomics/bioinformatics capacity of helminth researchers in Africa. The course received excellent formal feedback, with WormBase Parasite being voted one of the most useful sections by many participants. |
Year(s) Of Engagement Activity | 2019 |
URL | https://coursesandconferences.wellcomegenomecampus.org/our-events/helminth-bioinformatics-ghana-2019... |
Description | BSP drop in helpdesk |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Postgraduate students |
Results and Impact | We ran a drop in desk at the annual British Society of Parasitology Spring meeting. We answered questions and demonstrated features from the user community. |
Year(s) Of Engagement Activity | 2019 |
Description | BioMart webinar |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | A webinar (comprising presentation and question and answer session) aimed at worm researchers, covering data mining strategies in BioMart. The webinar remains available on the WormBase YouTube channel. |
Year(s) Of Engagement Activity | 2021 |
URL | https://www.youtube.com/watch?v=IM2j7-OPmtQ |
Description | Helminth Bioinformatics (Asia) (Virtual) |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | Three member of the team participated as trainers trainers in a bioinformatics course that targeted helminth researchers in Asia. The course curriculum included extensive use of WormBase Parasite. |
Year(s) Of Engagement Activity | 2021 |
URL | https://coursesandconferences.wellcomeconnectingscience.org/event/helminth-bioinformatics-asia-virtu... |
Description | Helminth genomics workshop, Shanghai |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | A WormBase ParaSite staff member acted as lead instructor on a 2 day course in Helminth Bioinformatics in Shanghai, China. The course had approximately 40 participants, largely postgraduate students, from across China. Material covered included basic concepts in bioinformatics and use of the WormBase ParaSite database. Formal feedback was very positive. |
Year(s) Of Engagement Activity | 2019 |
Description | KS2 seminar/workshop at local primary school |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | Local |
Primary Audience | Schools |
Results and Impact | Gave an assembly to ~120 pupils from Key stage 2 at local primary school. Included live microscopy and a science talk. |
Year(s) Of Engagement Activity | 2019 |
Description | Poster presentation at the Parasitic Helminths: New Perspectives in Biology and Infection meeting |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | A WormBase ParaSite staff member presented a poster on the WormBase ParaSite resource at the Parasitic Helminths: New Perspectives in Biology and Infection conference (Hydra, Greece). Users were informed of new features and given the chance to ask questions on use of the resource. |
Year(s) Of Engagement Activity | 2019 |
Description | Short talk in "Bridging the Divide" workshop, International C elegans meeting, UCLA |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | A WormBase ParaSite staff member gave a short talk on the resource in a parasitology session at the International C elegans Meeting. The audience included both parasitologists and C elegans researchers. |
Year(s) Of Engagement Activity | 2019 |
Description | Training workshop at Schistosomiasis conference in Brazil |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Postgraduate students |
Results and Impact | A training workshop was held at the Schistosomiasis symposium in Brazil. The workshop introduced participants to WormBase ParaSite and showed them how to perform queries to extract research data |
Year(s) Of Engagement Activity | 2022 |
Description | Virtual workshop at British Society for Parasitology online confernce |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | Operated a virtual booth at a three-day conference, which included a 2-hr open workshop session on the last day |
Year(s) Of Engagement Activity | 2021 |
URL | https://bsp.uk.net/2020/07/24/parasites-online-2021/ |
Description | WormBase ParaSite webinar |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Two WormBase ParaSite staff members ran a webinar on the use of the resource, aimed at all helminth researchers. The webinar received positive feedback, and remains available online as training material. |
Year(s) Of Engagement Activity | 2019 |
URL | https://www.ebi.ac.uk/training/online/course/introduction-wormbase-parasite-resources |