Making connections with GO: an integrative approach to highlighting medically relevant Drosophila data

Lead Research Organisation: University of Cambridge
Department Name: Physiology Development and Neuroscience

Abstract

At first glance fruit flies and humans seem very different. But if we look closer we see that they share certain features: limbs, a brain, a heart and much more besides. If we look even more closely we see that this is because we share genes in our DNA. We can use these similarities to study human disease because three quarters of genes associated with human disease are found in fruit flies too. Disrupting genes in fruit flies can tell us how they normally work in humans, how they cause diseases when they are not working well and give us clues about how to we might be able to fix problems caused by the faulty genes. Fruit flies have many advantages for studying disease. For example, we can do many different experiments with flies very quickly, and they get old within weeks, so they are really useful for studying neurodegenerative conditions such as Alzheimer's disease.
Extensive research is carried out using fruit flies by scientists around the world, generating a huge amount of data. It is important that scientists are able to easily and quickly access these data so that they can build on existing knowledge to devise new experiments in their own research. This is where biological databases come in: they have dedicated teams of people (curators) who read the published research papers and enter the information into a computer database. Curators do this by linking scientific terms, taken from special dictionaries, to biological objects such as genes. The aim of our project is to use one such scientific dictionary, called the Gene Ontology, to attach labels to genes that describe what they do and where they do it. We will focus on those fruit fly genes that are equivalent to human genes that are implicated in diseases. This work will be done in the context of gene networks - knowing how genes function together gives a better understanding of how they contribute to our day to day life, and how their disruption leads to disease. Altogether, this project will facilitate the transfer of knowledge gained in fruit flies to the medical community, ultimately helping the development of effective treatments of human diseases.

Technical Summary

The aim of this research proposal is to enhance the utility and accessibility of data from Drosophila to the biomedical research community. Our approach will center on the Gene Ontology (GO), a unifying vocabulary for describing the attributes of gene products across species and databases. We will focus our efforts on improving the annotation of pathways and protein complexes relevant to human disease. In parallel, we shall undertake a comprehensive annotation of non-coding RNAs - an important emerging area in human health. Whilst achieving these aims, we will work with the GO consortium to develop the ontology itself and adopt their enhanced annotation framework. Furthermore, in collaboration with FlyBase, we shall present pathways and complex components as easily accessible lists that are highly integrated with all other data in FlyBase. Subsequently, we will enhance these pages by adding graphical models of pathways that will highlight intersections with human disease gene orthologs and those genes studied in Drosophila disease models. Overall, our approach will improve the functional annotation of Drosophila genes related to human disease, and present these data in intuitive and accessible ways to the wider research community to maximise the impact of fly research on biomedical advances.

Planned Impact

The proposed research will also have beneficiaries in non-academic sectors and the general public in the UK and abroad. There will be an economic benefit to the pharmaceutical industry. The number of new drugs coming on to the market has declined significantly and traditional in vitro high-throughput screening techniques are failing to yield new therapies. As we gain more understanding of human disease, it is clear that most diseases are multi-factorial; the hopes of 'the-one gene-one-drug' or 'magic bullet' model will not be realized in the majority of cases. The data we collate can be used to inform basic pathway biology in humans. As the data from Next-Generation Sequencing of human genomes enters the clinical arena, we will need good functional annotations to interpret the masses of data to aid the hunt for new therapeutic targets. Drosophila can be used at all stages of the drug-discovery process, from modeling the disease (e.g. Alzheimer's disease, Parkinson's disease, cancers) to an in vivo model for high-throughput screening. Therefore linking gene function data to Drosophila disease models and human disease genes is of great value to such research strategies. Ultimately, this will help bring new drugs to the market and improve the health of the population. With an ageing population there is increased financial pressure on the health service, charities, local government and ultimately the UK taxpayer. The benefits of Drosophila as a model for neurodegenerative and cardiovascular disease will be particularly needed as we head towards an ageing population. We will facilitate this by disseminating the functional annotation of Drosophila gene products by providing our most up-to-date data every 2 months to the GO consortium, the FlyBase database, UniprotKB and RNAcentral. From these sites, the data will be freely available and searchable. All tools for the enhanced searching of Drosophila data are freely available from the FlyBase website.
Drosophila is also an important model for researchers studying insects that seriously impact on human health. There are many disease-causing parasites and viruses borne by insect hosts (vector borne diseases, e.g. sleeping sickness, malaria, dengue). They have a huge medical and economic impact in the developing world and as climate patterns shift, the effects could reach further into Europe and the UK. Globally, insect crop pests have a serious economic impact on agriculture. In the UK it is estimated that the agri-food sector contributes 7% of the economic activity of the nation. With a rise in pesticide resistance and the negative environmental impact of using such technologies, the understanding of Drosophila biology underpins the development of many new strategies to pest control. Functional interpretation of the genomes of disease-carrying insects such as mosquitoes, the Tsetse-fly, and crop pests such as the black fly and the Mediterranean fruit fly, relies heavily on the extensive experimental data available in Drosophila. By making this data freely available, it can be used to annotate the genomes of these closely-related species.

Publications

10 25 50
 
Description FlyMet Scientific Advisory Board Member
Geographic Reach Multiple continents/international 
Policy Influence Type Participation in a guidance/advisory committee
URL http://FlyMet.org
 
Description Introduction of an annotation framework and standards for high-throughput data using the Gene Ontology
Geographic Reach Multiple continents/international 
Policy Influence Type Membership of a guideline committee
 
Title Developing pipeline to produce lists of selected references in FlyBase 
Description An algorithm was developed based on curated data in FlyBase that selects research publications most likely to contain substantial information about a particular gene/gene product. This allows users of FlyBase to view a selected subset of the publications where a gene is studied. 
Type Of Material Computer model/algorithm 
Year Produced 2018 
Provided To Others? Yes  
Impact Previously key papers on the function of a particular gene were lost in a list of all papers on the gene. These lists provide key papers for each gene accelarating researchers ability to perform their own research on that gene. 
URL http://flybase.org/reports/FBgn0000014.html#pubs
 
Title Drosophila GO slim 
Description A 'generic' GO slim - a subset of terms in the ontology, has been made to facilitate analysis of Drosophila data (for example, in enrichment analysis). The biological process slim comprises 87 terms relating to discrete processes, the molecular function slim comprises 47 activity classifications, and the cellular component slim comprises 20 terms encompassing the major sites of localization. 
Type Of Material Data analysis technique 
Year Produced 2019 
Provided To Others? Yes  
Impact This is available to many GO analysis tools. Notably, we are collaborating with the DRSC/TRiP Functional Genomics Resources to provide a suite of Drosophila data analysis tools. 
URL http://geneontology.org/docs/download-ontology/#subsets
 
Title FlyPhoneDB 
Description Collaborated on an online tool to predict cell-cell communication between cell types from single-cell RNA sequencing data in Drosophila. 
Type Of Material Data analysis technique 
Year Produced 2022 
Provided To Others? Yes  
Impact Directly cited in 6 research articles up to March 2023. 
URL http://www.flyrnai.org/tools/fly_phone
 
Title Gene Ontology and Evidence and Conclusion Ontology 
Description Helen Atrill led a working group on annotation rules and standards in high-throughput data annotation using the GO. Introduced new evidence codes to make annotation derived from high-throughput experiments visible. Wrote guidelines for use. 
Type Of Material Data handling & control 
Year Produced 2017 
Provided To Others? Yes  
Impact Members of the GO consortium were asked to review lists of potential high-throughput annotations. Most groups have now reviewed these sets and retrofitted where required (either by removing the annotations as they do not satisfy criteria for GO annotation or using a high-throughput evidence code. This has helped raise the quality of GO annotated data across many contributing groups. 
URL http://wiki.geneontology.org/index.php/Guide_to_GO_Evidence_Codes
 
Title Interactive pathway network diagrams 
Description We have developed evidence-weighted interactive network diagrams which combine GO annotation data and FlyBase physical interaction data. The networks are generated using experimental data - node size is based on the number of research papers that have been annotated as demonstrating the gene's pathway membership and the connections between nodes are derived from physical interaction data. The networks are interactive - 2 viewing options are available: a pathway view, which distinguishes regulatory interactors, and a functional view, which colors nodes depending on the functional class (e.g. kinase) of the gene product. 
Type Of Material Computer model/algorithm 
Year Produced 2019 
Provided To Others? Yes  
Impact Researchers can see connections between multiple sets of data (GO annotation, physical interaction, gene group data) in a graphical summary, without having to trawl papers or look at separate sections of a webpage. They can see where there are gaps in experimental knowledge or see connections that are not readily seen in an unconnected data model. 
URL http://flybase.org/lists/FBgg/pathways
 
Title Introduction of Pathways to Alliance of Genome Resources Gene pages 
Description As part of a wrorking group, have implemented the display of pathway data from Reactome and GO-CAMs on pathway gene pages. 
Type Of Material Computer model/algorithm 
Year Produced 2021 
Provided To Others? Yes  
Impact This is the first instance of a use-case for GO Casual Activity Models (GO-CAMs). 
URL https://www.alliancegenome.org/
 
Title Machine learning pipeline to predict novel pathway interactors 
Description Using our high-quality set of curated pathway members lists, we have developed a machine learning pipeline to predict novel candidate pathway interactors. By using the curated members of a pathway as a positive training set, and random genes as a negative set, we trained random forest classifier models to recognise the biological properties of the pathway. The input features were mostly drawn from datasets available in FlyBase and included gene expression levels across a large number of tissues and developmental stages, protein-protein interactions, genetic interactions and evolutionary gene ages. We applied the classifier models for the 16 curated pathways to all known Drosophila protein-coding genes, to produce lists of new candidate members. Each gene is assigned a probability of belonging to the pathway, allowing us to rank the lists by this probability. These lists are available upon request. 
Type Of Material Computer model/algorithm 
Year Produced 2020 
Provided To Others? Yes  
Impact Two fly research labs have requested and received pathway candidate lists. 
 
Title Mapping between FlyBase Drosophila genes and RNAcentral entries & GO annotations to ncRNAs in RNAcentral 
Description Reciprocal pipeline established to provide Drosophila gene identifiers to RNAcentral and to update RNAcentral identifers and linkouts in FlyBase. 
Type Of Material Data handling & control 
Year Produced 2018 
Provided To Others? Yes  
Impact FlyBase included as an RNAcentral expert database. There was a significant review of ncRNAs by FlyBase, and FlyBase now supplies high-quality, manually reviewed ncRNAs for Drosophila every release. There are reciprocal links between FlyBase and RNAcentral, allowing easy transit between the entries at each site. RNAcentral also imports GO annotations made by FlyBase/MRC curators. 
URL http://ftp://ftp.flybase.net/releases/current/precomputed_files/genes/ncRNA_genes_fb_2020_01.json.gz
 
Title Pathway Pages in FlyBase - Phase 1 
Description The annotation of genes using the Gene Ontology has been used to build pathway reports in FlyBase. Using an evidence-weighted model of curation, we have produced high-quality lists of pathway members and regulators for 12 major signaling pathways. This project is in its first iteration and will form the foundation for more advanced pathway modeling. 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
Impact This has allowed users to quickly access validated lists of pathway members/regulators and assess how much experimantal evidence supports their inclusion. Concurrent improvements to GO annotation allow this benefit to spread beyond FlyBase to users of secondary bioinformatics tools. The data is being used to predict new pathway members and drive network modeling. 
URL http://flybase.org/lists/FBgg/pathways
 
Title Pathway Pages in FlyBase- Phase 2 
Description The annotation of genes using the Gene Ontology has been used to build pathway reports in FlyBase. Using an evidence-weighted model of curation, we have produced high-quality lists of pathway members and regulators for 15 major signaling pathways. We have undertaken a second pass of literature curation, to build the evidence model and add more regulatory components. The pathway memebers/regulators tables are now interactive and can be customised to order and display information of interest, such as available antibodies, classical/Insertion alleles, transgenic constructs, experimental human disease models in flies, computed potential disease links via ortholgy and human orthologs. In addition, the pathway pages now host interactive network diagram (see related entry in Research Databases & Models) 
Type Of Material Database/Collection of data 
Year Produced 2020 
Provided To Others? Yes  
Impact This delivers a major goal of the grant - linking multiple types of experimental data to facilitate translational research and easy access to research tool. 
URL http://flybase.org/lists/FBgg/pathways
 
Title Pathway Pages in FlyBase- Phase 3 
Description Enhancement of Pathways pages by adding: 1. Pathway Thumbnails: with the aim of providing a consistent "text book" representation of signaling pathways 2. Implementation of stacked GO Ribbons to allow easy comparison of functional information for genes of Gene Groups and Pathways. http://flybase.org/lists/FBgg/pathways http://flybase.org/lists/FBgg/ 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
Impact Thumbnails are available to download and modify by all users - very positive user feedback. Thumbnails have been used by DRSC/TRiP Functional Genomics Resources & DRSC-BTRR for scRNAseq tool. Both thumbnails and stacked ribbons enable users to get a quick over-view of the data. 
 
Title Presenting human disease data in FlyBase 
Description For D.mel genes, a graphical over-view of human diease fly models used in experimental research and potential disease models using orthology to human genes annotated by an OMIM disease association. A pipeline to map OMIM-human conenctions to the Disease Ontology-fly orthologs was developed and new evidence codes added support the provenance of these annotations was added to FlyBase. 
Type Of Material Computer model/algorithm 
Year Produced 2019 
Provided To Others? Yes  
Impact Making fly research on human diseases more accessible to users of FlyBase and providing an over-view of potential disease-model genes. 
 
Title Summarizing Gene Expression Data in FlyBase 
Description A graphical overview of expression data describing temporal and spatial gene expression patterns. 
Type Of Material Computer model/algorithm 
Year Produced 2019 
Provided To Others? Yes  
Impact Expression data has been made more accessible to users. 
 
Title Summarizing Gene Ontology data in FlyBase 
Description We helped generate Gene Ontology Summary Ribbons summarizing Gene Ontology data for each gene in FlyBase. These Ribbons are derived from the functional data associated with each gene captured using the Gene Ontology. They are graphical gene signatures, designed to give an immediate overview of a gene product's properties and depth of characterization. 
Type Of Material Computer model/algorithm 
Year Produced 2018 
Provided To Others? Yes  
Impact The feedback from the research community that uses FlyBase has been very positive. Researchers report that it provides a very useful overview of what can be an overwhelming amount of data. 
URL http://flybase.org/reports/FBgn0000014.html#go_summary
 
Title Updated Human and Drosophila Biological Process slim 
Description Updated the human 'generic' Biological Process GO slim (a subset of terms in the ontology) in collaboration with members of the GOC - to better facilitate analysis of human data (for example, in enrichment analysis). Also, updated Drosophila slim to align with human slim, but also to improve coverage and biological meanfulness. 
Type Of Material Data analysis technique 
Year Produced 2021 
Provided To Others? Yes  
Impact Improves the coverage of enrichment analysis. 
URL http://geneontology.org/docs/download-ontology/#subsets
 
Description Alliance of Genome Resources 
Organisation National Institutes of Health (NIH)
Department National Human Genome Research Institute (NHGRI)
Country United States 
Sector Public 
PI Contribution FlyBase is one of the founding members of the Alliance of Genome Resources (Alliance). Helen Attrill has contributed to the Expression working group as part of aligning FlyBase expression data and Alliance ribbon summarization displays and to the Biological Function working group, dealing with the display of GO data on Alliance gene pages and cross-species comparison of GO data. Giulia Antonazzo also contributes to the Biological Function working group, attending monthly calls. Helen Attrill and Giulia Antonazzo are currently working with the Alliance Pathway working group to harmonize and display pathway data on Alliance pages.
Collaborator Contribution The Alliance has contributes from major model organism databases:FlyBase, Mouse Genome Database (MGD), Saccharomyces Genome Database (SGD), Rat Genome Database (RGD), WormBase, and the Zebrafish Information Network (ZFIN), and the Gene Ontology Consortium (GOC). Each member database contributes to the work on supplying, harmonizing and displaying cross-species data to compare with human data and facilitate translational research.
Impact The principal output is the Alliance website: https://www.alliancegenome.org/ and cross-database collaboration on data harmonization and sharing of resources and infrastructure. PMID:31552413
Start Year 2017
 
Description Collaboration with Complex Portal 
Organisation EMBL European Bioinformatics Institute (EMBL - EBI)
Country United Kingdom 
Sector Academic/University 
PI Contribution Curated protein complexes and supplied Complex Portal, EMBL-EBI with data to populate entries.
Collaborator Contribution QC/QA and addition of data to Complex Portal database.
Impact Fly complexes will be searchable at both flybase.org and https://www.ebi.ac.uk/complexportal/home
Start Year 2021
 
Description DRSC/TRiP Screening Center-Biomedical Technology Research Resource 
Organisation Harvard University
Department Harvard Medical School
Country United States 
Sector Academic/University 
PI Contribution We have collaborated with the Drosophila RNAi Screening Center (DRSC), Transgenic RNAi Project (TRiP) and Drosophila Research & Screening Center-Biomedical Technology Research Resource (DRSC-BTRR) to help them in the development of bioinformatics tools by providing data and feedback. This includes a gene set enrichment tool and a single cell RNA sequence data analysis tool.
Collaborator Contribution The Screening Center-Biomedical Technology Research Resource develop the tools and integrate data.
Impact DRSC/TRiP-FGR tools can be found at https://fgr.hms.harvard.edu/tools Publications: Gene2Function: An Integrated Online Resource for Gene Function Discovery. PMID:28663344 FlyPhoneDB: an integrated web-based resource for cell-cell communication prediction in Drosophila. PMID:35100387
Start Year 2018
 
Description FlyBase Consortium membership 
Organisation FlyBase Consortium
Country Global 
Sector Academic/University 
PI Contribution We make GO annotations to D.mel genes which are housed in FlyBase and ensure that the Gene Ontology is updated for each release of FlyBase and revise the GO annotations in line with ontology changes. We request new ontology terms from GOC as required by FlyBase curators. We train FlyBase curators to make functional annotations using the GO. As part of the GO consortium, we keep FlyBase updated on changes to GO annotation policy and work with FlyBase curators and developers to implement these changes. We attend FlyBase consortium meetings and lobby for changes to FlyBase that will make Drosophila research more accessible to researchers studying human genes or disease and assist in implementing these changes. We answer GO related-queries from FlyBase users. We are have worked with FlyBase to develop a text-mining approach to triage papers for GO and disease model annotation. We have worked with FlyBase to develop visual summaries of data including expression, GO and signalling pathways data. We curate gene groups (gene families and macromolecular complexes) into FlyBase. We add and keep author-submitted gene snapshots to FlyBase.
Collaborator Contribution They maintain the FlyBase database and associated website where our GO annotations, pathways and gene groups are stored and displayed. They provide developer support for implementing new data types associated with GO annotation, disease model curation and data visualization. FlyBase literature curators make GO annotations that supplement those made by the MRC funded GO curator.
Impact PMID:22127867 PMID:22554788 PMID:23125371 PMID:23160412 PMID:24234449 PMID:24715220 PMID:25398896 PMID:26109356 PMID:26109357 PMID:26467478 PMID:26935103 PMID:27494710 PMID:27730573 PMID:27799470 PMID:27930807 PMID:28663344 PMID:29761468 PMID:30364959 PMID:31933406 PMID:31960022 PMID:33219682
Start Year 2006
 
Description Gene Ontology (GO) consortium membership 
Organisation Gene Ontology Consortium
Country Global 
Sector Charity/Non Profit 
PI Contribution We make GO annotations and are the responsible for collating all Drosophila melanogaster annotation and submitting them to GOC. We contribute to the development of the Gene Ontology (request new terms, participate in specialist term development workshops, report errors). We attend GO consortium project meetings, workshops and regular conference calls where we contribute to discussion on all aspect of the Gene Ontology project particularly decisions related to annotation policy and quality control.
Collaborator Contribution The GO consortium load our Drosophila GO annotation set into their database (along with annotations from other species) and make it available for searching and download via their website. They provide us with quality control reports and suggest Drosophila additional annotations (based on phylogenetic analysis and inferences from relationships between terms in the ontology). They provide editorial assistance to change the Gene Ontology in response to our requests for new terms or error fixes.
Impact PMID:22102568 PMID:23161678 PMID:25428369 PMID:27899567 PMID:30395331 PMID:30715275 PMID:33290552
Start Year 2006
 
Description HUGO gene nomenclature committee (HGNC) 
Organisation HUGO Gene Nomenclature Committee
Country United Kingdom 
Sector Charity/Non Profit 
PI Contribution A yearly meeting with HGNC to compare strategies for collating, presenting and aligning human and fly gene lists (Gene Groups). Where possible, we make links to Gene Groups at HGNC Gene Groups. FlyBase supplies a correspondance file for HGNC so that they can add reciprocal links to FlyBase from their pages. There are now 464 links between FlyBase Gene Groups and equivalent human sets at HGNC.
Collaborator Contribution A yearly meeting to compare strategies for collating, presenting and aligning human and fly gene groups. HGNC add reciprocal gene group links to FlyBase from their pages.
Impact This facilitates comparison between protein complexes and functional classes (such as glycoside hydrolases) between D.mel and human Gene Groups.
Start Year 2016
 
Description RNAcentral collaboration 
Organisation RNAcentral
Sector Public 
PI Contribution FlyBase became on of the Expert Databases that contribute data to RNAcentral. FlyBase has made its GO annotations to ncRNA for D.melanogaster available to RNAcentral via the GOA database.
Collaborator Contribution RNAcentral has worked on QC/QA and establishing a pipeline/links to FlyBase and has imported GO annotations to ncRNA made by FlyBase.
Impact GO annotations to D.melanogaster ncRNAs have been made available to users of RNAcentral and QuickGO users. Update of Sequence Ontology in FlyBase to provide more descriptive labeling of ncRNA classes and better alignment with external resources, including RNAcentral. PMID:33106848 PMID:30395267
Start Year 2017
 
Description UniProtKB, Gene Ontology Annotation (GOA), InterPro collaboration 
Organisation EMBL European Bioinformatics Institute (EMBL - EBI)
Department Protein Sequences Resources
Country United Kingdom 
Sector Academic/University 
PI Contribution We provide GO annotations and mappings between FlyBase genes and UniProt proteins. We incorporate GO annotations for Drosophila made by UniProt into FlyBase and display them on our webisite. We provide feedback to the InterPro group about their mappings between protein domains and GO terms. UniProt display our GO annotations on their website. We add GO annotations for non-Drosophilid species, including human, to directly to the GOA database. We have collaborated on a review of Drosophila RNA polymerases with curators at UniProtKB.
Collaborator Contribution UniProtKB display our GO annotations on their website and make Drosophila GO annotations that we display on our website. They assign InterPro domains to UniProtKB entries and maintain a mapping between InterPro domains and GO terms; this information is used to infer automatic GO annotations in FlyBase. We have used the EBI GOA database and curation interface, Protein2GO, for GO curation since 2017. We have collaborated on a review of Drosophila RNA polymerases with curators at UniProtKB.
Impact GO annotations based on InterPro domains were updated with each release of FlyBase. In Oct 2013 there were 12,227 such annotations for Drosophila melanogaster. PMID:31933406
Start Year 2006
 
Description Alliance of Genome Resources All-Hands Meeting 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact A three day meeting of the Alliance of Genome Resources consortium to discuss integration of data, working groups, workflows, co-ordination and aims of the project. Attended by Giulia Antonazzo, Helen Attrill and Nick Brown.
Year(s) Of Engagement Activity 2020
 
Description Bi-yearly GO Consortium Meeting 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact A bi-yearly GO consortium meeting. The purpose of this meeting is to improve GO annotation practice, flag priorities and co-ordination of projects and pipelines. Helen Attrill often presents on updates to our project progress or proposals for changes to practices.
Year(s) Of Engagement Activity 2018,2019,2020,2021
URL http://wiki.geneontology.org/index.php/Consortium_Meetings_and_Workshops
 
Description Biological Function Working Group 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other audiences
Results and Impact Contribution (Helen Attrill) to Biological Function working group of Alliance (of genome resources) to help harmonise the way in which GO data is presented in a mulit-organism platform.
Year(s) Of Engagement Activity 2017,2018,2019,2020
 
Description Causal Activity Modeling Workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Attended a week-long workshop on making causal models using Noctua, the GO consortium's annotation tool that is in development, to learn about new features and testing for bugs and usability.
Year(s) Of Engagement Activity 2020
 
Description Departmental symposium talk 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Professional Practitioners
Results and Impact Giulia Antonazzo presented at Department of Physiology, Development and Neuroscience symposium aimed at advertising Drosophila pathway prediction candidates generated using a machine learning pipeline. As a result, two Drosophila labs expressed interest in using these prediction lists in their research. We provided them the top 200 candidates for those pathways for experimental characterization.
Year(s) Of Engagement Activity 2020
 
Description Expression Working Group 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other audiences
Results and Impact Contribution (Helen Attrill) to Expression Working group of Alliance (of genome resources) to help harmonise the way in which expression data is presented in a mulit-organism platform and inform the development of Expression summary displays in FlyBase.
Year(s) Of Engagement Activity 2018,2019
 
Description FlyBase Twitter 
Form Of Engagement Activity Engagement focused website, blog or social media channel
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact @FlyBaseDotOrg Used for announcing new features and meeting attendance. Tweetorials (#FlyBaseTweetorial) aimed at walking users through features on FlyBase.
Year(s) Of Engagement Activity 2016,2017,2018,2019,2020,2021
 
Description Introduction to Bioinformatic Resources and Gene Ontology Workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Postgraduate students
Results and Impact Assisted at a 2 day event organized and hosted by UCL aimed at helping researchers with accessing bioinformatics. The event was a hands-on workshop in which participants tried to use several databases and analysis tool via a step-by-step guide followed by helping them with analyzing them with own their data.
Year(s) Of Engagement Activity 2017
 
Description Lead working group on the annotation of gene product function from high throughput studies 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other audiences
Results and Impact Helen Attrill lead a working group to establish standards in the annotation of results from highpthroughput studies using the GO. As a result, 13 annotation projects reviewed and retrofitted data annotated using the GO. A paper resulted from this work and, at the time of publishing, the number of high throughput-evidenced annotations in the GO database was: 34,533 annotations, representing 4.5% of the total number of experimentally evidenced annotations in the GO database (from AmiGO, 2018-12-02, 10.5281/zenodo.1899458).
Year(s) Of Engagement Activity 2018
 
Description Metamorphosis workshop for schools 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Schools
Results and Impact School hands-on event for 22 Year 1/Foundation year children looking at life cycles of frogs, dragonflies and solitary bees. Followed by questions from children about animals and work in science. To allow observation and recording of lifecycles as part of the children's topic work, this activity was extended raising frogs and butterflies in the classroom.
Year(s) Of Engagement Activity 2017
 
Description Organization and attendance at GO consortium meeting 2017 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This annual meeting of the GO consortium is essential for improving GO annotation and developing priorities for the upcoming year
Year(s) Of Engagement Activity 2018
URL http://wiki.geneontology.org/index.php/2017_Cambridge_GOC_Meeting_Agenda
 
Description Pathway Advisor Interviews & Consultation 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Other audiences
Results and Impact Four experts from different fields of pathway research were interviewed about what data they would find useful for their research and how FlyBase could provide this. This information was then used for generate options for phase 1 and 2 of FlyBase pathway pages. These options were then reviewed by researchers and the FlyBase team to generate the specification for the first phase.
The information was used to inform the evidene-weighted model for pathway curation.
Year(s) Of Engagement Activity 2017
 
Description Pathway Working Group 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Helen Attrill and Giulia Antonazzo are part of a working group examining methods of presenting pathway information for difference species on the Alliance of Genome Resources webpage.
Year(s) Of Engagement Activity 2020,2021
URL https://www.alliancegenome.org/
 
Description Protein Complex Working Group 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Participation (Helen Attrill) in protein complex working group for annotation standards and practices for protein-containing complexes. Representatives from different databases meeting on weekly - monthly basis.
Year(s) Of Engagement Activity 2018,2019
 
Description School Science Week Activity - Purification & Infectious Diseases 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Schools
Results and Impact Helen Attrill designed and ran a hands-on science activity for a primary school (of ~120 children) looking at separation and purification techniques and discussion of water-borne diseases. Involved testing/comparison, recording outcomes and analysis of results using the scientific method.
Year(s) Of Engagement Activity 2020
 
Description School visit (Cambridgeshire) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Schools
Results and Impact Primary school hands-on activity for children aged between 5-6 years old. Fun activity aimed at increasing interest in science and as a primer for evolution for younger audiences.
Year(s) Of Engagement Activity 2018
 
Description School visit - Form and Function 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Schools
Results and Impact Demonstration and hands-on activity within school looking at the link between form and function for 30 children between the ages of 5-7. At the end, children engaged in a craft activty encouraging them to think about an animal's environment and behavour and how that relates to coloration.
Year(s) Of Engagement Activity 2019
 
Description Signaling Pathway Working Group 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other audiences
Results and Impact Curator-led working group to review and revise signaling pathway annotations for wnt, ras and GPCR signaling. Subgroups identified annotations to review from other databases and supervised revision.
Year(s) Of Engagement Activity 2018
 
Description Signaling Pathway Workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Organization and participation in a Signaling Pathways Workshop aimed at improving GO annotation practices of signaling pathway. Approximately ~20 attended from different annotation groups, databases and bioinformatics projects to discuss harmonizing curation of Signaling pathways.
Year(s) Of Engagement Activity 2018
 
Description Talk at Postgraduate Certificate in Biocuration course 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Postgraduate students
Results and Impact Presentation and discussion on the pros, cons and pitfalls of curating high-throughput data to a group of ~10 students taking a PG certificate in biocuration, plus ~5 leaders of other databases/curation projects. Students were from different backgrounds, including commercial settings, and many aspects were discussed, imparticular FAIR principles.
Year(s) Of Engagement Activity 2018
 
Description Transcription Factor Annotation Workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact A three day workshop held by Gene Regulation Ensemble Effort for the Knowledge Commons (GREEKC), GOC and UCL Gene Function Annotation group aimed at accurately classifying DNA binding transcription factors and co-regulators using GO was attended by Helen Attrill. Workshop consisted of curating example papers, discussing and refining rules and producing guidelines. This was followed up by a review and revision annotations.
Year(s) Of Engagement Activity 2020
URL https://www.greekc.org/working-groups/