📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

2020BBSRC-NSF/BIO: REDEFINE - Development of efficient, large-scale metagenomics sequence comparison algorithms to facilitate novel genomic insights

Lead Research Organisation: European Bioinformatics Institute
Department Name: Genome Assembly and Annotation

Abstract

Microbes are ubiquitous and perform essential roles that help sustain life on earth, for e.g. environmental oxygenation, soil nutrient cycling to support plant growth or facilitating animal digestion. They cause many diseases in plants and animals and have the ability to rapidly evolve to exploit new niches and/or combat antimicrobials. A relatively new field, metagenomics is a culture independent method that applies sophisticated DNA sequencing technologies to analyse the total microbial genetic material from any environment. It is now possible to reassemble the millions of short DNA sequences to produce representations of the microbial genomes in a sample, termed metagenome assembled genomes (MAGs), especially for bacteria. While this approach remains computationally expensive, the computer algorithms used to recover these genomes have been substantially improved to increase accuracy of MAGs. Just in the past five years, many large-scale studies, including our own, have successfully applied these techniques to cumulatively generate millions of MAGs. This has provided scientists with novel insights into ~99% of organisms yet to be experimentally cultured and dramatically expanded the Tree of Life. These MAGs are reshaping our understanding of microbial community structure and the functional capacities of constituent members.

This explosion in MAG numbers nevertheless presents new challenges. These large-scale analyses can generate genomes at magnitudes that match GenBank's large genome collection, which is derived from traditional techniques of sequencing experimentally isolated microbes. Such genome collections have taken decades to build and are managed by large data centres. Yet, there is now the need for groups to routinely perform comparisons between new MAG collections and such large reference genome collections. We propose to use a particular class of algorithm called MinHash, which rapidly estimates similarity between two sets based on the number of shared entities, in our case short sequences. Most implementations of this approach have focused on the rapid comparison of one genome to another. In this proposal, we aim to use a range of computational techniques to enable the comparison of a large query dataset to a large reference database, with the purview of being applied to microbial genomes, MAG collections and metagenomic sequences. We will develop and apply this tool to a range of datasets, particularly those housed in MGnify, a leading database of metagenomic data. The key applications are the identification of errors in MAGs which were introduced by the computational methods, data reduction by identifying duplicate MAGs between datasets, the rapid incorporation of MAGs into catalogues of genomes that have been found in a particular environment, taxonomic classification of MAGs (by converting similarity distances to evolutionary distances), and the profiling of metagenome datasets to determine which genomes are likely to be found. The latter set of profiles will also enable the delineation of datasets that are poorly characterised by MAG/genome collections and prioritise them for analysis (i.e. MAG generation).

The outputs of this proposal are manifold. The first is a suite of software tools and associated workflows that can be installed and run on the computer command line. The application of the tool will lead to multiple new data outputs (refined MAGs, improved catalogues and metagenomic profiles) which will be made available via MGnify's web interfaces. To provide rapid access to these MAG catalogues, we will also deploy new web interfaces (implementing the new tools) that allow users to compare their own MAGs against established collections. This will not only democratise scientific research but also reduce the need for data duplication. We will also use specific use cases to demonstrate the utility of our tools and provide training and support for their use.

Technical Summary

Over the past five years, assembled metagenomic sequence data have been generated at such accelerated rates that they have overtaken the volume of sequence data from isolate microbial genomes. Furthermore, the numbers of distinct bacterial strains recovered from metagenomes already match the orders of magnitude of isolate strains (100,000s). This massive and continual expansion in size and numbers of metagenomics datasets are increasingly yielding metagenome assembled genomes (MAGs). Thus, there is an urgent need to produce new tools and resources that enable large-scale genome comparisons, as no existing approaches sufficiently scale to deal with both large queries and large reference databases. In this proposal, we will extend the functionality of sourmash, a widely used tool for performing sequence comparison using MinHash approaches and apply it to datasets in MGnify. To achieve the necessary scalability we will optimise search through algorithmic optimisations (e.g. heuristics, caching), precalculation of sketches for reference databases and horizontal scaling using multiple compute nodes. Additional improvements will be achieved via implementations in Rust, the multi-paradigm programming language. Collectively, these will enable us to perform large-scale database comparisons that will enable:
(1) Detection of contaminating contigs in MAGs and reference genomes;
(2) Removal of redundancies between MAG collections by intra- and inter-dataset comparisons;
(3) Assignment of taxonomy to MAGs by converting MinHash distances into evolutionary distances;
(4) Profiling of short-read metagenomics datasets to detect novelty and permit matching to MAG collections.
These tools will be made available to the community as standalone software packages and via new web interfaces in MGnify, which will provide unparalleled access to the MGnify MAG catalogues. These tools will also be applied to MGnify data to improve MAG quality and help prioritise datasets for analysis.

Publications

10 25 50
 
Description One of the aims of this project is to use new tools to evaluate the quality of genomes produced from metagenomes, i.e. metagenome assembled genomes (MAGs). A key part of this work package was to evaluate whether the Charcoal tool could be utilised to systematically identify mis-placed contigs within a MAG. After developing a benchmark based on both isolate genomes and MAGs, our work identified two critical findings which meant that Charcoal could not be used to systematically identify erroneously placed contigs. Firstly, a comparison of isolate genomes showed that between species there are major discrepancies coming from horizontally transferred contigs, which represent strain-diversity when comparing conspecific genomes. Secondly, there are many isolate genomes that are heavily contaminated, either because they have two or more genomes represented within the contigs, or have contigs that charcoal can match to genomes corresponding to other species. As such, we decided to evaluate alternative approaches to the use of Charcoal. For refining the MAGs produced by MGnify we have adopted the following approach - (i) when generating MAGs, we now systematically search for both prokaryotic and eukaryotic MAGs; (ii) our binning approach always uses multiple binners, and we employ the logic of MetaWrap to take the consensus bin; and (iii) we use CheckM2 to evaluate the completeness of our bins, with the addition of checking the fraction of viral contigs in the bin, which has been identified as a point of failure of CheckM2. After MAG-generation, we submit the MAGs to the European Nucleotide Archive (ENA), such that there is complete provenance of their generation. Following this, both MGnify-generated MAGs and those produced by the community are collected in a biome-specific fashion, and further quality controlled. Where a species is represented by only one genome, we use GUNC to try to detect additional points of contamination. This has been integrated into our MGnify Genomes catalogue generation pipeline.

As part of this work, we have established a systematic pipeline for the generation of the catalogue, which produces a set of representative species (collapsing redundant genomes) to facilitate the representation of species found within a biome. We have established a comprehensive pipeline to build and update MGnify Genomes catalogues, which leveraged some of the efforts in WP3 to identify additional datasets. To date, MGnify has 12 different biome-specific genome catalogues (https://www.ebi.ac.uk/metagenomics/browse/genomes), which encompass over 0.5 millions genomes (isolates and MAGs), which represent nearly 30,000 species. Each species has a representative genome, which is richly annotated with functions, pathways, and biosynthetic gene clusters, as well as the annotation of integrated phage genomes and regions that represent components of the mobilome. The pipeline also produces a range of widely used data, including gene and protein catalogues, and databases for widely used tools (e.g. Kraken). The species representative of the catalogues are made available with a genome browser via the MGnify website. This pipeline is currently being migrated to Nextflow to harmonise the systems within the resource.

We have also developed a procedure to use the sourmash tool to identify datasets with potential to contribute the most novelty to a MAG catalogue. To this end, the catalogue was compared to unassembled metagenomes from the same biome, and those that showed the least overlap were prioritised for assembly and MAG generation. This was applied iteratively to incorporate additional datasets. After each iteration, the number of new species contributed by an assembly to a MAG catalogue was assessed. In the case of new catalogues, this approach was successful in identifying datasets that should be prioritised. However, this approach was limited when it came to more complete catalogues like the human gut. We suspect that this is linked to differences being largely manifested by low abundant organisms that can not be easily assembled, as well as detecting sequence differences below the taxonomic level of species (i.e. strain differences).


Elements of sourmash have now been incorporated into the MGnify website. Here MGnify has produced a JavaScript component that generates a sketch of a query genome (an efficient representation of the genome sequence) on the client/web browser. This sketch is then sent to the MGnify server for searching against the MAG catalogues using sourmash, providing efficiency in both search and data transfer. This approach has facilitated fast and efficient searches against the MAG collections, more so as it allows users to start comparing their own MAG collections against MGnify. More recently, we have been sent a collection of sketches that represent all metagenomes deposited in the nucleotide archives. We are currently working on building a search interface on these sketches to identify the presence of a genome on these metagenomes.

Considerable effort has been invested in developing the infrastructure around enabling rapid searches of MAGs and the interconnectivity of MAGs to metagenomes. A key aim of this work has been to better connect MAGs to the original studies containing the raw-data from which they were derived. For MGnify-generated MAGs, this can be easily achieved, as the corresponding assembly is in MGnify, and we primarily focus on single run assembly and binning. For community generated MAGs however, this process is more complicated, as there is often less connectivity with the original sequence dataset(s) used to generate the MAG. To enable this, we have developed a process of linking MAGs to metagenome assemblies using a two step approach. For those that have direct linkage, we can connect, cross-referencing each contig in the MAG to those found in the assembly. This second phase involves the completion of the Branchwater search system, a comprehensive search system that is currently in development.
Exploitation Route The MGnify MAG collections are already being used as references databases by a wide range of scientists for different applications, such as discovering novelty in their corresponding catalogues, assessing diversity of genomes within a particular environment, or evaluating overlap and differences between similar biomes, for example between the human and mouse gut microbiota. Another important application has been the use of sketching tools to compare 'secure' genomes (e.g. The Global Microbiome Conservancy) against these catalogues. Normally, the genome sequences from these secure collections cannot easily be shared, with some even reluctant to submit the sequences into public search interfaces. However, the "sketching" of a genome allows it to be shared as the original genome cannot be reconstructed. This may pave a suitable way for open and transparent integration of such data.

User feedback indicates that the Kraken databases are being used by a variety of audiences. Others, including ourselves, are investigating how these reference databases and sourmash can be coupled into new search regimes for shallow shotgun approaches, which may facilitate deeper insights into microbiomes at limited costs.

The large metagenomic search system that we are currently developing is gaining much interest from those interested in pathogen surveillance and biodiversity monitoring as it will enable the detection of organisms within an environmental context, and help understanding of transmission in a One Health context.
Sectors Aerospace

Defence and Marine

Agriculture

Food and Drink

Communities and Social Services/Policy

Digital/Communication/Information Technologies (including Software)

Environment

Healthcare

Manufacturing

including Industrial Biotechology

Pharmaceuticals and Medical Biotechnology

URL https://github.com/EBI-Metagenomics/genomes-catalogue-pipeline
 
Description British Embassy Brussels - AMR innovation event
Geographic Reach Europe 
Policy Influence Type Contribution to a national consultation/review
Impact Increasing the awareness of current genomics best practices and the current issues faced with collecting AMR related data. Building connections with the UKHSA, to improve data flows and ensure that phenotypic data can be collected appropriately, and linked with genotypic information and presented to the global community. Also, started to discuss how MGnify data could be used in commercial diagnostics.
 
Description British Embassy Paris - Tackling AntiMicrobial Resistance, Together
Geographic Reach Europe 
Policy Influence Type Participation in a guidance/advisory committee
Impact The panel discussed recent improvements in diagnostics including training, resources and advances in technology. Specifically, there is a deep cycle between data flows, microbiologists and global surveillance. This has resulted in new connections being established with the European Centre for Disease Prevention and Control and UKHSA, to improve monitoring programs.
URL https://www.pasteurfoundation.uk/tackling-amr-together
 
Description European Commission meeting with EMBL
Geographic Reach Europe 
Policy Influence Type Participation in a guidance/advisory committee
 
Description Microbiology Society microbiome safety workshop
Geographic Reach Multiple continents/international 
Policy Influence Type Contribution to new or improved professional practice
URL https://microbiologysociety.org/events/microbiology-society-microbiome-safety-workshop.html
 
Description SeqCode Genome Standards Working Group Member
Geographic Reach Multiple continents/international 
Policy Influence Type Participation in a guidance/advisory committee
URL https://www.isme-microbes.org/governance
 
Description BlueRemediomics: Harnessing the marine microbiome for novel sustainable biogenics and ecosystem services
Amount € 7,649,827 (EUR)
Funding ID 101082304 
Organisation European Commission 
Sector Public
Country Belgium
Start 12/2022 
End 11/2026
 
Description MICROBE: MICRObiome Biobanking (RI) Enabler
Amount € 5,804,683 (EUR)
Funding ID 101094353 
Organisation European Commission 
Sector Public
Country Belgium
Start 02/2023 
End 01/2027
 
Title Metagenome Assembled Genome Catalogues 
Description Metagenome assembled genome catalogues for a variety of biomes, with species representatives made available via the MGnify website, and hundreds of thousands of genomes available via FTP site. For each species, and where appropriate, pangenome analysis has also been made available. Also, for each catalogue, there is an associated protein catalogue. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
Impact Increasingly metagenome assembled genomes can be generated from metagenomic assemblies. However, these are currently not consistently collected and compared to each other. We have developed the necessary workflows to aggregate over multiple different sources, compare the genomes and select the most representative species. This allows comparison of new databases, to understand the novelty that additional datasets may contain. 
URL https://www.ebi.ac.uk/metagenomics/browse#genomes
 
Title mettannotator 
Description Prokaryotic genome annotation pipeline. 
Type Of Technology Software 
Year Produced 2024 
Open Source License? Yes  
Impact In recent years there has been a surge in prokaryotic genome assemblies, coming from both isolated organisms and environmental samples. These assemblies often include novel species that are poorly represented in reference databases creating a need for a tool that can annotate both well-described and novel taxa, and can run at scale. Here, we present mettannotator - a comprehensive Nextflow pipeline for prokaryotic genome annotation that identifies coding and non-coding regions, predicts protein functions, including antimicrobial resistance, and delineates gene clusters. The pipeline summarises the results of these tools in a GFF (General Feature Format) file that can be easily utilised in downstream analysis or visualised using common genome browsers. Mettannotator has been tested on both novel and known taxa, and used across Ensembl Bacteria and MGnify. 
URL https://workflowhub.eu/workflows/1069
 
Description 26th Annual Meeting EDF Plenary Guest Lecture "Role of microbial communities in skin health and disease" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Plenary guest lecture by PI Rob Finn at the 26th Annual Meeting of the European Dermatology Forum.
Year(s) Of Engagement Activity 2023
URL https://www.edf-meeting.com/en/program/plenary-guest-lectures
 
Description Annual metagenomics course "Metagenomics bioinformatics at MGnify" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Annual metagenomics course provided by the MGnify team at EMBL-EBI. This course is highly sought after in the field and is always oversubscribed. Course participants learnt about the tools, processes, and analysis approaches used by MGnify in the field of genome-resolved metagenomics. The course was run by MGnify team members Ms Lorna Richardson, Drs Alexander Rogers and Tatiana Gurbich.
Year(s) Of Engagement Activity 2023
URL https://www.ebi.ac.uk/training/events/metagenomics-bioinformatics-1/#vf-tabs__section--tab1
 
Description BIOCEV Special Lecture "Genome resolved metagenomics analysis for understanding the composition of the human gut microbiome" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Special Lecture by PI Rob Finn at the Microbial Communities: Function, Structure, and Complexity" conference, which was organized in BIOCEV (Vestec).
Year(s) Of Engagement Activity 2022
URL https://www.biocev.eu/en/about/events/microbial-communities-function-structure-and-complexity.294?ty...
 
Description BioSB Computational Metagenomics Course talk "MGnify and metagenome resources at EBI" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk by MGnify Bioinformatician Tatiana Gurbich at the metagenomics course in Wageningen, Netherlands (online).
Year(s) Of Engagement Activity 2022
URL https://www.dtls.nl/courses/computational-metagenomics/
 
Description Business Insider Interview titled "Scientists are racing to explore more of the ocean hoping to discover medical breakthroughs and address the climate crisis" 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Industry/Business
Results and Impact Business Insider interview with MGnify PI Dr Robert Finn on using MGnify outputs for propelling novel Biodiscovery in the marine domain. Dr Finn coordinates an EU HORIZON initiative BlueRemediomics that is centred around the MGnify microbiome resource data and technology. The interview was reposted by Business Insider India https://www.businessinsider.in/science/news/scientists-are-racing-to-explore-more-of-the-ocean-hoping-to-discover-medical-breakthroughs-and-address-the-climate-crisis/articleshow/102210340.cms
Year(s) Of Engagement Activity 2023
URL https://www.businessinsider.com/marine-biodiscovery-could-unlock-answers-health-climate-crises-2023-...
 
Description ETIM 2022 talk "Genome resolved metagenomics: understanding the metabolic potential of microbial communities" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk by MGnify PI Rob Finn at the ETIM 2022 meeting on Artificial Intelligence and Bioinformatics held at Essen
Year(s) Of Engagement Activity 2022
URL https://etim.uk-essen.de
 
Description ICG-17 Keynote talk "Genome-level resolution metagenomics: from viruses to eukaryotes" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Keynote speech by PI Rob Finn at the ICG-17 Conference held at Riga, Latvia.
Year(s) Of Engagement Activity 2022
URL https://www.youtube.com/watch?v=x8WJysdL5zA&ab_channel=ICG-17Riga
 
Description ISME 18 Poster "MGnify Genomes: a resource for biome-specific genome catalogues" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Poster by MGnify Bioinformatician Tatiana Gurbich at the 18th International Symposium on Microbial Ecology conference.
ABSTRACT
MGnify provides a free to use platform for assembly, analysis and archiving of microbiome data from multiple environments. Recently we expanded the resource with the release of biome-specific non-redundant microbial genome catalogues that were generated using isolate and metagenome-assembled genomes that were assembled by MGnify or submitted to the European Nucleotide Archive by third parties. All genomes within a biome-specific catalogue are dereplicated to remove equivalences at the strain level. For species that contain multiple conspecific genomes after dereplication, we choose the highest quality genome as the species representative, always prioritising an isolate genome over a metagenome-assembled genome. For each catalogue we provide genomic sequences, functional annotations, pan-genomes for species that contain multiple conspecific genomes, a protein catalogue, a kraken2 database, and assembly statistics and metadata. The genomes and functional annotations can be browsed on the MGnify website or downloaded from the FTP server along with the rest of the data. We also provide a suite of search tools that allow users to compare their own gene sequences, whole genomes, or sets of genomes, against the catalogues. At the time of the writing, we have catalogues for four biomes available (human gut, human oral, cow rumen, and marine). In total, these catalogues are made up of nearly 300,000 genomes that are clustered into a total of 9,421 species representatives. The resource will continue to expand with addition of new catalogues and updates to the existing catalogues.
Year(s) Of Engagement Activity 2022
URL https://isme18.isme-microbes.org/poster-program
 
Description Industry Workshop on "MGnify: The EMBL-EBI metagenomics analysis portal" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Industry targeted workshop organised at the 23rd Genomic Standards Consortium Meeting held in Thailand in August 2023. The workshop was presented by MGnify PI Dr Robert Finn along with two MGnify teams members Ms Lorna Richardson and Dr Tatiana Gurbich.
Year(s) Of Engagement Activity 2023
URL https://genomicsstandardsconsortium.github.io/GSC23-Bangkok/program.html
 
Description Invited talk titled "MGnify - a hub for the archiving, analysis, and discovery of microbiome derived sequence data" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Invited talk by PI Dr Robert Finn titled "MGnify - a hub for the archiving, analysis, and discovery of microbiome derived sequence data" at the 2023 RoBioinfo Conference organised by the Romanian Society of Bioinformatics (RSBI) in Bucharest, May 2023. The 2023 conference focused on two main themes, namely human genomics and biodiversity-microbiome and Dr Finn talk was featured in the latter.
Year(s) Of Engagement Activity 2023
URL https://rsbi.ro/evenimente/2023-robioinfo-conference/
 
Description Keynote talk titled "Understanding the community composition of different microbiomes using resolved metagenomics" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Keynote address by MGnify PI Dr Robert Finn at the at the 16th Brazilian Symposium on Bioinformatics BSB 2023 held in Brazil, June 2023. https://bsb.sbc.org.br/2023/invited-speakers/
Year(s) Of Engagement Activity 2023
URL https://bsb.sbc.org.br/2023/program/
 
Description Microbiome 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Microbiome-targeting products are becoming increasingly prevalent, but there remains a significant need to develop a robust system for safety assessment. To gain a best cross sector understanding, the Microbiology Society convened an academic-industry-policy workshop involving ~30 members, to discuss the challenges of evaluating microbiome perturbations across the oral, skin, and gut ecosystems.
Year(s) Of Engagement Activity 2024
URL https://microbiologysociety.org/blog/a-de-brief-from-the-microbiology-safety-workshop-2024.html
 
Description Research Seminar titled "EBI resources for Microbiome Research" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Research Seminar by MGnify PI Dr Robert Finn at the University of Turku, Finland as part of another EC project that utilises MGnify infrastructure and resources.
Year(s) Of Engagement Activity 2023
 
Description Talk titled "ELIXIR Microbiome Community" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk by MGnify PI and ELIXIR Microbiome Community Lead Dr Robert Finn at the ELIXIR Marine Metagenomics/Microbiome Community meeting held in Barcelona in September 2023.
Year(s) Of Engagement Activity 2023
URL https://elixir-europe.org/events/elixir-emerging-microbiomemarine-metagenomics-community-f2fhybrid-m...
 
Description Talk titled "MGnify - the platform for the analysis and discovery of microbiome sequence data" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Professional Practitioners
Results and Impact Talk by MGnify PI Dr Robert Finn at the Functional and Evolutionary Genomics Event - Big Data and Resources in Microbiome Research event held in Cambridge in September 2023.
Year(s) Of Engagement Activity 2023
 
Description Talk titled "Microbial Ecosystems" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Policymakers/politicians
Results and Impact Talk by MGnify PI and EMBL Senior Scientist at the annual EMBL meeting with the European Commission delegates. The meeting was focused on forging new collaborations with the EC to address important applications in healthcare and AI technology for life sciences. Dr Finn is the co-chair of the microbial ecosystems scientific programme and is a key player in the development of strategic and scientific initiatives that will positively impact the research and applications landscape.
Year(s) Of Engagement Activity 2024
 
Description Talk titled "Microbiome research at EMBL" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Policymakers/politicians
Results and Impact Talk by MGnify PI Dr Robert Finn at the annual meeting of EMBL with European Commission delegates. The talk provided updates on microbiome research being undertaken at EMBL and perspectives of harnessing this for applications that benefit society.
Year(s) Of Engagement Activity 2023
 
Description Talk titled "Mining microbial communities for novel functions and biodiversity" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk by MGnify PI at the Scientific Advisory Board (SAB) Meeting for the Latvian Biomedical Research and Study Centre (LBMC). Dr Finn is a SAB member and is at the forefront of establishing strategic connections with the LBMC as a wider European initiative. His talk was focused on identifying new approaches to collaborating with LBMC researchers and capacity building.
Year(s) Of Engagement Activity 2024
 
Description Talk titled "Towards producing representative genome catalogues for microbial communities" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk by MGnify PI Dr Robert Finn at the 23rd Genomic Standards Consortium Meeting held in Thailand in August 2023. This talk was featured in the Session on "Genomic Standards for Comparative Genomics".
Year(s) Of Engagement Activity 2023
URL https://genomicsstandardsconsortium.github.io/GSC23-Bangkok/program.html
 
Description Talk titled "Understanding the functional potential of microbial communities" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk by MGnify PI and NMDC SAB member Dr Robert Finn at the Joint Genome Institute, USA in October 2023.
Year(s) Of Engagement Activity 2023
 
Description Virtual training course "Genome-resolved metagenomics bioinformatics" 2022 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Annual EMBL-EBI course delivered by the Microbiome Informatics Team which administers the MGnify microbiome resource. Participants learnt about the tools, processes and analysis approaches used in the field of genome-resolved metagenomics.
https://www.ebi.ac.uk/training/materials/genome-resolved-metagenomics-bioinformatics-materials/
Year(s) Of Engagement Activity 2022
URL https://www.ebi.ac.uk/training/events/metagenomics-bioinformatics-2022/#vf-tabs__section--tab1
 
Description Workshop talk titled "MGnify overview" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Professional Practitioners
Results and Impact Talk by MGnify PI Dr Robert Finn at a workshop organised on three important EMBL-EBI resources with data flows, namely MGnify, ENA and BioSamples.
Year(s) Of Engagement Activity 2024
 
Description Workshop talk titled "Microbiome view on annotations and systems modelling" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk by MGnify PI and ELIXIR Microbiome Community Lead Dr Robert Finn at the "Systems modelling of the microbiome" workshop organised as part of the ELIXR All Hands Meeting in June 2023.
Year(s) Of Engagement Activity 2023
URL https://elixir-events.eventscase.com/EN/elixirallhands2023/Agenda