Next Generation Tools For The Identification Of Metabolites In Global Metabolomic Studies (Lead application: BB/N023013/1)

Lead Research Organisation: European Bioinformatics Institute
Department Name: Chemoinformatics and Metabolism

Abstract

Abstracts are not currently available in GtR for all funded research. This is normally because the abstract was not required at the time of proposal submission, but may be because it included sensitive information such as personal details.

Technical Summary

The chemical identification of metabolites is a crucial step in untargeted metabolomic studies but is currently limited by our understanding of the complex processes operating in electrospray ion sources when coupled with liquid chromatography-mass spectrometry. Metabolite identification remains the rate limiting step in metabolomics studies. Research by the investigators has shown that a large number of different adducts, isotopic, multiply charged and fragment peaks can be observed when analyzing crude biological extracts applying liquid chromatography-electrospray mass spectrometry. Here we propose to further develop, validate and make publicly available an integrated computational workflow (PUTMEDID) for the grouping annotation of each metabolite with a molecular formula and metabolite name(s) with a statistical score of confidence. We will (i) for the first time fully characterize the large diversity of adducts, isotopic, multiply charged and fragment peaks detected when analyzing crude biological extracts applying liquid chromatography-electrospray mass spectrometry; (ii) further develop computational tools to identify all adducts, isotopic, multiply charged and fragment peaks in a sample, instrument and analytical method specificity; (iii) enhance a currently available computational resource (PUTMEDID) to increase the number of true positive annotations. To allow the greatest impact the bioinformatics resource and associated code will be made available to researchers globally through incorporation in the well-maintained and stable computational infrastructure at the European Bioinformatics Institute in the UK. In summary we will DEVELOP A COMPUTATIONAL TOOL for enhanced metabolite annotation, APPLY the tool to construct an OPEN ACCESS RESOURCE for all researchers and DISSEMINATE AND TRAIN the scientific community. This innovative new approach will significantly enhance our capabilities to annotate all metabolites detected in metabolomics studies.

Planned Impact

WHO WILL BENEFIT FROM THE RESEARCH?
There are many national and international groups who will benefit from the publically accessible metabolite annotation bioinformatics resource to be constructed. These include
(i) Academic researchers performing non-targeted metabolomics using LC-MS. The resource developed will benefit research in to microbes, plants and animals in areas including synthetic biology, crop production and human ageing.
(ii) Industry scientists performing non-targeted metabolomics research with LC-MS. The resource developed will provide greater understanding of the metabolism underlying the production of pharmaceuticals and chemicals and in improved crop production.
(iii) Government agencies in the UK performing non-targeted metabolomics research with LC-MS platforms. For example, the Department for Environment, Food and Rural Affairs in the UK who through the FERA facility apply non-targeted metabolomics for food safety and food authenticity testing and crop protection.
(iv) Commercial instrument suppliers, specifically those supplying mass spectrometers as the resource will be applicable to a range of different mass spectrometers from different commercial instrument suppliers.
(vi) A post-doctoral research associate employed during the research through training in different scientific disciplines and through personal development.

HOW WILL THEY BENEFIT FROM THIS RESEARCH?
There will be a number of direct or indirect benefits observed by academic and industrial research groups, commercial industrial companies, and the research staff employed for the proposed research. The first direct benefit will be the ability of academic and industrial research groups to perform higher quality biological research through increased abilities to annotate a larger number of metabolites in non-targeted metabolomics studies and therefore provide higher quality data for biological interpretation at the systems level. This will readily be achieved via the metabolite annotation bioinformatics resource. The ability of researchers to apply systems-level approaches to understand the interactions of metabolites with other metabolites and biochemicals and to be able to integrate large biochemical databases from holistic data acquisition at different functional levels is a growing requirement in biological research. The second benefit will be to provide a commercial impact through higher quality biological research performed in industry and which increases the efficiency of production of chemicals, pharmaceuticals and crops. The third benefit will be to provide a further commercial impact, specifically to mass spectrometer suppliers through their improved ability to annotate metabolites in biological studies and the impact this will have on mass spectrometer sales versus, for example, the relative decline in the use of NMR spectroscopy over the last few years. The public will indirectly benefit from the developed resource through higher quality biological research and the impact on improvements in crop and drug production and in our understanding of healthy ageing. Finally, the post-doctoral researcher and investigators performing the proposed research will benefit from training in new concepts in a multi-disciplinary environment.

Mechanisms for dissemination to scientists and to the general public are described in our Pathways to Impact document.

Publications

10 25 50
 
Description A new tool called BEAM is now available at https://more.bham.ac.uk/beams/. The package includes several automated and seamless computational modules that are applied to putatively annotate metabolites detected in untargeted ultra (high) performance liquid chromatography-mass spectrometry or untargeted direct infusion mass spectrometry metabolomic assays in a single and automated process.
Exploitation Route It will speed up and inform analysis of metabolites which is a major bottleneck in our field. This tool has been promoted at meetings and conferences and it is also planned to be used with other tools in workflows to be made available through a galaxy instance at EMBL-EBI
Sectors Agriculture, Food and Drink,Chemicals,Healthcare,Manufacturing, including Industrial Biotechology,Pharmaceuticals and Medical Biotechnology

URL https://more.bham.ac.uk/beams
 
Title Preparation of data for testing the PutMedID 
Description We have been preparing test set data for the tools to be used for validation and testing. 
Type Of Material Database/Collection of data 
Provided To Others? No  
Impact This will ensure that the tools are effective. 
 
Title Galaxy tools for Python package BEAMS 
Description Galaxy wrapper for BEAMS (Birmingham mEtabolite Annotation applying Mass Spectrometry). BEAMS is a Python Package to annotate LC-MS and DIMS data 
Type Of Technology Software 
Year Produced 2018 
Open Source License? Yes  
Impact Graphical tool that will be generally available for any Galaxy based environments. Galaxy is an open, web-based platform for data intensive biomedical research. Whether on the free public server or your own instance, you can perform, reproduce, and share complete analyses. 
URL https://github.com/computational-metabolomics/beams-galaxy