Unlocking the chemical potential of plants: Predicting function from DNA sequence for complex enzyme superfamilies

Lead Research Organisation: European Bioinformatics Institute
Department Name: Thornton Group


Technical Summary

Our strategy is to integrate powerful data-driven computational approaches with experimental investigation of enzyme function to understand the functions and kingdom-specific expansion of an exemplar complex enzyme superfamily - the triterpene synthases (TTSs). The TTS enzyme superfamily is an ideal test case for our purposes, since these enzymes are able to generate an enormous diversity of cyclized triterpene scaffolds from a single common precursor molecule. Through iterative cycles of computational and experimental investigations we aim to develop sophisticated predictive analytic approaches that will enable us to relate DNA sequence to enzyme function with ever-increasing power and resolution, and in so doing to generate and test hypotheses about enzyme function, mechanisms and evolution. Our aims are to: (1) experimentally determine the chemical diversity encoded by diverse members of the TTS superfamily selected based on our initial CATH-FunFam classification; (2) expand the sequence data for the CATH TTS superfamily and integrate sequence- and structure-based computational approaches to refine our strategies for identifying TTS features implicated in determination of product specificity and for functional classification, and test TTS function predictions; (3) exploit a novel machine learning approach to predict known and novel TTSs; (4) understand TTS function and diversification by determining the product specificities of natural and engineered TTS variants, guided by computational predictions from (1)-(3).

Publications

 
Description: Interview for Swiss National Radio
Form Of Engagement Activity: A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme?: No
Geographic Reach: International
Primary Audience: Media (as a channel to the public)
Results and Impact: I was interviewed for a Swiss Radio Science Programme about protein modelling.
Year(s) Of Engagement Activity: 2022
 
Description: Presentation to Life Science Industry representatives
Form Of Engagement Activity: A talk or presentation
Part Of Official Scheme?: No
Geographic Reach: International
Primary Audience: Industry/Business
Results and Impact: A presentation to EMBL-EBI's Industry Programme, which includes representatives from almost 30 fairly large companies in the life sciences.
The title of the talk was 'Computational enzymology: towards better tools for capturing flexibility, function and mechanisms'.
Year(s) Of Engagement Activity: 2022
 
Description: School Visit (Borlase Grammar School)
Form Of Engagement Activity: A talk or presentation
Part Of Official Scheme?: No
Geographic Reach: Local
Primary Audience: Schools
Results and Impact: I visited Borlase Grammar School in person and gave a lecture on 'Women in STEM - the fascinating world of Proteins' to the Lower Sixth (Year 12) students. The intention was to encourage the girls to apply to study a STEM-related subject at university and to promote research as a career to all the pupils (girls and boys).
Year(s) Of Engagement Activity: 2023
 
Description: Webinar for the NeurIPS conference
Form Of Engagement Activity: A talk or presentation
Part Of Official Scheme?: No
Geographic Reach: International
Primary Audience: Professional Practitioners
Results and Impact: I gave a virtual presentation at an ELLIS workshop held as part of the NeurIPS conference (which is outside my usual field of study).
The title of the talk was 'Critical assessment of molecular machine learning workshop', given as part of the virtual ML4Molecules ELLIS workshop.
Year(s) Of Engagement Activity: 2022