Unlocking the chemical potential of plants: Predicting function from DNA sequence for complex enzyme superfamilies
Lead Research Organisation:
European Bioinformatics Institute
Department Name: Thornton Group
Abstract
Abstracts are not currently available in GtR for all funded research. This is normally because the abstract was not required at the time of proposal submission, but may be because it included sensitive information such as personal details.
Technical Summary
Our strategy is to integrate powerful data-driven computational approaches with experimental investigation of enzyme function to understand the functions and kingdom-specific expansion of an exemplar complex enzyme superfamily - the triterpene synthases (TTSs). The TTS enzyme superfamily is an ideal test case for our purposes, since these enzymes are able to generate an enormous diversity of cyclized triterpene scaffolds from a single common precursor molecule. Through iterative cycles of computational and experimental investigations we aim to develop sophisticated predictive analytic approaches that will enable us to relate DNA sequence to enzyme function with ever-increasing power and resolution, and in so doing to generate and test hypotheses about enzyme function, mechanisms and evolution. Our aims are to: (1) experimentally determine the chemical diversity encoded by diverse members of the TTS superfamily selected based on our initial CATH-FunFam classification; (2) expand the sequence data for the CATH TTS superfamily and integrate sequence- and structure-based computational approaches to refine our strategies for identifying TTS features implicated in determination of product specificity and for functional classification, and test TTS function predictions; (3) exploit a novel machine learning approach to predict known and novel TTSs; (4) understand TTS function and diversification by determining the product specificities of natural and engineered TTS variants, guided by computational predictions from (1)-(3).
Publications
Bouillé A
(2025)
Lineage-specific patterns in the Moraceae family allow identification of convergent P450 enzymes involved in furanocoumarin biosynthesis
in New Phytologist
Riziotis I
(2024)
Paradigms of convergent evolution in enzymes
in The FEBS Journal
| Description | We have performed MD simulations of the proposed specificity determining positions (SDPs) in binding pocket for committed OSCs to understand how these changes effect product conversion. To perform this, we have run MD simulation of both wild Cycloartenol synthase from Arabidopsis thaliana and mutated the SDPs in the binding site of Cycloartenol synthase to cucurbitadienol producing SDPs (S609C, T531S, Y118L and I365L) to understand conversion of cycloartenol to cucurbitadienol. Similarly, we are performing MD simulation of both wild Cycloartenol synthase from Arabidopsis thaliana and after mutating the SDP's to lanosterol SDP's (I481V and T531S) to understand the conversion of cycloartenol to lanosterol. The simulations were performed with precursor intermediates which will lead to cucurbitadienol and lanosterol. Further we analysed wild and mutant trajectories both in respect of either protecting the intermediates cations from quenching by water and from elimination prior to deprotonation by a well-positioned enzymatic base. The water population was very less within and equal to 3A of termination point of cucurbitadienol forming intermediates in case of wild Cycloartenol synthase. However, after the mutations in SDP's of cucurbitadienol in the binding pocket of Cycloartenol synthase the water accessibility increases. Based on our analysis of quad mutant Cycloartenol synthase, we proposed a mutation in wild Cucrbitadienol synthase i.e. D425L which might affect product formation. Analysis of wild and mutant trajectories of Cucrbitadienol synthase shows that water accessibility was more within and equal to 3A of cucurbitadienol forming intermediates in wild type compared to mutant Cucrbitadienol synthase. These results suggest role of water at termination site after mutation |
| Exploitation Route | It presents a framework for studying the specificity of enzyme function |
| Sectors | Agriculture Food and Drink Chemicals Education Environment Healthcare Manufacturing including Industrial Biotechology Pharmaceuticals and Medical Biotechnology |
| Description | EMBL-EBI Structural Bioinformatics Course |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Postgraduate students |
| Results and Impact | Gave a lecture as part of the structural bioinformatics course that is organised and held at EMBL-EBI |
| Year(s) Of Engagement Activity | 2024 |
| Description | FEBS enzyme course |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Postgraduate students |
| Results and Impact | Approximately 30-40 PhD students and postdocs attended the course to learn about computational approaches to engineering enzyme catalysis. |
| Year(s) Of Engagement Activity | 2023 |
| Description | Interview for Swiss National Radio |
| Form Of Engagement Activity | A press release, press conference or response to a media enquiry/interview |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Media (as a channel to the public) |
| Results and Impact | I was interviewed for a Swiss Radio Science Programme about Protein modelling. |
| Year(s) Of Engagement Activity | 2022 |
| Description | Presentation to Life Science Industry representatives |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Industry/Business |
| Results and Impact | A presentation to EMBL-EBI's Indutry Programme, which includes representatives from almost 30 (largish) companies in Life Sciences. The title of the talk was 'Computational enzymology: towards better tools for capturing flexibility, function and mechanisms'. |
| Year(s) Of Engagement Activity | 2022 |
| Description | School Visit (Borlase Grammar School) |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | Local |
| Primary Audience | Schools |
| Results and Impact | I gave a lecture on 'Women in Stem - the fascinating world of Proteins' to the Lower 6th (year 12) students at Borlase Grammar School. The intention was to encourage the girls to apply to study a STEM related subject at university and to promote Research as a Career to all the pupils (girls and boys). I went to the school. |
| Year(s) Of Engagement Activity | 2023 |
| Description | Series of lectures in institutes in India (IIT Bombay; IISc Bangalore: Tata Institute for Fundamental Research NCBS Bangalore ) |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | Regional |
| Primary Audience | Undergraduate students |
| Results and Impact | Janet Thornton gave a series of lectures in India to female undergraduate students to present this work and inspire them to go into STEM careers. |
| Year(s) Of Engagement Activity | 2024 |
| Description | Stockholm University lecture |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | Local |
| Primary Audience | Other audiences |
| Results and Impact | Professor Thornton gave a lecture at the University of Stockholm open to all researchers there including students/postdocs and faculty, which sparked questions and discussion afterwards. |
| Year(s) Of Engagement Activity | 2023 |
| Description | Structural Bioinformatics EMBO Workshop |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Postgraduate students |
| Results and Impact | Professor Thornton gave a lecture as part of the Computational Structural Biology EMBO Workshop on "Computational Enzymology: towards using knowledge of structure and function to identify paradigms of evolution and to predict enzyme mechanisms." followed by questions and discussions |
| Year(s) Of Engagement Activity | 2023 |
| URL | https://www.embl.org/about/info/course-and-conference-office/events/csb23-01/#vf-tabs__section-progr... |
| Description | UCL Biological Society outreach activity in Windsor |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | Local |
| Primary Audience | Undergraduate students |
| Results and Impact | Professor Thornton gave a lecture as part of an outreach activity organised by the UCL Biological Society for undergraduate students. |
| Year(s) Of Engagement Activity | 2024 |
| Description | Webinar for the NeuroIPS conference. |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Professional Practitioners |
| Results and Impact | I gave a virtual presentation at an ELLIS workshop as part of the NeuroIPS conference (which is outside my usual field of study). The title of the talk was 'Critical assessment of molecular machine learning workshop' aspart of a [ML4Molecules] | ELLIS workshop, VIRTUAL, |
| Year(s) Of Engagement Activity | 2022 |
| Description | lecture at 3D-Bioinfo conference in Prague |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Other audiences |
| Results and Impact | Professor Thornton gave a lecture at the 3D-BioInfo community meeting on structural bioinformatics |
| Year(s) Of Engagement Activity | 2023 |
