RNA synthetic biology

Lead Research Organisation: University of Edinburgh
Department Name: UNLISTED

Abstract

In this project, we will develop efficient methods to predict the effects of mutations in our DNA.
This is important, due to the ongoing national investment in research to read out the mutations people carry in their DNA. We will only be able to use that information if we know what individual mutations are doing and how they may affect health and disease.
Our methods rely on cell culture models. In such models, many thousands of cells are grown together in a flask, and we engineer every cell to contain a different mutation in a selected gene. We then ask which mutations cause the cells to die, and which parts of the cell are affected by each mutation.
We can then use this information to predict which mutations cause disease. We have teamed up with biomedical researchers in Edinburgh to study mutations that cause diabetes, eye malformations, and metabolic disorders.

Technical Summary

As large-scale sequencing projects uncover new variation in the human genome, understanding the consequences of this variation becomes increasingly important. In the next five years, we will use synthetic biology, next-generation sequencing, and computational modelling to study the relationships between gene sequence, expression, structure, and function. We will focus on studies of noncoding RNAs and synonymous mutations in protein-coding genes. We believe that a detailed understanding of genotype-phenotype relations for selected model transcripts will uncover principles applicable to many RNA and protein molecules. Specifically, we aim:
1) To understand the molecular mechanisms underlying an RNA fitness landscape.
2) To understand the effects of codon usage on various stages of gene expression.
3) To develop new applications of genotype-phenotype mapping.
In aim 1, we will use yeast U3 snoRNA as a model system to study genotype-phenotype relations. We previously constructed a library of 60,000 mutants of U3 to study the effects of mutations on fitness in wild-type yeast. To study the role of gene-gene interactions, we will express the library in a collection of strains depleted of U3-interacting proteins, and to study gene-environment interactions, we will express the mutants in a range of environmental conditions. We will also develop high-throughput assays to measure the effects of mutations on U3 RNA abundance and RNA-protein interactions.
In aim 2, we will use human cell culture models to measure the effects of synonymous mutations on transcription, RNA stability, RNA export and translation. We will then use machine learning to uncover the sequence determinants of expression, and to understand which stages of expression are most strongly influenced by mutations. In collaboration with partners in industry (ThermoFisher) and academia (Laurence Hurst, University of Bath), we will then use our data to develop new strategies for codon optimization.
In aim 3, we will collaborate with University of Edinburgh researchers to apply our genotype-phenotype mapping methods to understand human disease mutations. We will focus on mutations in genes relevant to the local research community, encoding a range of transcription factors, hormone receptors, and metabolic enzymes.

People

ORCID iD

Publications

10 25 50

publication icon
Gabryelska MM (2022) Global mapping of RNA homodimers in living cells. in Genome research

publication icon
Kudla G (2022) Lighting up protein design. in eLife

publication icon
Kudla G (2020) RNA Conformation Capture by Proximity Ligation. in Annual review of genomics and human genetics

publication icon
Mittal P (2018) Codon usage influences fitness through RNA toxicity in Proceedings of the National Academy of Sciences

Related Projects

Project Reference Relationship Related To Start End Award Value
MC_UU_00007/1 01/04/2018 31/03/2023 £662,000
MC_UU_00007/2 Transfer MC_UU_00007/1 01/04/2018 31/03/2023 £3,730,000
MC_UU_00007/3 Transfer MC_UU_00007/2 01/04/2018 31/05/2022 £3,053,000
MC_UU_00007/4 Transfer MC_UU_00007/3 01/04/2018 31/03/2023 £1,772,000
MC_UU_00007/5 Transfer MC_UU_00007/4 01/04/2018 31/03/2023 £4,524,000
MC_UU_00007/6 Transfer MC_UU_00007/5 01/04/2018 31/03/2023 £2,878,000
MC_UU_00007/7 Transfer MC_UU_00007/6 01/04/2018 31/03/2023 £2,829,000
MC_UU_00007/8 Transfer MC_UU_00007/7 01/04/2018 31/12/2022 £4,072,000
MC_UU_00007/9 Transfer MC_UU_00007/8 01/04/2018 31/03/2023 £3,137,000
MC_UU_00007/10 Transfer MC_UU_00007/9 01/04/2018 31/03/2023 £6,948,000
MC_UU_00007/11 Transfer MC_UU_00007/10 01/04/2018 31/03/2023 £2,421,000
MC_UU_00007/12 Transfer MC_UU_00007/11 01/04/2018 31/03/2023 £1,205,000
MC_UU_00007/13 Transfer MC_UU_00007/12 01/04/2018 31/03/2023 £1,174,000
MC_UU_00007/14 Transfer MC_UU_00007/13 01/04/2018 31/03/2023 £1,838,000
MC_UU_00007/15 Transfer MC_UU_00007/14 01/04/2018 31/03/2023 £2,551,000
MC_UU_00007/16 Transfer MC_UU_00007/15 01/04/2018 31/03/2023 £1,496,000
MC_UU_00007/17 Transfer MC_UU_00007/16 01/04/2018 31/03/2023 £1,886,000
 
Description Membership of Foundation for Polish Science grant panel
Geographic Reach Europe 
Policy Influence Type Participation in a guidance/advisory committee
 
Description Context-Dependent Gene Optimization 
Organisation Thermo Fisher Scientific
Country United States 
Sector Private 
PI Contribution Coding sequence optimization has traditionally relied on the adjustment of the codon composition of genes to the preferred codon usage of the host species. This strategy is based on the assumption that preferred codons correspond to abundant tRNAs, and that the use of such codons should result in increased translation elongation rates and in higher protein yields. Modern tools, such as GeneOptimizer, employ a strategy known as multi-parameter coding sequence optimization, which, in addition to codon composition, adjusts a range of parameters, such as GC%, homopolymer content, internal repeats, and the presence of certain DNA sequence motifs. It has been shown that the application of multiple parameters in optimization can result in increased protein yields (Kudla 2006, Kudla 2009, Fath 2011, Cambray 2018). However, there is growing evidence that a single codon optimization strategy is not sufficient to achieve efficient expression in all relevant applications in a given host species, because the effects of codon usage are context-dependent. For example, as part of our "Next Generation Gene Optimization" project, we have shown that the effects of codon usage on gene expression are different in intron-containing and intronless genes, and that the same codons have different effects on expression when located at 5' ends or 3' ends of genes (Mordstein 2020). Others have shown that the effects of codon usage on gene expression depend on the host tissue (Gingold 2014), on the promoter from which the RNA is expressed (Zhou 2016), on whether host cells have been infected by a virus and are undergoing an antiviral response (Watson 2019), and on the presence or absence of specific RNA-binding proteins in the host cell (Radhakrishnan 2016, Takata 2017, Zuckerman 2020). Preliminary experiments performed in the Kudla lab also show that gene optimization parameters depend on the subcellular compartment in which the heterologous gene is expressed (nucleus vs cytoplasm, relevant for expression of viral genes and for viral-based expression systems), and whether the genetic material is supplied to the cell as DNA or mRNA. The following aims will be pursued in this project: 1. Investigate the compatibility of codon optimization procedures with systems used for heterologous expression of genes in human cells. 2. Investigate the interdependence between innate immune response pathways in human cells, and codon-dependent gene regulation. 3. Integrate the experimental results into the GeneOptimizer tool to allow context-dependent gene optimization.
Collaborator Contribution The industrial partner (ThermoFisher) is funding this project with $150,000, which covers the costs of hiring a PhD student for 3 years, as well as consumable and travel costs.
Impact Not available.
Start Year 2022
 
Title COMRADES - A program for folding RNA using constraints from proximity ligation experiments 
Description RNA proximity ligation methods such as CLASH, HiC, PARIS, SPLASH, and COMRADES generate chimeric reads that represent RNA-RNA interactions. Comrades inputs a set of mapped chimeric reads (produced by hyb, https://github.com/gkudla/hyb), and outputs a ranked list of predicted basepairs. It then combines these basepairs into short structural elements, ranks the elements by number of supporting chimeric reads, and uses the ranked elements as constraints for RNA folding. The output is an RNA structure or set of structures in UNAFold .ct format or vienna dot-bracket format. 
Type Of Technology Software 
Year Produced 2018 
Open Source License? Yes  
Impact Publication: Ziv O, et al., COMRADES determines in vivo RNA structures and interactions (2018). Nat Methods 15(10): 785-788. 
 
Description EMBO Workshop on Codon Usage 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Grzegorz Kudla is the main organizer of the 1st EMBO workshop on codon usage. The workshop will take place on 8-11 April 2022 and will bring together academic and industry leaders in this area. It will cover recent research in the evolution, function and mechanism of codon usage, as well as practical developments in gene synthesis and codon optimization.
Year(s) Of Engagement Activity 2022
URL https://meetings.embo.org/event/21-codon