Mechanisms of alternative translation initiation codon selection in the regulation of eukaryotic gene expression

Lead Research Organisation: University of Southampton
Department Name: Centre for Biological Sciences

Abstract

The DNA sequence in every cell of the body, termed the genome, stores the instructions to make all the proteins that are the essential building blocks needed for living. The code for each protein is stored within shorter stretches of DNA called genes. Converting the DNA sequence of a gene (which has only four 'letters') to that of its corresponding protein (which is made up of twenty different kinds of amino acids) requires two major processes, transcription and translation. As their names suggest, transcription is the copying of the sequence information in the DNA into a molecule called messenger RNA (mRNA) whereas translation involves the decoding of the mRNA sequence into the amino acid sequence. The complex molecular machine that translates the mRNA sequence is called the ribosome, and the positioning of the ribosome on the mRNA sequence requires several other proteins, termed translation initiation factors. The ribosome starts making a protein when it finds a particular sequence in the mRNA called a translation initiation codon. Originally it was thought that each gene only made one protein but it is now well established that multiple proteins can be produced from a single gene. This makes a lot of sense when we consider that the genomes of several animal species have now been sequenced and the human genome only contains between 25,000 and 30,000 genes, yet humans are a much more complex animal than a fruitfly (which has 13,600 genes) or a nematode worm (19,000 genes). Different proteins can be made from the same gene if the transcription process begins at different places within the gene or if the mRNA is edited differently in a process called alternative splicing. Another way in which different proteins can be produced from the same gene is by the ribosome beginning to translate the protein at different positions on the mRNA, meaning that longer or shorter versions of the protein are made. I am interested in how and why different translation initiation codons are used and just how widespread this phenomenon is. So far, there are only a few examples where this has been discovered but those that are known are very important, making proteins which can have completely different functions or go to different places within the cell. Importantly, most of the examples of proteins that use alternative initiation codons are implicated in cancers, suggesting that making a certain form of the protein might be better for the cell than when an alternative start site is selected. The work I propose to carry out will determine what factors influence the ribosome to start translating a protein at one position versus another. This could be due to certain sequences being recognised in the mRNA, or it could be due to other translation initiation factors being involved in the selection process. I will be using a few examples of proteins that are known to be made from different initiation codons for this work, but the discoveries I make will be useful in improving our understanding of how ALL proteins can be made, both under normal cellular conditions and also when this process goes wrong. We are going to carry out this work by changing the mRNA sequences surrounding the translation initiation codon to see if we can make their selection better or worse. We will also manipulate the levels of the initiation factors in different cell lines that we grow in the laboratory to see if having more or less of these factors also has an effect on initiation codon choice. Another part of the project will monitor which particular versions of proteins are being made in patient samples, and we may therefore be able to use our knowledge of what role the versions play in the cell to begin to improve the diagnosis of diseases.

Technical Summary

The regulation of eukaryotic translation initiation is a key point in gene expression, and can rapidly alter the temporal and spatial expression of either the whole transcriptome or specific mRNAs, without requiring new transcription. Central to translation initiation is the selection of the translation initiation codon (TIC), usually an AUG. However, other AUG and non-AUG codons may be used for translation and I intend to investigate the usage of alternative initiation codons and how these result in the synthesis of N-terminally extended or truncated proteins. My preliminary work has demonstrated that the open reading frame of the multi-domain translation initiation factor eIF4GII can be extended by over 140 amino acids through the use of a non-canonical CUG initiation codon, in addition to the originally assigned AUG. This project will expand this work to determine the mechanisms of alternative TIC selection using the examples of eIF4GII, eIF4GI (translated as 5 isoforms from alternative AUGs), and BAG-1 (with 3 isoforms, the longest from a CUG). Using a simple assay I have developed, I will demonstrate the usage of alternative TICs on these mRNAs and confirm translation occurs from these codons and other non-canonical triplets by mutagenesis. Regions of primary sequence or secondary structure that influence initiation codon choice will also be examined. Quantitative RT-PCR and immunoblotting will relate the different protein species arising from alternative TICs to the endogenous mRNA. Further work will examine the role of trans-acting factors in the selection of initiation codons using a combination of overexpression and siRNA. Signalling events that impinge upon translational machinery will also be examined to determine whether particular kinases control TIC choice. This work will begin to examine the importance of alternative TIC selection in the generation of protein isoform diversity, a previously neglected aspect of gene expression.

Planned Impact

This proposal is the beginning of several long term projects I am undertaking to examine the expansion of the proteome by use of alternative translation initiation codons, which will have major implications in the areas of biotechnology (in particular proteomics studies) and human health. This work will make us reconsider mechanisms of gene expression and I believe that alternative initiation codons may represent as important a method for generating diversity in the proteome as alternative splicing. This work will benefit those working in the proteomics community as the expansion of the proteome due to use of alternative translation initiation codons is not often considered in such studies. For example, peptides arising from mass spectrometric analyses of a protein with multiple translation initiation codons (e.g. eIF4GI) may not be assigned correctly as they will be truncated compared to the expected peptide size. Information regarding the use of alternate initiation codons, including estimates of their usage in different tissues (see below) will therefore be useful in performing such analyses, so will be made available in public gene expression databases. I am currently engaged in work to identify further candidates which are likely to be expressed as different isoforms from alternative translation initiation codons. This work will therefore be of interest to workers in many fields, not just gene expression. As this work expands, the physiological roles of such different isoforms will be explored and work by others has recently shown that alternate isoforms of the two pore domain potassium (K2P) channels TREK1 and TREK2 that arise from different translation initiation codons show altered biophysical properties and ion selectivity. A long-term objective of this research is to determine whether the use of alternative initiation codons can be used as a biomarker to determine the status of gene expression in diseased tissues. The groundwork for this approach has already been proven for BAG-1, one of the examples that will be studied during this project. Increased nuclear staining of BAG-1 in breast cancer (correlating with increased usage of the most 5' non-canonical CUG initiation codon) is associated with improved long-term prognosis. Being able to assay a patient sample for different isoforms of a protein that arise from alternative initiation codons where the isoforms have defined properties may enable diagnoses to be made. One simple way in which this could be performed would be a simple ELISA test, if suitable antibodies to different isoforms could be developed. There is already a large and successful infrastructure in place at the University of Southampton to exploit novel biological research in the clinical environment. This, coupled with the extensive networking opportunities afforded to me in the newly built Institute for Life Sciences and as an active member of the Southampton Nucleic Acids Partnership and the South Coast RNA Network as well as the wider Translation Control community will ensure that I am able to disseminate this work to the largest audience possible and create strong collaborations in the future. This work is also amenable to therapeutic interventions and prospective collaborations with industrial partners will be sought at the earliest stages of this project, in order to create the biggest potential impact in improving quality of life and generating economic success.
 
Description We have developed assays and bioinformatic pipelines to identify novel instances of translation initiation at positions within messenger RNAs that have not been previously annotated within genome databases (Coldwell, M.J., Sack, U., Cowan, J.L., Barrett, R.M., Vlasak, M., Sivakumaran, K. and Morley, S.J. (2012) Multiple isoforms of the translation initiation factor eIF4GII are generated via use of alternative promoters, splice sites and a non-canonical initiation codon. Biochem J, 448, 1-11.) A further paper exploring this phenomenon in further novel candidates is currently being revised prior to submission.

We have also developed tools to analyse the factors and sequences that regulate the choice of these alternative initiation codons. Part of this has been to help develop our understanding of how the components of the machinery interact in novel ways (Davey, N.E., Cowan, J.L., Shields, D.C., Gibson, T.J., Coldwell, M.J. and Edwards, R.J. (2012) SLiMPrints: conservation-based discovery of functional motif fingerprints in intrinsically disordered protein regions. Nucleic Acids Res, 40, 10628-10641.) or in different cellular conditions (Coldwell, M.J., Cowan, J.L., Vlasak, M., Mead, A., Willett, M., Perry, L.S. and Morley, S.J. (2013) Phosphorylation of eIF4GII and 4E-BP1 in response to nocodazole treatment: A reappraisal of translation initiation during mitosis. Cell Cycle, 12, 3615-3628.)
Exploitation Route The bioinformatic pipeline can be used by others to identify instances of alternative translation initiation within their own genes of interest, in species further to the human and murine genes that were the subject of our own study.

Knowledge of the true nature of the proteome (i.e. how extensions/truncations to proteins that arise through alternative initiation) may help inform the pharmaceutical industry develop novel drugs to target specific isoforms.
Sectors Education,Healthcare,Manufacturing, including Industrial Biotechology,Pharmaceuticals and Medical Biotechnology

 
Description No external impacts to date, other than publications
 
Description Outreach event 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Public/other audiences
Results and Impact We have been working with a colleague creating textile pieces to represent different aspects of cellular processes for an upcoming Science and Engineering open day. We have been working with members of the public on this, learning e.g. felting techniques from them, as we chat to them about the science of what we are creating. The actual open day is soon, and we will be chatting to participants as they contribute to a larger piece about e.g. gene expression and how ribosomes decode a mRNA, or how mitochondria produce ATP.
Year(s) Of Engagement Activity 2017
URL https://unit11studios.wordpress.com/currentartists/halina-bainbridge/