Structure and Folding of Proteins with Identical Tandemly Arrayed Domains

Lead Research Organisation: University of Cambridge
Department Name: Chemistry

Abstract

Abstracts are not currently available in GtR for all funded research. This is normally because the abstract was not required at the time of proposal submission, but may be because it included sensitive information such as personal details.

Technical Summary

Very recent work from the Clarke lab used single-molecule techniques to detect misfolding events in a synthetic construct containing adjacent domains of the same fold and identical sequence. These misfolding events disappeared when the sequence identity between domains was reduced to ~40%. Many multi-domain proteins contain strings of domains with the same fold and these misfolding events are thus the likely explanation for the apparent evolutionary pressure to maintain sequence diversity in adjacent domains in almost all such proteins.
Very recent work in the Potts lab has solved the structure of a native protein with virtually identical tandemly arrayed domains. In SasG, a Staphylococcus aureus surface protein involved in bacterial biofilm formation, domain identity is driven at the DNA level and appears to be a mechanism facilitating antigenic variation. The protein domains lack the normally expected compact hydrophobic core and we propose this novel structure has evolved to avoid misfolding events when the presence of identical domains is otherwise advantageous. The overall aim of the research is to use SasG to enhance our understanding of how proteins fold and, importantly, how they avoid misfolding.

The specific objectives of the proposed research are to:
1. Identify the sources of thermodynamic stability and to determine the folding mechanism of the single layer beta-sheet SasG domains.
2. Test a number of different SasG constructs for rare misfolding events using a recently-developed single molecule FRET technique. Are specific residues important for resisting misfolding?
3. Understand the role of SasG domains and specific residues in resisting mechanical unfolding.
4. Determine the structure of a different bacterial protein with identical tandemly arrayed domains (to establish if it also has an unusual fold).
5. Lay the groundwork for potential applications of designed SasG constructs of specific length in nanotechnology.

Planned Impact

This research proposal seeks to investigate a novel multidomain protein whose structure challenges many of our ideas about how proteins fold and how they avoid misfolding. Our hypothesis is that it is the extraordinary non-globular structure that proteins the proteins against misfolding. SasG, the protein of interest, is one of a series of extra-cellular bacterial proteins that contain nearly identical duplicated repeats. We are also going to investigate the structure of a second such protein where no structural information is available at all.

One of the groups of beneficiaries is the members of the community of protein folding researchers, in particular to those who use computational methods. Our studies of these proteins will allow us to reassess out understanding of how proteins fold. SasG is predicted to be unfolded by most algorithms which predict disorder. Yet it is clearly not. In the "omics" era computational methods have become particularly important in the analysis and of and prediction from the wealth of genomic and structural data. In the fields of bioinformatics and structure prediction, force fields and algorithms are based on experimental data, so studies of novel folds such as that of SasG are essential for furthering our understanding.

The two post docs to be employed on this project are also beneficiaries. The strong collaboration between our two groups, which a different yet complementary set of skills, provides an outstanding opportunity for two young scientists to be trained in world class laboratories. They will then be in an excellent position to continue their career, whether in academia, or very likely, in industry, where there is a real national demand for highly skilled scientists with the skill set we are offering. Of the many students and post-docs to have passed through the Clarke laboratory, for instance, a significant proportion are now employed in the biotechnology industry, some in small start-up companies, and others in companies which have now been taken into the mainstream pharmaceutical industry sector (such as Medimmune).

Although not explicitly a project which aims to have a technological outcome, this project may have some future applications, resulting in some economic or well-being benefits. The proteins we are investigating are involved in biofilm formation. Biofilms are of significant economic importance both in terms of medicine (where they are detrimental as they are involved in bacterial infection) and in the industrial context (where they may be either detrimental or helpful). We are not directly investigating biofilm formation, yet a fundamental understanding of the biophysical and mechanical properties of the proteins which mediate biofilm formation may be valuable to underpin future research in this area. The use of biological molecules in nanotechnology is in its infancy. We will investigate whether SasG, a highly soluble, rigid molecule may be suitable to provide molecules in the nanometre length scale that are tuneable both in terms of size and in terms of sites which can be functionalised.

Publications

10 25 50

publication icon
Gruszka DT (2016) Disorder drives cooperative folding in a multidomain protein. in Proceedings of the National Academy of Sciences of the United States of America

 
Description We have used light and small angle X-ray scattering (SAXS), single-molecule fluorescence microscopy and mechanical unfolding methods, combined with simulations, to show that, despite being monomeric and lacking covalent crosslinks, SasG maintains a highly extended conformation in solution and is mechanically strong.
We have used protein-folding studies to show that the extension is mediated through obligate folding cooperativity of the intrinsically disordered E domain that couples non-adjacent G5 domains thermodynamically, forming interfaces that are more stable than the domains themselves. Thus, counter-intuitively, the elongation of the protein appears to be dependent on the inherent instability of its domains.

We have used Atomic force spectroscopy to demonstrate that SasG is remarkably mechanically resistant. Molecular dynamics simulations have revealed that the strength of SasG arises solely from tandemly arrayed 'clamp' motifs within the folded G5 and E domains, formed from long, directly hydrogen-bonded, ß-strands.

Our findings reveal an elegant minimal solution for the efficient assembly of monomeric mechanoresistant tethers of variable length.

We have also investigated the folding mechanism of these domains, and are currently exploring the role of the conserved residues, in particular the conserved glycines that give the G5 domains their name.

We have used kinetic and thermodynamic methods to investigate the folding pathway of SasG. We find that it folds via a highly unusual, polarised transition state. In 2-domain E-G5 constructs we find that the entire protein folds cooperatively, again via formation of a discrete nucleus at the extreme C-terminal end of the molecule. The interface between the domains is responsible for this cooperative behaviour. Quite remarkably we discovered that SasG can fold by an alternative, parallel pathway: if the "normal" nucleus is destabilised in the 2-domain protein the folding of the interface between the two domains drives the folding, but critically, again in a cooperative manner. We propose that it is the intrinsic instability of the E domain that ensures SasG is a cooperative protein.
Exploitation Route Our understanding of the means by which a single protein can produce a long strong molecule might have utility of nanotechnology. Our understanding of the role disorder can play in structure folding and function.
Sectors Education,Pharmaceuticals and Medical Biotechnology

 
Description We have presented these findings at a number of conferences, in poster and lecture form We are currently finalizing a first manuscript, a second is in preparations and we anticipate at least one more publication shortly. Final impact: A visitor worked on this project for 6 months. a a result she has been accepted to do a M Phil in Chemistry A training and development impact. 2016 update: A second manuscript is in preparation. On the basis of her work on this project Carol Mendonca was awarded a MPhil and has been accepted to do a PhD in Cambridge University. Dr Dominika Gruscka gained a post doc position in the Crick Institute.
Sector Education,Other
Impact Types Economic

 
Description Investigation of mechanical strength of SasG by AFM 
Organisation University of Leeds
Country United Kingdom 
Sector Academic/University 
PI Contribution We have collected and analysed atomic force microscopy (AFM) data demonstrating that SasG tandem domains are resistant to mechanical stress.
Collaborator Contribution Expertise in AFM
Impact none as yet
Start Year 2013
 
Description Investigation of the molecular mechanism of forced unfolding of SasG 
Organisation University of Leeds
Country United Kingdom 
Sector Academic/University 
PI Contribution We have performed molecular dynamics simulations in collaboration with the University of Leeds to characterise the molecular mechanism of forced unfolding of SasG domains.
Collaborator Contribution Dr Emanuele Paci trained DG in simulation methods, and performed some of the analysis
Impact none yet
Start Year 2013