Protein structure determination from nuclear magnetic resonance (NMR) spectroscopy using swarm intelligence

Lead Research Organisation: University of Portsmouth
Department Name: Inst of Biomedical and Biomolecular Sc

Abstract

Collectively, social insects are capable of performing complex tasks such as foraging for food, cooperative hunting and nest building that are beyond the capabilities of the isolated individuals. Whilst exploring their environment searching for food, ants deposit pheromones on the ground. Once food is found, a laden ant retraces its steps following the pheromone trail back to the nest. The pheromone signals slowly decay, but those ants which, by chance, find short routes from nest to food and back again will spend a greater proportion of their time along any one section of that route, and thus intensify the pheromone trail. These stronger trails draw in other ants through stigmergy - an innate attraction to the pheromone scent - resulting in a self-optimisation of the shortest routes and thus the development of ideal foraging pathways. Such 'swarm intelligence' - the emergence of collective intelligence amongst groups of simple individuals - has been used in mathematical algorithms to solve complex optimisation problems encountered in communication networks and robotics, but never before has it been applied to structural biology. We want to apply such algorithms to the problem of protein structure determination from nuclear magnetic resonance (NMR) spectroscopy data, a task which typically takes a trained spectroscopist weeks or even months to perform. We have constructed a swarm of biomolecular ants which are capable of exploring their conformational space by molecular dynamics simulations, analogous to a colony of real ants exploring their own physical territory. The ants communicate via a pheromone trail: a global list of ideal interatomic distances that are laid down by the ants following simulation of the NMR data, and that are applied as geometric restraints in the molecular dynamics simulations. If any one of the biomolecular ants encounters a structural element that satisfies the experimental data (such as an alpha-helical turn or a beta-hairpin loop), this knowledge is communicated to the other ants in the swarm via the geometric distance restraints thus encouraging them to adopt the same local conformation. Therefore, over time, the ants cooperate in finding the optimal set of inter-atomic distances and therefore determine their own solution structure. Our initial trials were so successful that we patent-protected the swarm-intelligence NMR (SI-NMR) concept. We now want to develop the technique to the point where it is universally applicable to all proteins, and also to the structure determination of protein-protein and protein-inhibitor complexes. It is hoped that this brand new SI-NMR technology would then become accepted as the state-of-the-art method for protein structure determination in both the academic and industrial NMR communities.

Technical Summary

Nuclear magnetic resonance (NMR) spectroscopy holds the unique position amongst spectroscopic and diffraction methods as the only technique that can produce atomic resolution structures of proteins in solution. In de novo protein structure determinations by NMR, the key geometric restraints are distance limits derived from inter-proton nuclear Overhauser effects (NOEs). The donor and acceptor protons in an NOE are identified from the chemical shifts of the NOE spectroscopy (NOESY) cross-peak. When each chemical shift relates to only a single proton resonance, the donor and acceptor protons can be assigned unambiguously. However, when either of the chemical shifts can be attributed to two or more proton resonances, the NOE is said to be ambiguous. To resolve any remaining ambiguity, we need to know the protein conformation, but to determine the conformation we need to assign all the NOEs. This dilemma is known as the NOE assignment problem. To date, the most successful strategies for resolving this problem have used 'iterative assignment' schemes and, in this respect, NMR protein structure determination has changed very little since the structures were solved in the 1980s. Instead, we will use 'swarm intelligence' to solve structures in a single-shot, non-iterative manner. These algorithms originate from studies of the collective behaviour of social insects. Collectively, these creatures are capable of performing complex tasks such as foraging for food and nest building that are beyond the capabilities of the isolated insects. Such algorithms have previously been used to solve complex combinatorial optimisation problems encountered in communication networks and robotics, but have never before been applied to structural biology. Our initial trials were so successful that we patent-protected the concept. We now wish to further develop the technology to the point where it is the state-of-the-art method for protein structure determination by NMR.

Publications

10 25 50
 
Description Collectively, social insects are capable of performing complex tasks such as foraging for food, cooperative hunting and nest building that are beyond the capabilities of the isolated individuals. Whilst exploring their environment searching for food, ants deposit pheromones on the ground. Once food is found, a laden ant retraces its steps following the pheromone trail back to the nest. The pheromone signals slowly decay, but those ants which, by chance, find short routes from nest to food and back again will spend a greater proportion of their time along any one section of that route, and thus intensify the pheromone trail. These stronger trails draw in other ants through stigmergy - an innate attraction to the pheromone scent - resulting in a self-optimisation of the shortest routes and thus the development of ideal foraging pathways.



Such "swarm intelligence" - the emergence of collective intelligence amongst groups of simple individuals - has been used in mathematical algorithms to solve complex optimisation problems encountered in communication networks and robotics, but never before has it been applied to structural biology. We want to apply such algorithms to the problem of protein structure determination from nuclear magnetic resonance (NMR) spectroscopy data, a task which typically takes a trained spectroscopist weeks or even months to perform.



We have constructed a swarm of biomolecular ants which are capable of exploring their conformational space by molecular dynamics simulations, analogous to a colony of real ants exploring their own physical territory. The ants communicate via a pheromone trail: a global list of ideal interatomic distances that are laid down by the ants following simulation of the NMR data, and that are applied as geometric restraints in the molecular dynamics simulations. If any one of the biomolecular ants encounters a structural element that satisfies the experimental data (such as an alpha-helical turn or a beta-hairpin loop), this knowledge is communicated to the other ants in the swarm via the geometric distance restraints thus encouraging them to adopt the same local conformation. Therefore, over time, the ants cooperate in finding the optimal set of inter-atomic distances and therefore determine their own solution structure.



The most significant achievements from the grant were as follows:-

1. Further development and validation of the swarm intelligence (SI) NMR technology;

2. Design and development of the NMRSwarm program that implements the SI-NMR

methodology in a robust, versatile and user-friendly piece of software;

3. Successful transfer of the knowledge to industry (Pfizer, Sandwich).
Exploitation Route We filed for patent protection of the intellectual property with the European Patent Office through Isis Innovation (the technology transfer company owned by Oxford University). In the written opinion of the International Searching Authority, the patent examiner found the technique to possess novelty, an inventive step and industrial applicability. Subsequently, the approved patent was published on the European Patent Office website (WO2007023289). Patent protection is also being sought in the

US and Japan.



An exclusive short-term license was granted by ISIS Innovation to a major NMR instrument manufacturer for a preliminary investigation of the technology. However, the company declined to further extend the license reasoning that, from a business perspective, it's hard for an instrument manufacturer who sells relatively small volumes of very expensive instruments to charge extra money for a piece of software that enhances (rather than enables) the structure determination process.



Given the above (perfectly rational) objection to long-term licensing from a major NMR manufacturer, the most likely route of commercial exploitation is through providing an NMR structure determination service by a spin-out company. This option is currently being considered by ISIS, although further commercial demonstrations of the technology's applicability would be required before venturing along this path.
The swarm intelligence NMR-based structure determination of three estrogen-receptor targeting stapled peptides (see Phillips et al., 2011), undertaken in association with Pfizer, generated NMR data and atomic coordinates. The NMR chemical shifts have been deposited in the Biological Magnetic Resonance Bank (BMRB) with accession codes 17657-17659. The atomic coordinates have been deposited in the Protein Databank (PDB) with accession codes 2LDA, 2LDC and 2LDD. In addition, the chemical shift assignments for the hemopexin domain of matrix metalloproteinase-1 will shortly be deposited in the BMRB.
Sectors Pharmaceuticals and Medical Biotechnology,Other

 
Description In collaboration with scientists from Pfizer, the swarm intelligence algorithms have been used in the structure determination of chemically-synthesised peptides targeting the estrogen receptor (published in Phillips et al, 2011), and of sodium channel blockers (results currently held by industrial partner).
First Year Of Impact 2010
Sector Pharmaceuticals and Medical Biotechnology
Impact Types Economic

 
Title NMRSwarm : Software for NMR structure determination from NOESY data 
Description "Improvements to research infrastructure" is not a good description, but is the most appropriate of the available options. NMRSwarm is a self-contained program for swarm intelligence-based structure determination from NOESY data. Full details are provided in the supplementary information to our publication Phillips et al. (2011). 
Type Of Material Improvements to research infrastructure 
Year Produced 2012 
Provided To Others? Yes  
Impact No major impact. 
 
Title NMR chemical shifts of stapled peptide SP1 (Ac-HXILHXLLQDS-NH2) 
Description NMR chemical shifts of stapled peptide SP1 (Ac-HXILHXLLQDS-NH2) 
Type Of Material Database/Collection of data 
Year Produced 2011 
Provided To Others? No  
Impact No actual impacts realised to date 
URL http://www.bmrb.wisc.edu/data_library/summary/index.php?bmrbId=17658
 
Title NMR chemical shifts of stapled peptide SP2 (Ac-HKXLHQXLQDS-NH2) 
Description NMR chemical shifts of stapled peptide SP2 (Ac-HKXLHQXLQDS-NH2) 
Type Of Material Database/Collection of data 
Year Produced 2011 
Provided To Others? No  
Impact No actual impacts realised to date 
URL http://www.bmrb.wisc.edu/data_library/summary/index.php?bmrbId=17657
 
Title NMR chemical shifts of stapled peptide SP6 (Ac-EKHKILXRLLXDS-NH2) 
Description NMR chemical shifts of stapled peptide SP6 (Ac-EKHKILXRLLXDS-NH2) 
Type Of Material Database/Collection of data 
Year Produced 2011 
Provided To Others? No  
Impact No actual impacts realised to date 
URL http://www.bmrb.wisc.edu/data_library/summary/index.php?bmrbId=17659
 
Title NMR structure of stapled peptide SP1 (Ac-HXILHXLLQDS-NH2) 
Description RCSB protein databank Accession code: 2LDC URL: http://www.rcsb.org/pdb/explore.do?structureId=2LDC 
Type Of Material Database/Collection of data 
Year Produced 2011 
Provided To Others? No  
Impact No actual impacts realised to date 
URL http://www.rcsb.org/pdb/explore.do?structureId=2LDC
 
Title NMR structure of the estrogen receptor-binding stapled peptide SP2 (Ac-HKXLHQXLQDS-NH2) 
Description RCSB protein databank Accession code: 2LDA URL: http://www.rcsb.org/pdb/explore.do?structureId=2LDA 
Type Of Material Database/Collection of data 
Year Produced 2011 
Provided To Others? No  
Impact No actual impacts realised to date 
URL http://www.rcsb.org/pdb/explore.do?structureId=2LDA
 
Title Solution structure of the estrogen receptor-binding stapled peptide SP6 (Ac-EKHKILXRLLXDS-NH2) 
Description Solution structure of the estrogen receptor-binding stapled peptide SP6 (Ac-EKHKILXRLLXDS-NH2) 
Type Of Material Database/Collection of data 
Year Produced 2011 
Provided To Others? No  
Impact No actual impacts realised to date 
 
Description NMR structural characterisation of stapled peptides targetting the estrogen receptor 
Organisation Pfizer Ltd
Country United Kingdom 
Sector Private 
PI Contribution Synthetic peptides that specifically bind nuclear hormone receptors offer an alternative approach to small molecules for the modulation of receptor signaling and subsequent gene expression. We undertook the structural characterisation, using swarm intelligence NMR, of a series of novel stapled peptides that bind the coactivator peptide site of estrogen receptors.
Start Year 2010
 
Title BIOMOLECULAR STRUCTURE DETERMINATION INVOLVING SWARM INTELLIGENCE 
Description A method and computer program for determining biomolecular structures from experimental data that includes ambiguous experimental observables comprising the steps of : creating a swarm of molecular structure generators; having the ability to allow the communication between said generators via a global set of molecular restraints; determining said molecular structure in a cooperative manner by using self-optimization of a multi-agent system. In one embodiment, NOESY spectra are used to calculate inter-proton distances of the biomolecular structure and said distances are used as molecular restraints. Moreover, the method involves restrained molecular dynamics simulations, e.g. by simulated annealing. 
IP Reference WO2007023289 
Protection Patent application published
Year Protection Granted 2007
Licensed No
Impact See other entries.
 
Title NMRswarm 
Description A graphical user interface for swarm intelligence-based protein structure determination from NMR data. 
Type Of Technology Software 
Year Produced 2011 
Impact No notable impacts to date.