CCP4: Low resolution complexes handling difficult data; empowering structural biologists and supporting UK structural biology

Lead Research Organisation: Science and Technology Facilities Council
Department Name: Computational Science & Engineering

Abstract

Structural Biology is a powerful tool for analysing biological molecules. In essence it locates the atoms that make up proteins and other biomolecules. It is used in basic science to define the 3D structures of such molecules as the ribosome, ion channels and complex protein assemblies such as ATPase. Once a 3D 'model' of all the atoms in the biological model is available, it is possible to make deductions about how the molecule functions within the organism. This knowledge has corresponding applications in controlling that function, often leading to medical advances in drug design, better understanding of biological and feedback systems in the natural environment, new approaches in chemical engineering, and many other benefits. The process of 'solving' the structure involves first obtaining crystals and then exposing these to X-rays: this is equivalent to using a very powerful microscope and allows us to 'see' individual atoms. Getting crystals is in itself very demanding and involves a lot of skill and scientific insight. There is an additional problem in using X-rays in that there is no lens system for the microscope. This means that indirect, so-called diffraction images of the crystal are recorded. The process of extracting data from these images and proceeding to a final structure is computationally intensive and requires an entirely different set of skills from those of the biologist. Developing new computer methods and crucially embedding them in robust easy-to-use software will transform structural biology from a labour intensive highly technical process to a routine tool in biology. The main focus of this grant is on the computational process of solving 3D structures. CCP4 is a project which has collected, developed, packaged and distributed software for the many stages of this process, acting as a focus for methods development for over 25 years and making software available to both Academic and Industrial teams working in this area. It is an excellent example of technology transfer of basic science into a major UK Industry: Pharmaceutical development. The problem cannot yet be said to be totally solved. Biologists firstly are tackling more and more difficult structures, which require novel techniques. In addition, they are solving large number of sometimes related structures, such as a series of mutants and complexes, where automated high throughput is vital and systematic record keeping is essential. This proposal is focussed on enabling these developments.

Technical Summary

1.RA Post 1. Low resolution structures present special problems. Template-matching methods developed by Dr Cowtan provide a proven basis for the interpretation of maps of limited quality, and these methods will be enhanced by the use of larger scale structural information; this approach will be applied to model building and refinement at lower resolutions. 2. RA Post 2. Integrating intensities from diffraction images represents a crucial and fundamental step in the structure determination process, as all subsequent steps are dependent on these measurements and their estimated standard uncertainties. New data processing algorithms will be developed to handle multiple lattices, to automate processing and data analysis and to deal with the new Pilatus detectors. 3. RA Post 3. A database back-end will be developed to systematise and store all the data relevant to structure solution. We will design sensible 'wizards' to improve the consistency and accelerate the rate of structure determination, based on systematic handling of this information. 4. RA Post 4. Finally, for these tools to have the maximum benefit they will be translated into robust well-supported software distributed to the community. Software produced by tasks 1-3, by other CCP4 supported projects, and by independent contributors will be integrated into the suite in a way that is intuitive for a non-specialist. These major tasks will be complemented by suite-wide changes that keep CCP4 relevant to the latest challenges, such as large complexes.

Publications

10 25 50
publication icon
Beaven G (2014) Crystallization and preliminary X-ray characterization of the 2,4'-dihydroxyacetophenone dioxygenase from Alcaligenes sp. 4HAP. in Acta crystallographica. Section F, Structural biology communications

publication icon
Keegan R (2014) Structure of the 2,4'-dihydroxyacetophenone dioxygenase from Alcaligenes sp. 4HAP. in Acta crystallographica. Section D, Biological crystallography

publication icon
Keegan RM (2011) Evaluating the solution from MrBUMP and BALBES. in Acta crystallographica. Section D, Biological crystallography

publication icon
Krissinel E (2011) Macromolecular complexes in crystals and solutions. in Acta crystallographica. Section D, Biological crystallography

publication icon
Krissinel E (2012) Enhanced fold recognition using efficient short fragment clustering. in Journal of molecular biochemistry

publication icon
Lovelace J (2010) Simulation of modulated reflections in Journal of Applied Crystallography

publication icon
Murshudov GN (2011) REFMAC5 for the refinement of macromolecular crystal structures. in Acta crystallographica. Section D, Biological crystallography

publication icon
Potterton L (2018) CCP4i2: the new graphical user interface to the CCP4 program suite. in Acta crystallographica. Section D, Structural biology

publication icon
Read RJ (2011) A new generation of crystallographic validation tools for the protein data bank. in Structure (London, England : 1993)

 
Description The grant has funded the continued development of a software suite for solving macromolecular structures by the technique of X-ray crystallography. These structures are made publicly available via an international repository, and are used in drug development, biotechnology and basic biology. During the period of this grant, new algorithms have been developed and the useability of the software has been improved. The former are necessary in order to keep pace with new experimental techniques, while the latter is part of an on-going process of making the technique more accessible to lab scientists.
We have also continued a programme of training and dissemination. The highlight is the annual Study Weekend, in which the latest computational techniques are presented to an audience of over 400 crystallographers. We have also organised and participated in numerous training schools and conferences.
Exploitation Route The software is distributed worldwide to thousands of scientists, who use it in their individual research projects.
Sectors Agriculture, Food and Drink,Digital/Communication/Information Technologies (including Software),Healthcare,Manufacturing, including Industrial Biotechology,Pharmaceuticals and Medical Biotechnology

URL http://www.ccp4.ac.uk/
 
Description The software is distributed worldwide to thousands of scientists, who use it in their individual research projects. Typically, CCP4 software is used to solve a macromolecular structure which is then deposited in the Protein Data Bank. These structures are subsequently used in various areas of research, including pharmaceutical, biotechnology and agrochemical. The CCP4 suite, and individual programs, are cited many times in this context. The algorithms developed under this grant (those named in the grant underpin the programs Buccaneer, Pirate, Mosflm, Aimless, etc but other programs have also been improved during this grant) have been published, and help inspire further developments. For example, improvements to Mosflm for multiple lattices and new detectors are being incorporated into the next generation Dials software. The next generation GUI, named ccp4i2, is currently in alpha-release. The infrastructure provided has been used by several developers to code pipelines for structure solution. Finally, the grant has also covered training and dissemination via schools and the Study Weekend. These mostly train the next generation of crystallographers in the best way to use state-of-the-art software, but have also helped to inspire the next generation of software developers in this field.
First Year Of Impact 2008
Sector Agriculture, Food and Drink,Digital/Communication/Information Technologies (including Software),Healthcare,Manufacturing, including Industrial Biotechology,Pharmaceuticals and Medical Biotechnology
Impact Types Economic

 
Description CCP4 Advanced integrated approaches to macromolecular structure determination
Amount £275,180 (GBP)
Funding ID BB/S007040/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 04/2019 
End 03/2024
 
Description CCP4 Grant Renewal 2014-2019: Question-driven crystallographic data collection and advanced structure solution
Amount £11,705 (GBP)
Funding ID BB/L008777/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 03/2014 
End 03/2019
 
Description West-Life (Horizon 2020)
Amount € 3,981,000 (EUR)
Funding ID EINFRA-9-2015 
Organisation European Commission 
Sector Public
Country European Union (EU)
Start 10/2015 
End 09/2018
 
Title CCP4 software suite 
Description The software developed in this award, as well as earlier awards and current ones, is licensed for use by macromolecular crystallographers. The software allows the solution of macromolecular structures by X-ray crystallography. 
IP Reference  
Protection Copyrighted (e.g. software)
Year Protection Granted
Licensed Yes
Impact The suite is the largest available in the world for macromolecular crystallography, and underpins a significant fraction of research in structural biology.
 
Title CCP4 next generation GUI ccp4i2 
Description The new CCP4 GUI provides infrastructure for semi-automated structure solution requiring minimal user input and with clear feedback and guidance. Results are presented in graphics-rich reports, which offer links to visualization tools such as Coot, CCP4mg, and HKLVIEW. The GUI allows detailed expert intervention where necessary, and works on Linux, MacOSX and Windows. The current level of implementation corresponds to an alpha release, available to developers and expert users, and with functionality spanning the major pipelines for structure solution. 
Type Of Technology Software 
Year Produced 2013 
Open Source License? Yes  
Impact This software is of interest to the project CCP-EM for electron cryo-microscopy, and selected components are likely to be re-used in that project. This will help create synergies between structural biology techniques by providing common infrastructures, and facilitating data/metadata transfer. 
URL http://ccpforge.cse.rl.ac.uk/gf/project/ccp4i2/
 
Title CCP4 software suite release 6.1.0 
Description CCP4 software suite for structure solution by X-ray macromolecular crystallography. 
Type Of Technology Software 
Year Produced 2008 
Impact No actual Impacts realised to date 
URL http://www.ccp4.ac.uk/
 
Title CCP4 software suite release 6.2.0 
Description Major release of CCP4 software suite. 
Type Of Technology Software 
Year Produced 2011 
Impact No actual Impacts realised to date 
URL http://www.ccp4.ac.uk/
 
Title CCP4 software suite release 6.3.0 
Description Major release of the CCP4 software suite for macromolecular crystallography 
Type Of Technology Software 
Year Produced 2012 
Impact No actual Impacts realised to date 
URL http://www.ccp4.ac.uk/
 
Title CCP4 software suite release 6.4.0 
Description Major release of the CCP4 software suite for macromolecular crystallography. 
Type Of Technology Software 
Year Produced 2013 
Impact The software suite is downloaded many thousand times, and is used in the solution of macromolecular structures which are then deposited in the Protein Data Bank. We don't have accurate statistics on how many structures were solved partially or completely with CCP4 software, but based on citations and browsing of recent PDB entries we expect it to be comfortably > 50%. These structures are then a permanent resource, used by structural scientists, bioinformaticians and biotech scientists. 
URL http://www.ccp4.ac.uk/download/
 
Description CCP4 Study Weekends 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Type Of Presentation workshop facilitator
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact The CCP4 Study Weekend is recognised as the best conference for computational methods in macromolecular crystallography (as opposed to those focussed on the scientific results). As such it attracts the leading international developers and an audience of over 400. Each year it provides a snapshot of the state-of-the-art.
Lunchtime bytes provides an opportunity for software developers to demonstrate their programs to the delegates at the meeting. Software from both CCP4 and CCP-EM is demonstrated each year.

The proceedings of each year's conference are published in a special issue of Acta Crystallographica D. Articles in these issues are usually highly cited, as they describe methods used by many crystallographers.
Year(s) Of Engagement Activity Pre-2006,2006,2007,2008,2009,2010,2011,2012,2013,2014,2015,2016,2017,2018,2019,2020
URL http://www.ccp4.ac.uk/ccp4course.php
 
Description CCP4/APS School in Macromolecular Crystallography 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Type Of Presentation workshop facilitator
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact The annual CCP4/APS summer school at Argonne National Lab aims to teach students how to solve macromolecular structures via X-ray crystallography. The course demonstrates CCP4 software (developed under BB/F020805) and Ample software (developed under BB/H013652).

Students bring their own diffraction datasets, and some new structures are solved each year.
Year(s) Of Engagement Activity 2008,2009,2010,2011,2012,2013,2014
URL http://www.ccp4.ac.uk/schools/APS-2014/
 
Description Tutoring at CEITEC Winter School on Structural Cell Biology 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact I gave talks on "De novo protein modelling and its use in interpreting experimental structural data" and "Volume data from cryoEM and crystallography: fitting and building atomic models and matching against other volume data". These sparked questions from the students and discussions with other tutors.

I invited my host at CEITEC to give a talk at the CCP-EM Spring Symposium later that year.
Year(s) Of Engagement Activity 2015