The Protein Circular Dichroism Data Bank, the DichroWeb Server, and ValiDichro: Data Sharing, Analysis and Standards Resources for CD Spectroscopy

Lead Research Organisation: Birkbeck, University of London
Department Name: Biological Sciences

Abstract

Circular Dichroism (CD) spectroscopy is a widely used method in structural biology for determining protein secondary structure, detecting conformational changes associated with different conditions such as ligand binding, and examining macromolecular interactions and protein folding. It is regularly used as a fundamental characterisation method in a large number of both academic and industrial laboratories and is a method designated by regulatory authorities for the characterisation of pharmaceutical proteins produced for use in humans. The worldwide development of Synchrotron Radiation Circular Dichroism (SRCD) beamlines has further extended the utility and applications of this spectroscopic technique.
The proposal is for a renewal of the Bioinformatics and Biological Resources Fund project "Support for the Protein Circular Dichroism Data Bank and the DichroWeb Analysis Server", which thus far has enabled the creation of the Protein Circular Dichroism Data Bank (PCDDB) for data sharing, the operation of the analysis webserver DichroWeb, and the advancement of data quality standards and validation protocols.
This project's aims are to enhance and maintain these comprehensive electronic archiving and analysis resources for CD spectroscopy that are used by the academic and industrial structural biology and bioinformatics communities and to develop new analysis tools for structural biology. The project includes the progressive development, curation, and operation of the PCDDB, a searchable and downloadable deposition data bank enabling free public access to CD data. This data bank was opened for operation earlier this year and is unique world-wide. It provides public archiving and search facilities for open access to circular dichroism spectral and metadata, much in the manner of the Protein Data Bank (PDB), a long-existing and valuable reference data bank resource for protein crystal and NMR data. This proposal would support continuing development, enhancements, and modifications to the PCDDB based on our experiences gained in operating it and user feedback, as well as enabling it to be run as an ongoing user resource. The project also includes the upgrading and continued operation of DichroWeb, a user-friendly widely-used online server for the analysis of circular dichroism data. In addition, it includes the development of a new suite of support tools for novel types of analyses (including cross-methodological ones), creation of common data formats for all commercial CD instruments and SRCD beamlines, validation software (to guide quality standards of data acquisition and processing, and ensure the integrity of PCDDB entries), and a "one-stop-shop" server providing a pipeline for processing, analysis, display, and deposition tools (thereby simplifying and improving the use of CD spectroscopy by the non-expert community, and the speeding up of these processes for regular CD users). These projects have strong support from the UK and international user communities as indicated by the letters of support included with the proposal.
Together these bioinformatics resources provide a comprehensive package of enabling and supportive tools in CD spectroscopy for the academic and industrial structural biology and bioinformatics communities.

Technical Summary

Circular Dichroism (CD) spectroscopy is used in structural biology for determining protein secondary structure, detecting conformational changes, and examining macromolecular interactions and protein folding, and is a method designated by regulatory agencies for the characterisation of pharmaceutical proteins for human use.
This proposal is to enhance, curate and operate a comprehensive set of electronic archiving and analysis resources for CD spectroscopy.
It includes the Protein Circular Dichroism Data Bank (PCDDB), a deposition, searchable and downloadable data bank of CD spectra, which began operation earlier this year. The aim of the data bank is to provide public archiving facilities and open access to validated circular dichroism spectral and metadata. The project would support continuing development, enhanced functionalities and modifications based on our experience gained in operating it and user feedback, as well as enabling it to be run as an ongoing user resource. The project also includes enhancements to, and operation of, DichroWeb, a widely-used online server for the analysis of circular dichroism data.
This proposal includes the development of a suite of support tools for novel types of CD analyses based on data deposited in the data bank, including spectral nearest neighbour identification, spectral matching (with applications in bioprocessing), and back calculations of spectra from crystallographic coordinates. It will also include enhanced validation software (establishing standards for spectroscopic data and ensuring the data quality in the PCDDB). A further development will be a one-stop-shop CDpipeline server that will incorporate processing, display, analyses, validation and deposition (aiding casual users of the method as well as improving throughput for more advanced users).
Together these bioinformatics resources will provide enabling and supportive tools in CD spectroscopy for the structural biology and bioinformatics communities.

Planned Impact

Circular dichroism (CD) spectroscopy is a widely used technique in structural biology. The resources proposed would benefit those who utilise CD in both academia and the commercial sector. Because CD is a spectroscopic technique currently meeting ICH Guidelines for characterisation of pharmaceuticals for human use, there has already been significant interest in the tools described in this proposal expressed by the pharmaceutical industry, SMEs and regulatory agencies such as the European Medicines Agency and the US Food and Drug Administration. The Protein Circular Dichroism Data Bank archive will be a resource for well-characterised protein spectra and can be a traceable resource for documentation of medicinal proteins (for bioprocessing and biosimilars comparisons). The validation tools and spectral comparisons and metrics tools to be developed will have value in protein characterisations and quality evaluations, and DichroWeb has already proven to be a useful tool by big pharma, SME and food industry users.

Publications

10 25 50

publication icon
Colledge M (2017) AnglerFish: a webserver for defining the geometry of a-helices in membrane proteins. in Bioinformatics (Oxford, England)

 
Description development and curation of new bioinformatics tools
this has had an enduring effect on availability and traceability and data sharing of spectroscopic data worldwide, and on new ways of analysing and validating CD data
Exploitation Route various tools have already been cited by more than 7000 users analysis website has enabled more than 960,000 analyses to be done. the PCDDB data bank is the only (and highly used) data sharing resource for CD spectra and meta data
we have taught many hundreds of students, staff, faculty members, and industrial people in workshops we have run around the world.
Sectors Agriculture, Food and Drink,Education,Manufacturing, including Industrial Biotechology,Pharmaceuticals and Medical Biotechnology

URL http://pcddb.cryst.bbk.ac.uk/
 
Description highly used by industry and academic for analyses and data-sharing of protein circular dichroism spectra
First Year Of Impact 2005
Sector Chemicals,Pharmaceuticals and Medical Biotechnology
Impact Types Economic

 
Description An International UK-Brazil Collaboration using Synchrotron Radiation Circular Dichroism Spectroscopy to Study Protein Structure and Function.
Amount £27,000 (GBP)
Funding ID BBJ0197471 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 07/2012 
End 06/2015
 
Description BBSRC-FAPSP partnering grant
Amount £35,000 (GBP)
Funding ID BB/N012763/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 01/2016 
End 12/2018
 
Description Biophysical studies of the structure/function of antimicrobial peptides and enzymes isolated from extremophile organisms
Amount £35,426 (GBP)
Funding ID BB/N012763/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 01/2016 
End 10/2018
 
Description Brazil Senior Visiting Fellowship
Amount R$ 115,240 (BRL)
Organisation National Council for Scientific and Technological Development (CNPq) 
Sector Public
Country Brazil
Start 03/2014 
End 02/2016
 
Description Collaborative Sino-UK Synchrotron Radiation Circular Dichroism Spectroscopy Studies and International SRCD Workshop
Amount £24,800 (GBP)
Funding ID 1289 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 07/2007 
End 07/2011
 
Description International Exchange
Amount £3,000 (GBP)
Organisation The Royal Society 
Sector Charity/Non Profit
Country United Kingdom
Start 02/2012 
End 05/2012
 
Description JCAMPX-CD Standardisation of Formats for CD and SRCD spectroscopy
Amount £9,600 (GBP)
Funding ID 2010-033-2-024 
Organisation International Union of Pure and Applied Chemistry (IUPAC) 
Sector Charity/Non Profit
Country United States
Start 02/2010 
End 03/2012
 
Description Science without Borders Visiting Researcher
Amount R$ 11,524,000 (BRL)
Organisation National Council for Scientific and Technological Development (CNPq) 
Sector Public
Country Brazil
Start 03/2014 
End 02/2016
 
Description Science without Borders psotdoctoral fellowships (3)
Amount R$ 1 (BRL)
Organisation National Council for Scientific and Technological Development (CNPq) 
Sector Public
Country Brazil
Start 09/2012 
End 01/2016
 
Description UK-China Partnering Grant
Amount £11,400 (GBP)
Funding ID IE121428 
Organisation The Royal Society 
Sector Charity/Non Profit
Country United Kingdom
Start 03/2013 
End 03/2015
 
Title Dichroweb analysis website 
Description software highly used by academic and industrial labs 
Type Of Material Improvements to research infrastructure 
Provided To Others? Yes  
Impact high usage by academic labs, universities for teaching, and industrial labs 
URL http://dichroweb.cryst.bbk.ac.uk/html/home.shtml
 
Title PCDDB 
Description data bank for validated circular dichroism spectra 
Type Of Material Improvements to research infrastructure 
Year Produced 2014 
Provided To Others? Yes  
Impact more than 500,000 downloads already 
URL http://pcddb.cryst.bbk.ac.uk
 
Title Validichro 
Description server for validation of circular dichroism spectra 
Type Of Material Improvements to research infrastructure 
Year Produced 2013 
Provided To Others? Yes  
Impact used thousands of times by outside users 
URL http://pcddb.cryst.bbk.ac.uk/home.php
 
Title new software tools for CD analyses 
Description software and websites and you tube videos about new infrastructure tools developed for CD spectroscopy and structural biology 
Type Of Material Improvements to research infrastructure 
Year Produced 2018 
Provided To Others? Yes  
Impact ongoing development of many new tools for bioinformatics and structural biology resources widespread usage of tools both in the UK and internationally 
 
Title oriented circular dichroism spectroscopy - methods development and software analysis tools 
Description development of methodology and software analysis tools for oriented CD spectroscopy 
Type Of Material Improvements to research infrastructure 
Year Produced 2017 
Provided To Others? Yes  
Impact other research groups interested in/using our Anglerfish server 
URL http://anglerfish.cryst.bbk.ac.uk/
 
Title Protein Circular Dichroism Data Bank 
Description user deposition of spectral and meta data obtained circular dichroism of proteins 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
Impact more than 2M downloads of contents used by industry and academia led to the development of new methodologies by us and others 
 
Title The MSP180 Reference Data Set 
Description reference data set for the analysis of membrane protein circular dichroism spectra 
Type Of Material Database/Collection of data 
Year Produced 2011 
Provided To Others? No  
Impact No actual impacts realised to date 
URL http://pcddb.cryst.bbk.ac.uk/home.php
 
Title The SP175 reference Data Set 
Description Data Set of Reference CD spectra of proteins deposited in the Protein Circular Dichroism Data Bank. protein circular dichoism data bank http://pcddb.cryst.bbk.ac.uk/home.php 
Type Of Material Database/Collection of data 
Year Produced 2009 
Provided To Others? No  
Impact No actual impacts realised to date 
URL http://pcddb.cryst.bbk.ac.uk/home.php
 
Title curartion and development of data base of circular dichroism spectroscopy and links to other bioinformatics resources 
Description updated and expanded database and cross-referencing with other data bases such as PDB, Uniprot 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
Impact MANY downloads ((>1,000,000 files) by many research groups around the world 
 
Description Collaboration on Intrinsically disordered proteins 
Organisation Institute of Cancer Research UK
Country United Kingdom 
Sector Academic/University 
PI Contribution Biophysical Studies on intrinsically disordered proteins Circular Dichroism Spectroscopy of intrinsically disordered protein
Collaborator Contribution Bioinformatics of Intrinsically disordered proteins
Impact Publication in F1000REs 2019
Start Year 2018
 
Description Collaboration with Brookhaven National Lab (Dr. J.Sutherland) 
Organisation Brookhaven National Laboratory
Department National Synchrotron Light Source
Country United States 
Sector Public 
PI Contribution Enabled us to access the Brookhaven SRCD beamline when there was none in the UK
Collaborator Contribution provided access to instrumentation
Impact students and postdocs had access to SRCD beamline when there was none in the UK. the Brookhaven beamline has now been shut down and our collaborator has retired
Start Year 2009
 
Description Collaboration with University of Sao Paulo 
Organisation Universidade de São Paulo
Department Physics Institute of São Carlos
Country Brazil 
Sector Academic/University 
PI Contribution i have been involved in helping my brazilian collaborators prepare a proposal for an SRCD beamline at the Brazilian synchrotron
Collaborator Contribution we have worked together on the proposal and planning
Impact several publications, a long term partnership leading to several BBSRC partnering grants, development of a new SRCD beamline at the Brazilian synchrotron, submission and funding of a new BBSRC partnering award
Start Year 2012
 
Description collaboration with Janes Lab (Queen Mary University of London) 
Organisation Queen Mary University of London
Department School of Biological and Chemical Science QMUL
Country United Kingdom 
Sector Academic/University 
PI Contribution a wide range of common outcomes, including publications, workshops, international talks
Collaborator Contribution a wide range of common outcomes, including publications, workshops, international talks
Impact publications, outreach, software, video (You Tube) instruction mannuals, teaching at international workshops, presentations at international meetings
 
Description collaborations with CD users 
Organisation Beijing Synchotron Radiation Facility
Country China 
Sector Academic/University 
PI Contribution cd data collection and analyses
Collaborator Contribution beamtime made available
Impact publications
Start Year 2013
 
Title cdtoolx 
Description open access downloadable software for processing and analysis of CD data 
Type Of Technology Software 
Year Produced 2018 
Impact many UK and international users used for teaching and research purposes 
 
Title this package of tools include many new analysis websites 
Description multiple websites for different types of analyses of proteins based on CD spectroscopic data and its relationships to other biophysical methods the outputs were realised in all years of this grant so far (from 2017-2019) but this form only allows one year, so i have chosen 2019 
Type Of Technology Webtool/Application 
Year Produced 2019 
Open Source License? Yes  
Impact large numbers of publications by UK and other groups world wide use and cite this software and use it for teaching purposes 
 
Description Engagement with International Elixer programme 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Study participants or study members
Results and Impact Elixer group on Intrinsically disordered proteins organised through £40,296
Year(s) Of Engagement Activity 2019,2020
 
Description Running training workshops for students, postdocs and industrial scientists on CD spectroscopy 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Teaching Lectures and Practicals in multi-day workshops to provide new skills training and ideas and information for researchers both nationally and internationally
Year(s) Of Engagement Activity Pre-2006,2006,2007,2008,2009,2010,2011,2012,2013,2014,2015,2016,2017,2018
 
Description School visit (Tonbridge schools science day) 
Form Of Engagement Activity Participation in an open day or visit at my research institution
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Schools
Results and Impact Keynote speaker and poster judge at Tonbridge Area Schools Science day
Year(s) Of Engagement Activity 2019
 
Description Talks at UK and international sites and meetings: Brazil Synchrotron Symposium, ), University of Vienna Austria (Chemistry Dept), Biochemical Society Training Course (Aston University, UK) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talks about Tools and Resources developed in the course of this grant at Universities and International meetings, including the headline talk at the inauguration of the Brazil Synchrotron SRCD beamline (CEDRO) held for the international community of SRCD users and potential users, The Biochemical Society training course on Membrane Proteins at Aston University, the chemistry dept at the University of Vienna.
Year(s) Of Engagement Activity 2022
 
Description Teaching at National and International Workshops for students and postdocs on the tools and resources we have developed 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Training workshops on CD spectroscopy in the UK (Leeds and Birmingham), at meetings in the US (NIH Biophysics Course), at the EBSA Biophysics Workshop in France, and the IUPAB Workshop in Cuba.
Year(s) Of Engagement Activity 2018,2019,2020
 
Description Teaching workshops for students, faculty and industrial scientists 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact A series of workshops on methods of analysis for CD spectroscopy given at national and international meetings and workshops, usually 2-3 days, including hands-on computing practicals. for phd and masters students, postdocs, professors and other academics and industry
Year(s) Of Engagement Activity 2009,2010,2011,2012,2013,2014,2015,2016,2017,2018
 
Description annual talks for the public 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Public/other audiences
Results and Impact UCL Science Centre Friday Evening Discourses and Birkbeck SET Talks for the Public talks on science for the public (5th, 6th form, and public) annually and lecture/tour at the Wellcome Collection talks

no actual impacts realised to date
Year(s) Of Engagement Activity 2006,2007,2008,2009,2010,2011,2012,2013,2014,2015,2016,2017,2018,2019,2020,2021
 
Description international advisory boad (germany) 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other audiences
Results and Impact international advisory board (Germany)
Year(s) Of Engagement Activity 2011,2012,2013,2014,2015,2016,2017,2018