The Protein Circular Dichroism Data Bank, the DichroWeb Server, and ValiDichro: Data Sharing, Analysis and Standards Resources for CD Spectroscopy
Lead Research Organisation:
Birkbeck, University of London
Department Name: Biological Sciences
Abstract
Circular Dichroism (CD) spectroscopy is a widely used method in structural biology for determining protein secondary structure, detecting conformational changes associated with different conditions such as ligand binding, and examining macromolecular interactions and protein folding. It is regularly used as a fundamental characterisation method in a large number of both academic and industrial laboratories and is a method designated by regulatory authorities for the characterisation of pharmaceutical proteins produced for use in humans. The worldwide development of Synchrotron Radiation Circular Dichroism (SRCD) beamlines has further extended the utility and applications of this spectroscopic technique.
The proposal is for a renewal of the Bioinformatics and Biological Resources Fund project "Support for the Protein Circular Dichroism Data Bank and the DichroWeb Analysis Server", which thus far has enabled the creation of the Protein Circular Dichroism Data Bank (PCDDB) for data sharing, the operation of the analysis webserver DichroWeb, and the advancement of data quality standards and validation protocols.
This project's aims are to enhance and maintain these comprehensive electronic archiving and analysis resources for CD spectroscopy that are used by the academic and industrial structural biology and bioinformatics communities and to develop new analysis tools for structural biology. The project includes the progressive development, curation, and operation of the PCDDB, a searchable and downloadable deposition data bank enabling free public access to CD data. This data bank was opened for operation earlier this year and is unique world-wide. It provides public archiving and search facilities for open access to circular dichroism spectral and metadata, much in the manner of the Protein Data Bank (PDB), a long-existing and valuable reference data bank resource for protein crystal and NMR data. This proposal would support continuing development, enhancements, and modifications to the PCDDB based on our experiences gained in operating it and user feedback, as well as enabling it to be run as an ongoing user resource. The project also includes the upgrading and continued operation of DichroWeb, a user-friendly widely-used online server for the analysis of circular dichroism data. In addition, it includes the development of a new suite of support tools for novel types of analyses (including cross-methodological ones), creation of common data formats for all commercial CD instruments and SRCD beamlines, validation software (to guide quality standards of data acquisition and processing, and ensure the integrity of PCDDB entries), and a "one-stop-shop" server providing a pipeline for processing, analysis, display, and deposition tools (thereby simplifying and improving the use of CD spectroscopy by the non-expert community, and the speeding up of these processes for regular CD users). These projects have strong support from the UK and international user communities as indicated by the letters of support included with the proposal.
Together these bioinformatics resources provide a comprehensive package of enabling and supportive tools in CD spectroscopy for the academic and industrial structural biology and bioinformatics communities.
The proposal is for a renewal of the Bioinformatics and Biological Resources Fund project "Support for the Protein Circular Dichroism Data Bank and the DichroWeb Analysis Server", which thus far has enabled the creation of the Protein Circular Dichroism Data Bank (PCDDB) for data sharing, the operation of the analysis webserver DichroWeb, and the advancement of data quality standards and validation protocols.
This project's aims are to enhance and maintain these comprehensive electronic archiving and analysis resources for CD spectroscopy that are used by the academic and industrial structural biology and bioinformatics communities and to develop new analysis tools for structural biology. The project includes the progressive development, curation, and operation of the PCDDB, a searchable and downloadable deposition data bank enabling free public access to CD data. This data bank was opened for operation earlier this year and is unique world-wide. It provides public archiving and search facilities for open access to circular dichroism spectral and metadata, much in the manner of the Protein Data Bank (PDB), a long-existing and valuable reference data bank resource for protein crystal and NMR data. This proposal would support continuing development, enhancements, and modifications to the PCDDB based on our experiences gained in operating it and user feedback, as well as enabling it to be run as an ongoing user resource. The project also includes the upgrading and continued operation of DichroWeb, a user-friendly widely-used online server for the analysis of circular dichroism data. In addition, it includes the development of a new suite of support tools for novel types of analyses (including cross-methodological ones), creation of common data formats for all commercial CD instruments and SRCD beamlines, validation software (to guide quality standards of data acquisition and processing, and ensure the integrity of PCDDB entries), and a "one-stop-shop" server providing a pipeline for processing, analysis, display, and deposition tools (thereby simplifying and improving the use of CD spectroscopy by the non-expert community, and the speeding up of these processes for regular CD users). These projects have strong support from the UK and international user communities as indicated by the letters of support included with the proposal.
Together these bioinformatics resources provide a comprehensive package of enabling and supportive tools in CD spectroscopy for the academic and industrial structural biology and bioinformatics communities.
Technical Summary
Circular Dichroism (CD) spectroscopy is used in structural biology for determining protein secondary structure, detecting conformational changes, and examining macromolecular interactions and protein folding, and is a method designated by regulatory agencies for the characterisation of pharmaceutical proteins for human use.
This proposal is to enhance, curate and operate a comprehensive set of electronic archiving and analysis resources for CD spectroscopy.
It includes the Protein Circular Dichroism Data Bank (PCDDB), a deposition, searchable and downloadable data bank of CD spectra, which began operation earlier this year. The aim of the data bank is to provide public archiving facilities and open access to validated circular dichroism spectral and metadata. The project would support continuing development, enhanced functionalities and modifications based on our experience gained in operating it and user feedback, as well as enabling it to be run as an ongoing user resource. The project also includes enhancements to, and operation of, DichroWeb, a widely-used online server for the analysis of circular dichroism data.
This proposal includes the development of a suite of support tools for novel types of CD analyses based on data deposited in the data bank, including spectral nearest neighbour identification, spectral matching (with applications in bioprocessing), and back calculations of spectra from crystallographic coordinates. It will also include enhanced validation software (establishing standards for spectroscopic data and ensuring the data quality in the PCDDB). A further development will be a one-stop-shop CDpipeline server that will incorporate processing, display, analyses, validation and deposition (aiding casual users of the method as well as improving throughput for more advanced users).
Together these bioinformatics resources will provide enabling and supportive tools in CD spectroscopy for the structural biology and bioinformatics communities.
This proposal is to enhance, curate and operate a comprehensive set of electronic archiving and analysis resources for CD spectroscopy.
It includes the Protein Circular Dichroism Data Bank (PCDDB), a deposition, searchable and downloadable data bank of CD spectra, which began operation earlier this year. The aim of the data bank is to provide public archiving facilities and open access to validated circular dichroism spectral and metadata. The project would support continuing development, enhanced functionalities and modifications based on our experience gained in operating it and user feedback, as well as enabling it to be run as an ongoing user resource. The project also includes enhancements to, and operation of, DichroWeb, a widely-used online server for the analysis of circular dichroism data.
This proposal includes the development of a suite of support tools for novel types of CD analyses based on data deposited in the data bank, including spectral nearest neighbour identification, spectral matching (with applications in bioprocessing), and back calculations of spectra from crystallographic coordinates. It will also include enhanced validation software (establishing standards for spectroscopic data and ensuring the data quality in the PCDDB). A further development will be a one-stop-shop CDpipeline server that will incorporate processing, display, analyses, validation and deposition (aiding casual users of the method as well as improving throughput for more advanced users).
Together these bioinformatics resources will provide enabling and supportive tools in CD spectroscopy for the structural biology and bioinformatics communities.
Planned Impact
Circular dichroism (CD) spectroscopy is a widely used technique in structural biology. The resources proposed would benefit those who utilise CD in both academia and the commercial sector. Because CD is a spectroscopic technique currently meeting ICH Guidelines for characterisation of pharmaceuticals for human use, there has already been significant interest in the tools described in this proposal expressed by the pharmaceutical industry, SMEs and regulatory agencies such as the European Medicines Agency and the US Food and Drug Administration. The Protein Circular Dichroism Data Bank archive will be a resource for well-characterised protein spectra and can be a traceable resource for documentation of medicinal proteins (for bioprocessing and biosimilars comparisons). The validation tools and spectral comparisons and metrics tools to be developed will have value in protein characterisations and quality evaluations, and DichroWeb has already proven to be a useful tool by big pharma, SME and food industry users.
Organisations
Publications
Chin S
(2017)
Attenuation of Phosphorylation-dependent Activation of Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) by Disease-causing Mutations at the Transmission Interface.
in The Journal of biological chemistry
Colledge M
(2017)
AnglerFish: a webserver for defining the geometry of a-helices in membrane proteins.
in Bioinformatics (Oxford, England)
D'Avanzo N
(2022)
The T1-tetramerisation domain of Kv1.2 rescues expression and preserves function of a truncated NaChBac sodium channel
in FEBS Letters
Davey NE
(2019)
An intrinsically disordered proteins community for ELIXIR.
in F1000Research
Drew Elliot
(2016)
Protein Design for Decreased Disorder: SHERP as an Exemplar Protein
in BIOPHYSICAL JOURNAL
Erskine PT
(2015)
X-ray, spectroscopic and normal-mode dynamics of calexcitin: structure-function studies of a neuronal calcium-signalling protein.
in Acta crystallographica. Section D, Biological crystallography
Groves K
(2021)
Reference Protocol to Assess Analytical Performance of Higher Order Structural Analysis Measurements: Results from an Interlaboratory Comparison.
in Analytical chemistry
Lopes JL
(2013)
Folding factors and partners for the intrinsically disordered protein micro-exon gene 14 (MEG-14).
in Biophysical journal
Lopes JL
(2014)
Distinct circular dichroism spectroscopic signatures of polyproline II and unordered secondary structures: applications in secondary structure analyses.
in Protein science : a publication of the Protein Society
Description | development and curation of new bioinformatics tools this has had an enduring effect on availability and traceability and data sharing of spectroscopic data worldwide, and on new ways of analysing and validating CD data |
Exploitation Route | various tools have already been cited by more than 7000 users analysis website has enabled more than 960,000 analyses to be done. the PCDDB data bank is the only (and highly used) data sharing resource for CD spectra and meta data we have taught many hundreds of students, staff, faculty members, and industrial people in workshops we have run around the world. |
Sectors | Agriculture Food and Drink Education Manufacturing including Industrial Biotechology Pharmaceuticals and Medical Biotechnology |
URL | http://pcddb.cryst.bbk.ac.uk/ |
Description | highly used by industry and academic for analyses and data-sharing of protein circular dichroism spectra |
First Year Of Impact | 2005 |
Sector | Chemicals,Pharmaceuticals and Medical Biotechnology |
Impact Types | Economic |
Description | An International UK-Brazil Collaboration using Synchrotron Radiation Circular Dichroism Spectroscopy to Study Protein Structure and Function. |
Amount | £27,000 (GBP) |
Funding ID | BBJ0197471 |
Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
Sector | Public |
Country | United Kingdom |
Start | 06/2012 |
End | 06/2015 |
Description | BBSRC-FAPSP partnering grant |
Amount | £35,000 (GBP) |
Funding ID | BB/N012763/1 |
Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
Sector | Public |
Country | United Kingdom |
Start | 01/2016 |
End | 12/2018 |
Description | Biophysical studies of the structure/function of antimicrobial peptides and enzymes isolated from extremophile organisms |
Amount | £35,426 (GBP) |
Funding ID | BB/N012763/1 |
Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
Sector | Public |
Country | United Kingdom |
Start | 01/2016 |
End | 10/2018 |
Description | Brazil Senior Visiting Fellowship |
Amount | R$ 115,240 (BRL) |
Organisation | National Council for Scientific and Technological Development (CNPq) |
Sector | Public |
Country | Brazil |
Start | 03/2014 |
End | 02/2016 |
Description | Collaborative Sino-UK Synchrotron Radiation Circular Dichroism Spectroscopy Studies and International SRCD Workshop |
Amount | £24,800 (GBP) |
Funding ID | 1289 |
Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
Sector | Public |
Country | United Kingdom |
Start | 06/2007 |
End | 07/2011 |
Description | International Exchange |
Amount | £3,000 (GBP) |
Organisation | The Royal Society |
Sector | Charity/Non Profit |
Country | United Kingdom |
Start | 02/2012 |
End | 05/2012 |
Description | JCAMPX-CD Standardisation of Formats for CD and SRCD spectroscopy |
Amount | £9,600 (GBP) |
Funding ID | 2010-033-2-024 |
Organisation | International Union of Pure and Applied Chemistry (IUPAC) |
Sector | Charity/Non Profit |
Country | United States |
Start | 02/2010 |
End | 03/2012 |
Description | Science without Borders Visiting Researcher |
Amount | R$ 11,524,000 (BRL) |
Organisation | National Council for Scientific and Technological Development (CNPq) |
Sector | Public |
Country | Brazil |
Start | 03/2014 |
End | 02/2016 |
Description | Science without Borders psotdoctoral fellowships (3) |
Amount | R$ 1 (BRL) |
Organisation | National Council for Scientific and Technological Development (CNPq) |
Sector | Public |
Country | Brazil |
Start | 08/2012 |
End | 01/2016 |
Description | UK-China Partnering Grant |
Amount | £11,400 (GBP) |
Funding ID | IE121428 |
Organisation | The Royal Society |
Sector | Charity/Non Profit |
Country | United Kingdom |
Start | 03/2013 |
End | 03/2015 |
Title | Dichroweb analysis website |
Description | software highly used by academic and industrial labs |
Type Of Material | Improvements to research infrastructure |
Provided To Others? | Yes |
Impact | high usage by academic labs, universities for teaching, and industrial labs |
URL | http://dichroweb.cryst.bbk.ac.uk/html/home.shtml |
Title | PCDDB |
Description | data bank for validated circular dichroism spectra |
Type Of Material | Improvements to research infrastructure |
Year Produced | 2014 |
Provided To Others? | Yes |
Impact | more than 500,000 downloads already |
URL | http://pcddb.cryst.bbk.ac.uk |
Title | Validichro |
Description | server for validation of circular dichroism spectra |
Type Of Material | Improvements to research infrastructure |
Year Produced | 2013 |
Provided To Others? | Yes |
Impact | used thousands of times by outside users |
URL | http://pcddb.cryst.bbk.ac.uk/home.php |
Title | new software tools for CD analyses |
Description | software and websites and you tube videos about new infrastructure tools developed for CD spectroscopy and structural biology |
Type Of Material | Improvements to research infrastructure |
Year Produced | 2018 |
Provided To Others? | Yes |
Impact | ongoing development of many new tools for bioinformatics and structural biology resources widespread usage of tools both in the UK and internationally |
Title | oriented circular dichroism spectroscopy - methods development and software analysis tools |
Description | development of methodology and software analysis tools for oriented CD spectroscopy |
Type Of Material | Improvements to research infrastructure |
Year Produced | 2017 |
Provided To Others? | Yes |
Impact | other research groups interested in/using our Anglerfish server |
URL | http://anglerfish.cryst.bbk.ac.uk/ |
Title | Protein Circular Dichroism Data Bank |
Description | user deposition of spectral and meta data obtained circular dichroism of proteins |
Type Of Material | Database/Collection of data |
Year Produced | 2016 |
Provided To Others? | Yes |
Impact | more than 2M downloads of contents used by industry and academia led to the development of new methodologies by us and others |
Title | The MSP180 Reference Data Set |
Description | reference data set for the analysis of membrane protein circular dichroism spectra |
Type Of Material | Database/Collection of data |
Year Produced | 2011 |
Provided To Others? | No |
Impact | No actual impacts realised to date |
URL | http://pcddb.cryst.bbk.ac.uk/home.php |
Title | The SP175 reference Data Set |
Description | Data Set of Reference CD spectra of proteins deposited in the Protein Circular Dichroism Data Bank. protein circular dichoism data bank http://pcddb.cryst.bbk.ac.uk/home.php |
Type Of Material | Database/Collection of data |
Year Produced | 2009 |
Provided To Others? | No |
Impact | No actual impacts realised to date |
URL | http://pcddb.cryst.bbk.ac.uk/home.php |
Title | curartion and development of data base of circular dichroism spectroscopy and links to other bioinformatics resources |
Description | updated and expanded database and cross-referencing with other data bases such as PDB, Uniprot |
Type Of Material | Database/Collection of data |
Year Produced | 2018 |
Provided To Others? | Yes |
Impact | MANY downloads ((>1,000,000 files) by many research groups around the world |
Description | Collaboration on Intrinsically disordered proteins |
Organisation | Institute of Cancer Research UK |
Country | United Kingdom |
Sector | Academic/University |
PI Contribution | Biophysical Studies on intrinsically disordered proteins Circular Dichroism Spectroscopy of intrinsically disordered protein |
Collaborator Contribution | Bioinformatics of Intrinsically disordered proteins |
Impact | Publication in F1000REs 2019 |
Start Year | 2018 |
Description | Collaboration with Brookhaven National Lab (Dr. J.Sutherland) |
Organisation | Brookhaven National Laboratory |
Department | National Synchrotron Light Source |
Country | United States |
Sector | Public |
PI Contribution | Enabled us to access the Brookhaven SRCD beamline when there was none in the UK |
Collaborator Contribution | provided access to instrumentation |
Impact | students and postdocs had access to SRCD beamline when there was none in the UK. the Brookhaven beamline has now been shut down and our collaborator has retired |
Start Year | 2009 |
Description | Collaboration with University of Sao Paulo |
Organisation | Universidade de São Paulo |
Department | Physics Institute of São Carlos |
Country | Brazil |
Sector | Academic/University |
PI Contribution | i have been involved in helping my brazilian collaborators prepare a proposal for an SRCD beamline at the Brazilian synchrotron |
Collaborator Contribution | we have worked together on the proposal and planning |
Impact | several publications, a long term partnership leading to several BBSRC partnering grants, development of a new SRCD beamline at the Brazilian synchrotron, submission and funding of a new BBSRC partnering award |
Start Year | 2012 |
Description | collaboration with Janes Lab (Queen Mary University of London) |
Organisation | Queen Mary University of London |
Department | School of Biological and Chemical Science QMUL |
Country | United Kingdom |
Sector | Academic/University |
PI Contribution | a wide range of common outcomes, including publications, workshops, international talks |
Collaborator Contribution | a wide range of common outcomes, including publications, workshops, international talks |
Impact | publications, outreach, software, video (You Tube) instruction mannuals, teaching at international workshops, presentations at international meetings |
Description | collaborations with CD users |
Organisation | Beijing Synchotron Radiation Facility |
Country | China |
Sector | Academic/University |
PI Contribution | cd data collection and analyses |
Collaborator Contribution | beamtime made available |
Impact | publications |
Start Year | 2013 |
Title | cdtoolx |
Description | open access downloadable software for processing and analysis of CD data |
Type Of Technology | Software |
Year Produced | 2018 |
Impact | many UK and international users used for teaching and research purposes |
Title | this package of tools include many new analysis websites |
Description | multiple websites for different types of analyses of proteins based on CD spectroscopic data and its relationships to other biophysical methods the outputs were realised in all years of this grant so far (from 2017-2019) but this form only allows one year, so i have chosen 2019 |
Type Of Technology | Webtool/Application |
Year Produced | 2019 |
Open Source License? | Yes |
Impact | large numbers of publications by UK and other groups world wide use and cite this software and use it for teaching purposes |
Description | Engagement with International Elixer programme |
Form Of Engagement Activity | A formal working group, expert panel or dialogue |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Study participants or study members |
Results and Impact | Elixer group on Intrinsically disordered proteins organised through £40,296 |
Year(s) Of Engagement Activity | 2019,2020 |
Description | Running training workshops for students, postdocs and industrial scientists on CD spectroscopy |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | Teaching Lectures and Practicals in multi-day workshops to provide new skills training and ideas and information for researchers both nationally and internationally |
Year(s) Of Engagement Activity | Pre-2006,2006,2007,2008,2009,2010,2011,2012,2013,2014,2015,2016,2017,2018 |
Description | School visit (Tonbridge schools science day) |
Form Of Engagement Activity | Participation in an open day or visit at my research institution |
Part Of Official Scheme? | No |
Geographic Reach | Regional |
Primary Audience | Schools |
Results and Impact | Keynote speaker and poster judge at Tonbridge Area Schools Science day |
Year(s) Of Engagement Activity | 2019 |
Description | Talks at UK and international sites and meetings: Brazil Synchrotron Symposium, ), University of Vienna Austria (Chemistry Dept), Biochemical Society Training Course (Aston University, UK) |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Talks about Tools and Resources developed in the course of this grant at Universities and International meetings, including the headline talk at the inauguration of the Brazil Synchrotron SRCD beamline (CEDRO) held for the international community of SRCD users and potential users, The Biochemical Society training course on Membrane Proteins at Aston University, the chemistry dept at the University of Vienna. |
Year(s) Of Engagement Activity | 2022 |
Description | Teaching at National and International Workshops for students and postdocs on the tools and resources we have developed |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | Training workshops on CD spectroscopy in the UK (Leeds and Birmingham), at meetings in the US (NIH Biophysics Course), at the EBSA Biophysics Workshop in France, and the IUPAB Workshop in Cuba. |
Year(s) Of Engagement Activity | 2018,2019,2020 |
Description | Teaching workshops for students, faculty and industrial scientists |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | A series of workshops on methods of analysis for CD spectroscopy given at national and international meetings and workshops, usually 2-3 days, including hands-on computing practicals. for phd and masters students, postdocs, professors and other academics and industry |
Year(s) Of Engagement Activity | 2009,2010,2011,2012,2013,2014,2015,2016,2017,2018 |
Description | annual talks for the public |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Public/other audiences |
Results and Impact | UCL Science Centre Friday Evening Discourses and Birkbeck SET Talks for the Public talks on science for the public (5th, 6th form, and public) annually and lecture/tour at the Wellcome Collection talks no actual impacts realised to date |
Year(s) Of Engagement Activity | 2006,2007,2008,2009,2010,2011,2012,2013,2014,2015,2016,2017,2018,2019,2020,2021 |
Description | international advisory boad (germany) |
Form Of Engagement Activity | A formal working group, expert panel or dialogue |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Other audiences |
Results and Impact | international advisory board (Germany) |
Year(s) Of Engagement Activity | 2011,2012,2013,2014,2015,2016,2017,2018 |