Expanding the knowledge of structures and functional information through the SIFTS resource

Lead Research Organisation: European Bioinformatics Institute
Department Name: Protein Data Bank in Europe

Abstract

Over the last decade we have seen a rapid increase in the amount and diversity of biological data. At the beginning of this process, the challenge before the scientific community was to create the infrastructure necessary to collect, manage and make these data available efficiently to the research community. This has transformed life-science research into a data-driven scientific field. But the scientific community very quickly realised that, apart from having these data available, the real challenge is to add significantly to the biological context of these data and make this knowledge available to researchers. This is especially true for the increasing amount of data on three-dimensional structures of macromolecules. Macromolecular structure data can provide great insights into the functional mechanisms of macromolecules. By integrating them with other biological data, a better understanding of life and disease processes can be derived, leading to better intervention strategies through the design of new drug molecules. Macromolecular structure data can also be used to predict the effects of genetic variation, found naturally in the population, on the function of macromolecules, again leading to a better understanding of genetic diseases. Providing biological context to the increasing amount of macromolecular structure data is therefore critical if we want to exploit these data and add value to the increasing amount of genomic and proteomic information. The SIFTS resource links macromolecular structure data (archived in two publicly available databases, PDB and EMDB) to their biological context by integrating annotations from different biological databases, mainly through links to UniProt, a publicly available database of protein sequences that is at the forefront of protein annotation. This resource was established in 2002 and has evolved over the years by integrating an increasing number of protein-related annotations from different databases.
Before the SIFTS resource was established, every major biological data resource or research laboratory had to establish processes and complex infrastructure to derive the information needed to link macromolecular structure data to other databases. With rapid advances in sequencing technology, an increasing amount of variation and isoform information is now becoming available. It is critical that the SIFTS resource is extended to map these variants and isoforms onto macromolecular structures and to make the mappings freely available for the benefit of the life-science research community. This will require the SIFTS resource to update its processes and infrastructure to include genomic and variation information for the first time. These data and the extended annotations for related uncharacterised sequences will be useful for developing methodologies for predicting structure-function relationships. These considerations and user requests have contributed to the proposed developments. The main objectives of the proposed project are:
1. Enhance the annotations available in the SIFTS resource to include genomic and variation information.
2. Increase coverage of protein sequence space by including isoforms, variants and related uncharacterised sequences.
3. Implement a mechanism to provide sequence annotations specific to isoforms and variation in the UniProtKB database.
4. Develop the necessary infrastructure to include value-added structure-based annotations on ligand binding sites and assembly interface residues.
5. Consolidate the software processes and the database infrastructure for long-term sustainability.

Technical Summary

The work proposed in this application will enhance the biological functionality and relevance of the SIFTS resource by extending the value-added annotations to span data from genomes to systems biology. We also plan to carry out the work necessary to ensure long-term robustness and sustainability by consolidating the existing processes and infrastructure.
Currently, SIFTS has allowed the enrichment of ~33K UniProtKB protein sequences with structure information through the mapping of PDB sequences onto UniProtKB sequence entries based on sequence identity and organism information. We plan to improve the provision and exploitation of structure information by extending the mapping procedures to include protein isoforms and variants, along with sequences with high residue identity (based on the UniRef90 set). The UniProt database infrastructure will be updated to provide specific sequence annotations for particular isoforms and variants, which are currently in free-text format.
Other major enhancements include the incorporation of genomic information, including variation data based on unique protein mapping (isoforms/variants), and linking it to Ensembl and ENA identifiers. We also plan to map the residue-level annotation for the manually curated ligand binding site information available in UniProtKB to the PDB structures and vice versa. Additionally, we will develop the infrastructure to provide information on interface residues for all assemblies annotated in the PDB. We will evaluate ways of including this information in UniProtKB entries. PDBe will also provide PubMed identifiers based on text mining of full-text open publications in collaboration with the Europe PMC team at EMBL-EBI. The procedure for mapping Pfam annotations to PDB structures will be replaced with one that uses the HMMER server, allowing more up-to-date Pfam cross-reference information for all PDB structures. We will also update the data export mechanism and API to make the enhanced annotations available to our users.
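
As a rough illustration of the residue-level mapping described above, the sketch below carries positions between a PDB chain and a UniProtKB sequence in both directions. It is a minimal sketch under stated assumptions, not the SIFTS implementation: the `Segment` class, its field names, the function names and the example numbers are all hypothetical, and positions are taken to be 1-based indices into each sequence (real PDB author numbering, with insertion codes, is more involved).

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional


@dataclass(frozen=True)
class Segment:
    """One contiguous mapped stretch between a PDB chain and a UniProtKB sequence.

    Hypothetical structure for illustration; all positions are 1-based, inclusive.
    """
    pdb_start: int
    pdb_end: int
    unp_start: int
    unp_end: int


def pdb_to_uniprot(segments: List[Segment], pdb_pos: int) -> Optional[int]:
    """Return the UniProtKB position for a PDB position, or None if unmapped."""
    for seg in segments:
        if seg.pdb_start <= pdb_pos <= seg.pdb_end:
            # Within a segment the two numberings differ by a constant offset.
            return seg.unp_start + (pdb_pos - seg.pdb_start)
    return None


def uniprot_to_pdb(segments: List[Segment], unp_pos: int) -> Optional[int]:
    """Return the PDB position for a UniProtKB position, or None if unmapped."""
    for seg in segments:
        if seg.unp_start <= unp_pos <= seg.unp_end:
            return seg.pdb_start + (unp_pos - seg.unp_start)
    return None


def transfer_sites(segments: List[Segment], unp_sites: Iterable[int]) -> Dict[int, int]:
    """Map annotated UniProtKB residues (e.g. binding sites) onto the PDB chain.

    Residues that fall outside every mapped segment are left out of the result.
    """
    mapped: Dict[int, int] = {}
    for pos in unp_sites:
        pdb_pos = uniprot_to_pdb(segments, pos)
        if pdb_pos is not None:
            mapped[pos] = pdb_pos
    return mapped


if __name__ == "__main__":
    # Made-up example: two mapped stretches with a gap in UniProtKB coverage.
    example = [Segment(1, 100, 25, 124), Segment(101, 150, 200, 249)]
    print(transfer_sites(example, [30, 150, 210]))
```

In this sketch a UniProtKB residue with no structural coverage simply drops out of the result, which is one possible way to handle regions that are not observed in the structure.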

Planned Impact

To remain at the forefront in data-driven life-science research, it is critical that researchers are able to take advantage of the diversity of datasets available to them. Integrating the wide range of knowledge in this diversity of datasets and providing ways to deliver them in a harmonized and timely manner are fundamental to achieving this goal. Over the last decade the challenge for the bioinformatics community has been to find ways to reliably integrate diverse datasets and to find robust ways to transfer value-added annotations from one domain to another to help the knowledge economy. SIFTS is one such resource that, at its core, focuses on integration of structure and sequence based annotations by mapping sequences from macromolecular structure data in the PDB database to sequence information in the UniProt Knowledgebase (UniProtKB). The value of this data is evident from its wide use mainly by the major bioinformatics databases such as RCSB PDB, PDBj, SCOP, CATH, CREDO, PSI-SBKB, PISCES and ProtCID. The data is also central to the EBI strategy for integrating macromolecular structure information in biological context and is used by all EBI resources (Ensembl, UniProt, PDBe, PDBsum, Pfam, InterPro, IntAct, ChEMBL, ChEBI and Reactome) for integrating structure data. SIFTS is also central to the PDB annotation policies, in providing up-to-date mapping between two widely used scientific resources, PDB and UniProtKB.

SIFTS is the only resource of structural data that is updated weekly with each PDB release and provides a reliable and robust mechanism for other databases and individual researchers to obtain up-to-date mapping information. This has resulted in an efficient mechanism that spares each database and researcher the duplicated effort of establishing a similar process. The impact lies in exploiting the data management expertise in PDB and UniProt, achieving efficiency and letting other databases and researchers concentrate on their area of interest while deriving maximum benefit from structural data. To maximise the use of SIFTS, we provide users with data in various formats and services, including XML, comma- and tab-delimited files, DAS servers and the PDBe and UniProt websites. We plan to provide our users with new and extended REST APIs in SIFTS and UniProt for easy programmatic access to these data.

SIFTS data have been widely used by a variety of users, from genome scientists to systems biologists and from structural bioinformaticians to drug-design communities. The planned enhancements of the resource will extend the benefits for existing users and will engage new users with an interest in predicting structure-function relationships. They will also help researchers to develop methods to explain the effects of variants on protein structure and function, leading to a better understanding of life processes. SIFTS data are essential in deriving maximum benefit from macromolecular structure information in the genomic and proteomic context. Some of this research will lead to the design of drug molecules or to a better understanding of genetic diseases, contributing to better health, and/or to the design of efficient enzymes to help industrial processes, directly contributing to the UK economy. Such a combination of skills, in software development and experience with life-science data, is critical if the UK is to remain competitive in the age of the knowledge economy and data-driven biology.

Apart from the economic benefits, the proposed work will contribute directly to the professional development of staff. The software developers named on the proposal are experienced software engineers with many years of experience in database and software development. They will benefit from experience in handling diverse biological datasets and in developing methods to integrate such diverse data.

Publications

Young JY (2018) Worldwide Protein Data Bank biocuration supporting open access to high-quality 3D structural biology data. in Database : the journal of biological databases and curation

Velankar S (2021) The Protein Data Bank Archive. in Methods in molecular biology (Clifton, N.J.)

UniProt Consortium (2021) UniProt: the universal protein knowledgebase in 2021. in Nucleic acids research

UniProt Consortium (2019) UniProt: a worldwide hub of protein knowledge. in Nucleic acids research

The UniProt Consortium (2017) UniProt: the universal protein knowledgebase. in Nucleic acids research

The Gene Ontology Consortium (2019) The Gene Ontology Resource: 20 years and still GOing strong. in Nucleic acids research

 
Description The key deliverable of this project is the expansion of the mappings between the biomolecular structure data held in the PDB and the protein sequence information available in UniProtKB. SIFTS now provides mappings to isoforms of a given canonical UniProtKB entry and annotates the best mapping among them. In addition to the primary mapping, SIFTS now provides mappings to a conservative set of similar UniProtKB sequences, the so-called UniRef90 set, i.e. sequences in UniProtKB that are at least 90% identical to the primary mapping. This latter enhancement makes structure data from the PDB applicable to approximately 2 million UniProtKB accessions, while the primary mappings amount to just over 46,000 accessions. SIFTS now also provides additional cross-references to literature and genomic data. All SIFTS data are accessible as file downloads and programmatically via the PDBe API.
Exploitation Route SIFTS mappings underpin the PDBe website and are used by a large number of other bioinformatics resources. PDBe intends to continue maintaining the SIFTS infrastructure. The BBSRC-funded community-driven project - FunPDBe - aimed at enriching structure data with functional annotations makes extensive use of SIFTS to transfer applicable annotations from the structure to the sequence domain of protein bioinformatics.

SIFTS is now included in a new EMBL-EBI resource, the Protein Data Bank in Europe Knowledge Base (PDBe-KB; pdbe-kb.org), which was launched in 2018. The SIFTS data form an essential part of the PDBe-KB aggregated views of protein structures. They are also distributed by FTP and used by many data resources and research laboratories across the world.
Sectors Agriculture, Food and Drink,Digital/Communication/Information Technologies (including Software),Healthcare,Pharmaceuticals and Medical Biotechnology

URL http://pdbe.org/sifts
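
The 90% identity criterion mentioned in the description can be illustrated with a short sketch. This is not how UniRef90 clusters are built (they are computed by UniProt with dedicated clustering software); the code only shows what a percent-identity threshold means for sequences that have already been aligned. The function names, the accession labels and the toy sequences are hypothetical, and ignoring gap columns is just one of several conventions for computing identity.

```python
from typing import Dict, List


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the gap-free columns of two pre-aligned sequences.

    Assumes both strings come from the same alignment (equal length, '-' for gaps).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def similar_accessions(primary: str, candidates: Dict[str, str],
                       threshold: float = 90.0) -> List[str]:
    """Return the candidate labels whose identity to `primary` meets the threshold."""
    return [
        label
        for label, sequence in candidates.items()
        if percent_identity(primary, sequence) >= threshold
    ]


if __name__ == "__main__":
    # Toy sequences with made-up labels, all pre-aligned to the same length.
    primary = "ACDEFGHIKL"
    candidates = {
        "CAND_A": "ACDEFGHIKL",
        "CAND_B": "ACDEFGHIKV",
        "CAND_C": "ACDEFGHAAV",
    }
    print(similar_accessions(primary, candidates))
```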
 
Description PDBj - RDF 
Organisation Protein Data Bank Japan
Country Japan 
Sector Charity/Non Profit 
PI Contribution The SIFTS information is made available in XML and CSV formats. This information, with an explanation of the process, was provided to the PDBj team so that they could understand the data provided by the SIFTS resource.
Collaborator Contribution The PDBj team has expertise in semantic web technologies and generated RDF representation of SIFTS information.
Impact An RDF representation of SIFTS information which can be easily integrated into semantic integration platforms.
Start Year 2015
 
Title PDBe-KB API 
Description This API extends the PDBe API, in production from 2015. The additional endpoints include further queries based on SIFTS and FunPDBe data, allowing PDB structures and value-added annotations to be queried at the residue level.
Type Of Technology Webtool/Application 
Year Produced 2019 
Impact The availability of this extended API allowed the team to develop novel aggregated views of protein structures, to go into production in March 2019. 
URL https://www.ebi.ac.uk/pdbe/graph-api/pdbe_doc
 
Description A news item on the update of SIFTS data 
Form Of Engagement Activity A magazine, newsletter or online publication
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact A news item was published on the PDBe website. The news item was also advertised to a wider audience via mailing list, the PDBe and UniProt Twitter accounts and the PDBe Facebook account. The news item described the update to SIFTS data. There was some exchange of comments via Twitter.
Year(s) Of Engagement Activity 2017
URL http://www.ebi.ac.uk/pdbe/about/news/sift-ing-through-sequence-space-pdb
 
Description Advanced PDBe API hack-a-thon 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Three participants attended this hands-on hack-a-thon with PDBe team to understand the use of PDBe API and to add new endpoints where feasible. This was co-funded by the BioEXCEL scheme.
Year(s) Of Engagement Activity 2018
 
Description Bioinformatics Resources for Protein Biology course 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This was a workshop entitled "Bioinformatics Resources for Protein Biology course" conducted onsite with an international audience of 20 participants.
Year(s) Of Engagement Activity 2018
 
Description Booth with handouts on "PDBe-KB aggregated views of proteins" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Manned a booth and distributed handouts to participants at the RSC NMR Discussion Group meeting organised at the University of Leeds.
Year(s) Of Engagement Activity 2019
URL https://www.rsc.org/events/detail/37139/nmr-in-biophysics-and-molecular-biology
 
Description CCP4 WG2 Meeting 2023 Feb 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Gave an update to the CCP4 WG2 about the addition of SIFTS data to the mmCIF files available from PDBe.
Year(s) Of Engagement Activity 2023
 
Description CamLifeLab 
Form Of Engagement Activity Participation in an open day or visit at my research institution
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Schools
Results and Impact Distributed PDBe calendars, engaged with children, teachers and parents at the LifeLab - Science at the Cathedral (Ely).
Year(s) Of Engagement Activity 2019
URL https://www.camlifelab.co.uk/ely
 
Description Cambridge Bioinformatics Protein Structure Analysis course 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact 25 people attended this training workshop at the University of Cambridge.
Year(s) Of Engagement Activity 2018
 
Description Course on protein structure at EMBL Hamburg 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact This workshop was conducted at EMBL Hamburg and involved 20 participants.
Year(s) Of Engagement Activity 2018
 
Description Data Integration in UniProt nucleotide sequence amino acid sequence protein structure 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact This was a training talk provided as part of the multi-omics training conducted at EMBL-EBI which saw the participation of 40 members.
Year(s) Of Engagement Activity 2018
 
Description ECCB 2018 - PDBe/UniProt workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This international workshop was conducted jointly by PDBe and UniProt teams.
Year(s) Of Engagement Activity 2018
 
Description EMBL training course "Bioinformatics Resources for Protein Biology" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact EMBL-EBI training course on protein resources. Workshop on PDBe tools for understanding protein function.
This three-day workshop introduced participants to data resources and tools developed by EMBL-EBI that could help with their protein studies. Each day focused on a particular protein topic, with the aim of helping them get more from their data and also to explore publicly available data that can further support their research.
Year(s) Of Engagement Activity 2020
URL https://www.ebi.ac.uk/training/events/bioinformatics-resources-protein-biology/
 
Description EMBL training course "Mining PDBe and PDBe-KB using a graph database" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact EMBL training course titled "Advanced workshop on the PDBe graph database".

This workshop covered the use of the PDBe graph database to extract data for solving complex structural biology queries. It introduced the PDBe graph database and how to write Cypher queries to retrieve data of interest. Workshop participants were then able to use the graph database to explore data relevant to their own research with support and guidance from the development team at PDBe.
Year(s) Of Engagement Activity 2020
URL https://www.ebi.ac.uk/training/events/mining-pdbe-and-pdbe-kb-using-graph-database/
 
Description EMBL-EBI Summer School in Bioinformatics talk titled "An introduction to deep learning through functional annotation of proteins" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This talk was presented as part of the EMBL-EBI Summer School in Bioinformatics. Automatically annotating protein sequences with functional information is vital in a world where sequences are produced so fast that humans can't keep up. In this project participants learnt to explore how deep learning can be used to enrich sequences automatically.
Year(s) Of Engagement Activity 2019
URL https://www.ebi.ac.uk/training/events/2019/summer-school-bioinformatics-2
 
Description Genome3D annotations in InterPro 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Presentation on PDBe as part of the Genome3D annotations in InterPro course.
Year(s) Of Engagement Activity 2019
 
Description Indian Biophysical Society-PDBe workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact This workshop was conducted as part of the Indian Biophysical Society meeting at the Indian Institute of Science Education and Research (IISER), India.
Year(s) Of Engagement Activity 2018
 
Description Introduction to Multiomics Data Integration- Case study - Integration in public resources: UniProt 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other audiences
Results and Impact This course was aimed at biologists who have some experience of working with at least one type of omics data, and computational biologists / bioinformaticians who wish to gain a better understanding of the biological challenges when working with integrated datasets.
Year(s) Of Engagement Activity 2018
URL https://www.ebi.ac.uk/training/events/2018/introduction-multiomics-data-integration-0
 
Description Online training course on PDBe and PDBe-KB workshop in South Asia 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact PDBe and PDBe-KB workshop, introducing features of the PDBe APIs and PDBe-KB tools.
Year(s) Of Engagement Activity 2020
 
Description PDBe API webinar series "Creating complex PDBe API queries" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This webinar was part of a 6-part PDBe API webinar series, introducing different levels of programmatic access at PDBe. The series ranged from basic data retrieval and search using the PDBe API to more advanced features, including access and reuse of PDBe data visualisation components.

This webinar demonstrated how to create more complex queries by combining the PDBe search API with numerous other calls. By introducing specific case studies, we highlighted the scope of PDBe programmatic access.
Year(s) Of Engagement Activity 2020
URL https://www.ebi.ac.uk/training/events/creating-complex-pdbe-api-queries/
 
Description PDBe API webinar series "Using the PDBe graph API" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This webinar was part of a 6-part PDBe API webinar series, introducing different levels of programmatic access at PDBe. The series ranged from basic data retrieval and search using the PDBe API to more advanced features, including access and reuse of PDBe data visualisation components.

This webinar introduced the PDBe graph API, which is generated from the PDBe graph database and contains an even richer level of data than our standard API. We highlighted how this API supports our PDBe-KB aggregated views, with specific case studies that demonstrate the possibilities through this API.
Year(s) Of Engagement Activity 2020
URL https://www.ebi.ac.uk/training/events/using-pdbe-graph-api/
 
Description PDBe Knowledge Base (PDBe-KB) - infrastructure for FAIR structural and functional annotations 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Professional Practitioners
Results and Impact Talk presented during the South West Structural Biology Consortium meeting held at the University of Reading, UK.
Year(s) Of Engagement Activity 2019
 
Description PDBe Knowledge Base (PDBe-KB) - infrastructure for FAIR structural and functional annotations 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk given at the BCA Spring Meeting 2019 organised at the University of Nottingham, UK.
Year(s) Of Engagement Activity 2019
URL https://www.ebi.ac.uk/pdbe/about/events/bca-spring-meeting-2019
 
Description PDBe lunchtime byte 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact This event was part of the CCP4 Study Weekend 2019 at the University of Nottingham, where a talk was presented and calendars distributed to attendees.
Year(s) Of Engagement Activity 2019
 
Description PDBe workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact This workshop involved 20 participants at the Max F. Perutz Laboratories (MFPL) in Vienna.
Year(s) Of Engagement Activity 2018
 
Description PDBe-KB Hackathon 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact PDBe-KB was presented during a workshop/hackathon that aimed to solve specific scientific problems for the participants, using the data available in PDBe-KB.
Year(s) Of Engagement Activity 2020
 
Description PDBe-KB presentation at the EMBL Structural Biology Retreat 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact PDBe-KB was presented to an EMBL-wide audience, focused on structural biology.
Year(s) Of Engagement Activity 2020
 
Description PDBe/Uniprot API workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This workshop was conducted at the National Institute of Immunology in India and involved 50 international participants.
Year(s) Of Engagement Activity 2018
 
Description Poster at the ECCB 2018 entitled "PDBe-KB: Bringing together functional annotations related to structure" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This poster was presented as part of ECCB 2018.
Year(s) Of Engagement Activity 2018
 
Description Poster at the European Crystallographic Meeting (ECM31) entitled "PDBe tools for training" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This was a poster presented during the European Crystallographic Meeting (ECM31) in Spain which attracted ~100 visitors.
Year(s) Of Engagement Activity 2018
 
Description Poster at the European Crystallographic Meeting (ECM31) entitled "PDBe-KB: Bringing together functional annotations related to structure" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This poster was presented at the European Crystallographic Meeting (ECM31) in Spain which attracted ~100 visitors.
Year(s) Of Engagement Activity 2018
 
Description Poster entitled "Mapping Genes to Proteins in UniProtKB" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This poster was presented during the 2018 Genome Informatics Conference.
The UniProt Knowledgebase (UniProtKB) endeavours to provide the scientific community with a comprehensive catalogue of protein sequence and functional information.
To make effective use of these data for genome studies, it is essential to have accurate mapping from gene to protein sequence, and in the reverse direction from protein to gene.
Year(s) Of Engagement Activity 2018
URL https://www.ebi.ac.uk/sites/ebi.ac.uk/files/groups/uniprot/posters/2018_WGC_macdougall.pdf
 
Description Poster titled "Functional annotations in the PDBe Knowledge Base" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Poster presented during the CCPEM Spring Symposium 2019 organised at the University of Nottingham.
Year(s) Of Engagement Activity 2019
URL https://www.ebi.ac.uk/pdbe/about/events/ccp-em-spring-symposium
 
Description Poster titled "Functional annotations in the PDBe Knowledge Base" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Poster presented at the BCA Spring Meeting 2019 organised at the University of Nottingham.
Year(s) Of Engagement Activity 2019
URL https://www.ebi.ac.uk/pdbe/about/events/bca-spring-meeting-2019
 
Description Poster titled "Functional annotations in the PDBe Knowledge Base" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Poster presented at the Instruct ERIC Structural Biology Conference 2019 held at the University of Alcala, Spain.
Year(s) Of Engagement Activity 2019
URL https://www.structuralbiology.eu/biennial2019
 
Description Poster titled "PDBe-KB aggregated views of proteins" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This poster was presented by PDBe-KB at the 12th International BioCuration Conference in the UK.
Year(s) Of Engagement Activity 2019
 
Description Poster titled "PDBe-KB: Aggregated views of protein structural data for drug development". 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact This poster was presented at the UKQSAR Spring meeting 2019 at Downing College, Cambridge, an event hosted by Astex Pharmaceuticals and themed around structure-based drug discovery.
Year(s) Of Engagement Activity 2019
 
Description Presentation at Diamond light source 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact Presentation on PDBe activities, including the SIFTS resource. The presentation described the new developments at PDBe, including the web components and web-based 3D viewers that display annotations using the SIFTS API. The SIFTS resource was described as a way to obtain value-added annotation by linking sequence- and structure-based annotations from different data resources. The new query system and the search API at PDBe, which are based on BioSolr developments, were also described.
Year(s) Of Engagement Activity 2016
 
Description Presentation at NII Shonan meeting in Japan 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact The NII Shonan meeting was organised to discuss visualisation of biological information. The presentation concentrated both on the visualisation of data and on the sources of annotation information, with SIFTS data central to linking structure and sequence information. There were further inquiries from participants about the SIFTS data and the REST API that makes these data accessible. One of the working groups also discussed how to query information in the most efficient way, including some of the developments at PDBe that have come about due to the BioSolr project.
Year(s) Of Engagement Activity 2016
URL http://shonan.nii.ac.jp/shonan/blog/2015/10/30/web-%E2%80%90based-molecular-graphics/
 
Description Presentation at Unité de glycobiologie structurale et fonctionnelle, Université de Lille 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Professional Practitioners
Results and Impact The presentation entitled "PDBe - Bringing structure to biology" described PDBe developments including the SIFTS resource and the new query system. The presentation also described new developments on the REST API and planned developments for the SIFTS resource.
Year(s) Of Engagement Activity 2017
URL http://ugsf-umr-glycobiologie.univ-lille1.fr/Seminar-Friday-10th-February-Sameer-Velankar-PDBe-leade...
 
Description Protein Structure course 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact This workshop was part of the protein structure course held at the University of Durham, UK, which involved 50 participants.
Year(s) Of Engagement Activity 2018
 
Description Structural bioinformatics course 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact 26 international participants took part in this onsite workshop.
Year(s) Of Engagement Activity 2018
 
Description Talk and online training course on PDBe at Warwick University 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Undergraduate students
Results and Impact Workshop and talk for undergraduate chemists at Warwick University, focusing on the PDB and accessing protein structure data.
Year(s) Of Engagement Activity 2020
 
Description Talk and workshop on "How to find and understand PDB data, using PDBe tools" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk and workshop focused on using the PDBe website and tools presented during the Structural Bioinformatics Course.
Year(s) Of Engagement Activity 2019
URL https://www.ebi.ac.uk/training/events/2019/structural-bioinformatics-3
 
Description Talk and workshop titled "From sequence to structure" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk and workshop on using PDBe website and tools presented during the Exploring Biological Sequences course.
Year(s) Of Engagement Activity 2019
URL https://www.ebi.ac.uk/training/events/2019/exploring-biological-sequences-2
 
Description Talk and workshop titled "How to find and understand PDB data, using PDBe tools" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk and workshop on using the PDBe website and tools presented in Colombia as part of the UNU-Biolac/CABANA: Structural bioinformatics course.
Year(s) Of Engagement Activity 2019
 
Description Talk and workshop titled "Introduction to macromolecular structure at the PDBe" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Talk and workshop presented during "Bioinformatics resources for protein biology" in Iasi, Romania, as part of a wider EMBL-EBI workshop.
Year(s) Of Engagement Activity 2019
 
Description Talk at IISER (Pune) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact A presentation on PDBe, the pdbe.org website and the infrastructure behind it, including SIFTS.
Year(s) Of Engagement Activity 2017
 
Description Talk at MBU (Bengaluru) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact A presentation on PDBe, the pdbe.org website and the infrastructure behind it, including SIFTS, the API and search functionality. The talk was attended by over 50 people from the Molecular Biophysics Unit and other departments of the IISc in Bengaluru, India.
Year(s) Of Engagement Activity 2017
 
Description Talk at NII (New Delhi) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact A presentation on PDBe, the pdbe.org website and the infrastructure behind it, including SIFTS.
Year(s) Of Engagement Activity 2017
 
Description Talk at Pune University (Pune) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact A presentation on PDBe, the pdbe.org website and the infrastructure behind it, including SIFTS.
Year(s) Of Engagement Activity 2017
 
Description Webinar on "Protein Structures and their features in UniProtKB" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Annotation of proteins based on structure-based analyses is an integral component of the UniProt Knowledgebase (UniProtKB). UniProt works closely with the Protein Data Bank in Europe (PDBe) to map 3D structural entries (~100,000) accurately to the corresponding UniProtKB accessions. This webinar covered how the correspondence between protein sequences and structures is established using the expertise of scientific database curators and by developing an automatic pipeline.
Year(s) Of Engagement Activity 2018
URL https://www.ebi.ac.uk/training/events/2018/protein-structures-and-their-features-uniprotkb
 
Description Webinar on "UniProt Introduction: Navigating between genes, amino acid sequences and 3D structures" 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Starting with an amino acid sequence in a protein record, this webinar showed how to explore the links to genomic data in Ensembl, and to 3D protein structures in PDBe. It was aimed at individuals who wish to learn more about UniProt. No prior knowledge of bioinformatics was required, but an undergraduate level understanding of biology would be useful.
Year(s) Of Engagement Activity 2018
URL https://www.ebi.ac.uk/training/events/2018/uniprot-introduction-navigating-between-genes-amino-acid-...
 
Description Webinar on PDBe Graph Database 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This webinar introduced the concepts of a graph database and described how we used a graph approach for integrating the structural data of the Protein Data Bank.
Year(s) Of Engagement Activity 2020
URL https://www.ebi.ac.uk/training/events/pdbe-graph-database-neo4j-driven-integrative-knowledge-graph-s...
 
Description Webinar on Protein structures and their features in UniProtKB 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other audiences
Results and Impact Annotation of proteins based on structure-based analyses is an integral component of the UniProt Knowledgebase (UniProtKB). UniProt works closely with the Protein Data Bank in Europe (PDBe) to map 3D structural entries (~100,000) accurately to the corresponding UniProtKB accessions. This webinar covered how the correspondence between protein sequences and structures is established using the expertise of scientific database curators and by developing an automatic pipeline. The learning objectives were to be able to describe the protein structure features that can be found in UniProtKB and how protein structure information is added to UniProtKB.
Year(s) Of Engagement Activity 2018
URL https://www.ebi.ac.uk/training/online/course/protein-structures-and-their-features-uniprotkb
 
Description Webinar on Proteins API including the isoform service 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other audiences
Results and Impact Webinar demonstrating how to use the Proteins REST API, which provides UniProt data for connecting proteins and protein annotations (e.g. structures, domains, variants) to the corresponding genome coordinates. It also covered navigating through protein isoforms, gene transcripts and exon boundaries to find the right data relationships.
Year(s) Of Engagement Activity 2017
URL https://www.ebi.ac.uk/training/events/2017/connecting-proteins-genes-programmatically-uniprot
 
Description Webinar: Finding macromolecular structures more easily at PDBe 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact This webinar was conducted as part of the online training program.
Year(s) Of Engagement Activity 2018
 
Description Workshop titled "Introduction to the PDBe-KB aggregated views" 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Professional Practitioners
Results and Impact A workshop on PDBe-KB was presented at the South West Structural Biology Consortium meeting held at the University of Reading, UK.
Year(s) Of Engagement Activity 2019
URL https://research.reading.ac.uk/swsbc2019/