CATH-FunVar - Predicting Viral and Human Variants Affecting COVID-19 Susceptibility and Severity and Repurposing Therapeutics
Lead Research Organisation:
UNIVERSITY COLLEGE LONDON
Department Name: Structural Molecular Biology
Abstract
SARS-CoV-2 has caused a pandemic resulting in millions of deaths worldwide and significant social and economic disruption. Although vaccine trials have been encouraging vaccines must be distributed globally and therapeutic interventions will be needed for some time. It is clear that some human populations are much more vulnerable to the disease. For example older men and black and Asian communities. The factors causing these differences are still unclear and whilst social, economic and cultural issues are likely to be important, genetic factors could also play a role. Furthermore, the biological mechanisms by which severe responses arise and increase morbidity are still not known.
In this project we will analyse genetic variations (causing reside mutations in the proteins) in diverse human populations (e.g. gender, ethnicity, people with severe responses) and in SARS-CoV-2. We will use structural and evolutionary data to determine whether the mutations could affect binding between the virus and human proteins. Human proteins in which mutations do affect binding will be mapped to protein networks to identify biological pathways that could be affected. We have powerful tools for functionally annotating proteins and the pathway modules in which they operate. Our data will rationalise the impacts on disease severity and improve diagnostics for populations at risk. Finally, proteins in these pathways are likely to be effective drug targets and we will use our protein family data to identify or repurpose suitable drugs having low side effects.
We will also analyse related coronaviruses to identify future risks.
We have already established a website (https://funvar.cathdb.info/uniprot/dataset/covid) providing mapping of SARS-CoV-2 viral proteins, functional annotations and proximity of mutations to known/predicted functional sites. This is currently populated with preliminary pilot data. It will be extended to host interactors and provide information on pathways and repurposed drugs.
Research Plan
We will:
(a) Classify 'human interactor' proteins interacting with viral proteins into CATH-FunFams to extract known or predicted structures and map variants (residue mutations) from different genders and populations onto these structures.
(b) Perform FunVar analyses to identify mutations in human interactor and SARS-CoV-2 proteins likely to have functional impacts.
(c) Map human interactors to a protein network to highlight biological processes implicated in host response and differentially affected between different genders/ethnicities
(d) Identify human interactors which have clinically approved drugs or which map to FunFams from which clinically approved drugs can be repurposed.
(e) Disseminate information via FunVar-COVID19 pages
Our pipeline will detect diverse variants in different human populations, likely to be impacting functions and affecting Covid-19 response. It will also analyse available drug data to suggest possible therapeutics. Furthermore, our pipeline will be generic and will also be used to analyse other closely related coronavirus genomes that could pose future risks.
In this project we will analyse genetic variations (causing reside mutations in the proteins) in diverse human populations (e.g. gender, ethnicity, people with severe responses) and in SARS-CoV-2. We will use structural and evolutionary data to determine whether the mutations could affect binding between the virus and human proteins. Human proteins in which mutations do affect binding will be mapped to protein networks to identify biological pathways that could be affected. We have powerful tools for functionally annotating proteins and the pathway modules in which they operate. Our data will rationalise the impacts on disease severity and improve diagnostics for populations at risk. Finally, proteins in these pathways are likely to be effective drug targets and we will use our protein family data to identify or repurpose suitable drugs having low side effects.
We will also analyse related coronaviruses to identify future risks.
We have already established a website (https://funvar.cathdb.info/uniprot/dataset/covid) providing mapping of SARS-CoV-2 viral proteins, functional annotations and proximity of mutations to known/predicted functional sites. This is currently populated with preliminary pilot data. It will be extended to host interactors and provide information on pathways and repurposed drugs.
Research Plan
We will:
(a) Classify 'human interactor' proteins interacting with viral proteins into CATH-FunFams to extract known or predicted structures and map variants (residue mutations) from different genders and populations onto these structures.
(b) Perform FunVar analyses to identify mutations in human interactor and SARS-CoV-2 proteins likely to have functional impacts.
(c) Map human interactors to a protein network to highlight biological processes implicated in host response and differentially affected between different genders/ethnicities
(d) Identify human interactors which have clinically approved drugs or which map to FunFams from which clinically approved drugs can be repurposed.
(e) Disseminate information via FunVar-COVID19 pages
Our pipeline will detect diverse variants in different human populations, likely to be impacting functions and affecting Covid-19 response. It will also analyse available drug data to suggest possible therapeutics. Furthermore, our pipeline will be generic and will also be used to analyse other closely related coronavirus genomes that could pose future risks.
Organisations
People |
ORCID iD |
Christine Orengo (Principal Investigator) |
Publications

Lam SD
(2022)
Structural and energetic analyses of SARS-CoV-2 N-terminal domain characterise sugar binding pockets and suggest putative impacts of variants on COVID-19 transmission.
in Computational and structural biotechnology journal


Waman VP
(2024)
Predicting human and viral protein variants affecting COVID-19 susceptibility and repurposing therapeutics.
in Scientific reports
Description | We have examined whether genetic variations in the SARS-CoV-2 protein or the human protein with which it interacts have an impact on the interaction and could therefore affect transmission of the virus or severity of the disease. We analysed variation in the N-terminal domain NTD of SARS-CoV-2 and all VOCs and its potential impact on COVID-19 transmission using computational approaches. This work is published (https://doi.org/10.1016/j.csbj.2022.11.004). The work provides insights into impact of mutations/insertions in NTD on sialic acid binding. We proposed the presence of sugar-binding pockets in NTD and several mutations in NTD are observed to increase binding to sugars and therefore could aid COVID-19 transmission. This work was presented at a couple of conferences in Malaysia (see outreach section for details). We have also analysed a number of key human-virus complexes associated with the immune response. We have identified a number of genetic variations in the human proteins interacting with viral proteins in these complexes, with the potential to have an impact on transmission / disease severity. We applied computational protocols to investigate the structural impact of coding variants in human proteins interacting with SARS-CoV-2, and their frequency in different ethnic groups. We analysed 3D-structures of protein complexes and evolutionary family data to analyse whether these variants could enhance binding affinity to SARS-CoV-2 proteins and could thereby trigger suppression of the immune system. Some variants in immunity-associated proteins such as TOM70, ISG15, TRIM25, IFIT2, IFIH1 and NUP98, are predicted to increase binding affinity to their interacting SARS-CoV-2 proteins. In conclusion, our study thus sheds light on affinity-enhancing variants in immunity-associated proteins and their impact on SARS-CoV-2-human protein complexes and COVID-19 susceptibility. Affinity-enhancing variants proposed in this study, along with their correlation with functional sites in 3D, aid in explaining potential mechanisms associated with suppression of normal functioning of immune proteins. These finding are made available as preprint on biorxiv (https://www.biorxiv.org/content/10.1101/2023.11.07.566012v1) and under revision for journal Nature Scientific Reports. This work was selected for Oral presentation and presented during the ISMB 2023 conference, Lyon, France (Please see:https://www.youtube.com/watch?v=rNHUpCtiFDI) |
Exploitation Route | The information on variants in both SARS-CoV-2 and immunity-associated human proteins will be important both for diagnostics, eg identifying human populations more susceptible to infection and disease severity. It may also be valuable for designing therapeutics by targeting the interactions analysed in this study. Our approach could be helpful in future studies of not only for COVID-19 but also other emerging infectious diseases. The workflow we have established, FunVar, will be made available for use by other researchers once we have tested it further. |
Sectors | Healthcare Pharmaceuticals and Medical Biotechnology |
URL | https://www.biorxiv.org/content/10.1101/2023.11.07.566012v1 |
Description | We analysed impact of insertions in N-terminal domain on sugar/ACE2 binding and and potential impact on COVID-19 transmission. This work has been cited by several other groups. These citations indicate that our work provides foundation and clues for further studies on evolution and emergence of Omicron lineages. For example, we report an insertion at 214 position in N-terminal domain and its association with increased sialic acid binding in human host. A subsequent study (10.3390/vaccines10091509) noted that this insertion is a unique defining feature of the Omicron variant, BA.1 (not other prior lineages). The same group further attempted to investigate possible mechanisms of evolutionary origin of Omicron. They proposed that template Switching is likely mechanism for the Origin of ins214EPE in Omicron. |
First Year Of Impact | 2021 |
Sector | Education,Healthcare |
Title | Dataset of 3D-complexes of SARS-CoV-2:Human proteins |
Description | The dataset of human: SARS-CoV-2 protein complexes used for the study. |
Type Of Material | Database/Collection of data |
Year Produced | 2023 |
Provided To Others? | Yes |
Impact | The dataset provides access to experimental 3D structures for 10 interactions available in the PDB, as well as access to 9 high-quality models predicted using AlphaFold2-multimer/ptm method, in this study. The dataset of predicted complexes provides high-confidence models based on overall pLDDT (predicted local difference distance test) > 70 as well as pTM-Score (predicted TM-score) > 70. We used these complexes to analyse the impact of missense variants occurring in the protein-protein interface region. We analysed 468 missense variants from human proteins and 212 variants from SARS-CoV-2 proteins and computationally predicted their impacts on binding affinities to SARS-CoV-2 proteins, using 3D-complexes. We predicted a total of 26 affinity-enhancing variants from 13 human proteins implicated in increased binding affinity to SARS-CoV-2. These include key-immunity associated genes (TOMM70, ISG15, IFIH1, IFIT2, RPS3, PALS1, NUP98, AXL, ARF6, TRIMM, TRIM25) as well as important spike receptors (KREMEN1, AXL and ACE2). We report both common (e.g., Y13N in IFIH1) and rare variants in these proteins and discuss their likely structural and functional impact, using information on known and predicted functional sites. The protocol designed in this study could be extended to analyse other protein interactions as more structures are experimentally determined and more powerful tools for protein structure prediction emerge. Our approach could be helpful in future studies of not only for COVID-19 but also other emerging infectious diseases. |
URL | https://zenodo.org/doi/10.5281/zenodo.10090696 |
Title | Dataset of 3D-complexes of SARS-CoV-2:Human proteins |
Description | The dataset of human: SARS-CoV-2 protein complexes used for the study. |
Type Of Material | Database/Collection of data |
Year Produced | 2023 |
Provided To Others? | Yes |
Impact | The dataset provides access to experimental 3D structures for 10 interactions available in the PDB, as well as access to 9 high-quality models predicted using AlphaFold2-multimer/ptm method, in this study. The dataset of predicted complexes provides high-confidence models based on overall pLDDT (predicted local difference distance test) > 70 as well as pTM-Score (predicted TM-score) > 70. We used these complexes to analyse the impact of missense variants occurring in the protein-protein interface region. We analysed 468 missense variants from human proteins and 212 variants from SARS-CoV-2 proteins and computationally predicted their impacts on binding affinities to SARS-CoV-2 proteins, using 3D-complexes. We predicted a total of 26 affinity-enhancing variants from 13 human proteins implicated in increased binding affinity to SARS-CoV-2. These include key-immunity associated genes (TOMM70, ISG15, IFIH1, IFIT2, RPS3, PALS1, NUP98, AXL, ARF6, TRIMM, TRIM25) as well as important spike receptors (KREMEN1, AXL and ACE2). We report both common (e.g., Y13N in IFIH1) and rare variants in these proteins and discuss their likely structural and functional impact, using information on known and predicted functional sites. The protocol designed in this study could be extended to analyse other protein interactions as more structures are experimentally determined and more powerful tools for protein structure prediction emerge. Our approach could be helpful in future studies of not only for COVID-19 but also other emerging infectious diseases. |
URL | https://zenodo.org/doi/10.5281/zenodo.10090695 |
Description | An oral presentation at ISMB/ECCB 2023, Lyon |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | The work was presented at Variant Interpretation track, at ISMB conference, 2023. About 50 participants attended this talk in-person/online. The participants were from diverse research backgrounds such as genetics, genomics and structural bioinformatics. Our study thus sheds light on affinity-enhancing variants in immunity-associated proteins and their impact on SARS-CoV-2-human protein complexes and COVID-19 susceptibility, suing AlphaFold2-multimer. This sparked interesting questions about application of computational approaches to understand SARS-CoV-2 interactions and its applicability for other pathogens. |
Year(s) Of Engagement Activity | 2023 |
URL | https://www.youtube.com/watch?v=rNHUpCtiFDI |
Description | Physical and Integrated Sciences Conference 2022, Bangi, Malaysia - 6-7 December 2022 (Invited Speaker) |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | Local |
Primary Audience | Postgraduate students |
Results and Impact | The talk was presented in Physical and Integrated Sciences Conference 2022, Bangi, Malaysia - 6-7 December 2022 (Invited Speaker) . |
Year(s) Of Engagement Activity | 2022 |