CATH-FunVar - Predicting Viral and Human Variants Affecting COVID-19 Susceptibility and Severity and Repurposing Therapeutics

Lead Research Organisation: University College London
Department Name: Structural Molecular Biology

Abstract

SARS-CoV-2 has caused a pandemic resulting in millions of deaths worldwide and significant social and economic disruption. Although vaccine trials have been encouraging vaccines must be distributed globally and therapeutic interventions will be needed for some time. It is clear that some human populations are much more vulnerable to the disease. For example older men and black and Asian communities. The factors causing these differences are still unclear and whilst social, economic and cultural issues are likely to be important, genetic factors could also play a role. Furthermore, the biological mechanisms by which severe responses arise and increase morbidity are still not known.

In this project we will analyse genetic variations (causing reside mutations in the proteins) in diverse human populations (e.g. gender, ethnicity, people with severe responses) and in SARS-CoV-2. We will use structural and evolutionary data to determine whether the mutations could affect binding between the virus and human proteins. Human proteins in which mutations do affect binding will be mapped to protein networks to identify biological pathways that could be affected. We have powerful tools for functionally annotating proteins and the pathway modules in which they operate. Our data will rationalise the impacts on disease severity and improve diagnostics for populations at risk. Finally, proteins in these pathways are likely to be effective drug targets and we will use our protein family data to identify or repurpose suitable drugs having low side effects.
We will also analyse related coronaviruses to identify future risks.

We have already established a website (https://funvar.cathdb.info/uniprot/dataset/covid) providing mapping of SARS-CoV-2 viral proteins, functional annotations and proximity of mutations to known/predicted functional sites. This is currently populated with preliminary pilot data. It will be extended to host interactors and provide information on pathways and repurposed drugs.

Research Plan
We will:
(a) Classify 'human interactor' proteins interacting with viral proteins into CATH-FunFams to extract known or predicted structures and map variants (residue mutations) from different genders and populations onto these structures.
(b) Perform FunVar analyses to identify mutations in human interactor and SARS-CoV-2 proteins likely to have functional impacts.
(c) Map human interactors to a protein network to highlight biological processes implicated in host response and differentially affected between different genders/ethnicities
(d) Identify human interactors which have clinically approved drugs or which map to FunFams from which clinically approved drugs can be repurposed.
(e) Disseminate information via FunVar-COVID19 pages

Our pipeline will detect diverse variants in different human populations, likely to be impacting functions and affecting Covid-19 response. It will also analyse available drug data to suggest possible therapeutics. Furthermore, our pipeline will be generic and will also be used to analyse other closely related coronavirus genomes that could pose future risks.

Publications

10 25 50
 
Description We have examined whether genetic variations in the SARS-CoV-2 protein or the human protein with which it interacts have an impact on the interaction and could therefore affect transmission of the virus or severity of the disease.

We analysed variation in the N-terminal domain NTD of SARS-CoV-2 and all VOCs and its potential impact on COVID-19 transmission using computational approaches. This work is published (https://doi.org/10.1016/j.csbj.2022.11.004). The work provides insights into impact of mutations/insertions in NTD on sialic acid binding. We proposed the presence of sugar-binding pockets in NTD and several mutations in NTD are observed to increase binding to sugars and therefore could aid COVID-19 transmission. This work was presented at a couple of conferences in Malaysia (see outreach section for details).

We have also analysed a number of key human-virus complexes associated with the immune response. We have identified a number of genetic variations in the human proteins interacting with viral proteins in these complexes, with the potential to have an impact on transmission / disease severity. We applied computational protocols to investigate the structural impact of coding variants in human proteins interacting with SARS-CoV-2, and their frequency in different ethnic groups. We analysed 3D-structures of protein complexes and evolutionary family data to analyse whether these variants could enhance binding affinity to SARS-CoV-2 proteins and could thereby trigger suppression of the immune system. Some variants in immunity-associated proteins such as TOM70, ISG15, TRIM25, IFIT2, IFIH1 and NUP98, are predicted to increase binding affinity to their interacting SARS-CoV-2 proteins. We have drafted a manuscript to submit to a journal. This work will also be presented during the upcoming ISMB 2023 conference.

We have also built a workflow for automatically analysing the impacts of further variants in SARS-CoV2 and other viral proteins.
Exploitation Route The information on variants in both SARS-CoV-2 and immunity-associated human proteins will be important both for diagnostics, eg identifying human populations more susceptible to infection and disease severity. It may also be valuable for designing therapeutics by targeting the interactions analysed in this study.

The workflow we have established, FunVar, will be made available for use by other researchers once we have tested it further.
Sectors Healthcare,Pharmaceuticals and Medical Biotechnology

URL https://doi.org/10.1016/j.csbj.2022.11.004
 
Description We analysed impact of insertions in N-terminal domain on sugar/ACE2 binding and and potential impact on COVID-19 transmission. This work has been cited by several other groups. These citations indicate that our work provides foundation and clues for further studies on evolution and emergence of Omicron lineages. For example, we report an insertion at 214 position in N-terminal domain and its association with increased sialic acid binding in human host. A subsequent study (10.3390/vaccines10091509) noted that this insertion is a unique defining feature of the Omicron variant, BA.1 (not other prior lineages). The same group further attempted to investigate possible mechanisms of evolutionary origin of Omicron. They proposed that template Switching is likely mechanism for the Origin of ins214EPE in Omicron.
First Year Of Impact 2021
Sector Education,Healthcare
 
Description Physical and Integrated Sciences Conference 2022, Bangi, Malaysia - 6-7 December 2022 (Invited Speaker) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Postgraduate students
Results and Impact The talk was presented in Physical and Integrated Sciences Conference 2022, Bangi, Malaysia - 6-7 December 2022 (Invited Speaker) .
Year(s) Of Engagement Activity 2022