📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

An integrated genetic and proteomic approach to understanding cardiovascular disease aetiology

Lead Research Organisation: Imperial College London
Department Name: Dept of Medicine

Abstract

My research programme is focussed on using genomic technologies to understand the links between inflammation and cardiovascular disease. Despite advances in prevention and treatment, cardiovascular disease is the leading cause of death worldwide, highlighting the need for new therapeutic strategies. Most deaths are due to myocardial infarction or "heart attack". Heart attacks occur when fatty deposits in the wall of an artery rupture, triggering formation of a clot which blocks the blood supply to the heart. These fatty deposits build up over many years in a process called atherosclerosis. Traditionally atherosclerosis was thought to be solely due to build-up of excess fats, but over the last few decades the importance of inflammation in this process has been increasingly recognised. Inflammation usually occurs as a protective response to infection or injury, but in certain circumstances it can be harmful. A large number of drugs have been developed to treat harmful inflammation in autoimmune diseases such as rheumatoid arthritis. This raises the possibility that these or similar treatments could be effective in cardiovascular disease. Indeed, the recent CANTOS trial showed that canakinumab, a drug (originally designed for rheumatic diseases) which targets the IL1-beta protein, reduced coronary events in patients with high inflammation levels. There are multiple proteins involved in inflammation and so the challenge is identifying those that are the most promising drug targets.

I propose to address this by integrating 'high-dimensional' genetic and proteomic data. As a result of advances in technology, it now possible to simultaneously measure large numbers of proteins (the 'proteome') in the blood in large numbers of individuals. However, simply showing an association of a protein with cardiovascular disease does not necessarily indicate that the protein is a valid drug target, as correlation does not always reflect causation. To circumvent this issue, I will integrate genetic information, utilising an approach called 'Mendelian randomisation'. This method takes advantage of the randomisation of genetic variants that occurs during reproduction, providing in effect a natural randomised trial. The first step is to identify genetic variants that affect the level of a particular protein. Then, by examining whether individuals who inherit such genetic variants are at higher or lower risk of cardiovascular disease, we can establish whether that protein is likely plays a causal role in disease and thus whether it is a valid drug target.

In a complementary strand of work in collaboration with Prof. Justin Mason (Imperial College London), I am studying patients with Takayasu arteritis, a rare disease characterised by arterial inflammation, which often results in vascular narrowing, or, less commonly, dilatation (aneurysm). Cardiovascular complications are a major cause of morbidity. The prognosis in Takayasu arteritis is very variable - some patients have a benign course, while others develop progressive vascular injury. We will examine the plasma proteome to understand factors that influence this variability and to identify signatures that could be used to distinguish high-risk patients in need of stronger immunosuppressive treatment from those in whom milder, less toxic treatments would be sufficient. In addition, we anticipate that improved understanding of an extreme form of vascular inflammation should provide insights into cardiovascular disease more generally.

Technical Summary

To gain insight into cardiovascular disease aetiology, I will use Mendelian randomisation (MR). MR exploits the natural randomisation of alleles during meiosis to disentangle causation from confounding. By providing evidence that a molecular trait is causal, MR can help identify drug targets, thus reducing the high attrition rate in pharmaceutical pipelines. A major bottleneck limiting use of MR is lack of suitable genetic instruments. To address this, we will use data from INTERVAL to perform GWAS of plasma proteins. Proteins are ideal for MR since they are under proximal genetic control and can be targeted pharmacologically. We recently performed GWAS of 3000 plasma proteins measured using the SOMAscan assay in 3301 individuals. We will extend this to 10000 individuals (providing major power gains, particularly for rarer variants) using a 5000-plex SOMAscan assay. I also co-lead the SCALLOP consortium meta-analysis of GWASs of plasma proteins measured using Olink immunoassays. A second strand of work involves -omic studies in patients with vasculitis. Prof Mason (Imperial) and I have established a multi-national collaboration studying proteomic signatures in Takayasu arteritis (TA), a vasculitis of large arteries resulting in vascular stenosis or aneurysm. Some patients have aggressive disease whereas others have a more indolent course. Prospectively identifying these subtypes would allow for tailored treatments, but current measures of disease activity inadequately predict outcome. We propose to discover novel plasma biomarkers for TA activity and prognosis through a wide-angle proteomic approach coupled with machine learning. Our results will provide a basis for stratified medicine, a key theme of HDR UK, and elucidate the pathogenic pathways underlying TA (and the role of inflammation in vascular injury more braodly), thus providing a foundation for novel therapeutic strategies. Assay/infrastructure costs associated with the TA project are separately funded.

Publications

10 25 50

Related Projects

Project Reference Relationship Related To Start End Award Value
MR/S004068/1 14/02/2018 29/09/2019 £451,154
MR/S004068/2 Transfer MR/S004068/1 30/09/2019 13/02/2021 £244,664
 
Description An integrated multi-omic approach to understanding lupus pathogenesis and heterogeneity
Amount £297,512 (GBP)
Funding ID MRF-057-0003-RG-PETE-C0799 
Organisation Medical Research Council (MRC) 
Department Medical Research Foundation
Sector Charity/Non Profit
Country United Kingdom
Start 01/2020 
End 12/2022
 
Description COVID-19: Longitudinal immunological and multi-omic profiling of haemodialysis patients
Amount £588,823 (GBP)
Funding ID MR/V027638/1 
Organisation Medical Research Council (MRC) 
Sector Public
Country United Kingdom
Start 07/2020 
End 08/2021
 
Description Community Jameel and the Imperial President's Excellence Fund
Amount £144,831 (GBP)
Organisation Imperial College London 
Sector Academic/University
Country United Kingdom
Start 06/2020 
End 07/2021
 
Description Lupus Research Alliance Global Team Science Award
Amount $10,000 (USD)
Organisation Lupus Research Alliance 
Sector Charity/Non Profit
Country United States
Start 12/2020 
End 03/2021
 
Description Mendelian Randomisation of inflammation-related proteins to understand coronary heart disease aetiology
Amount £70,645 (GBP)
Funding ID MRF-042-0001-RG-PETE-C0839 
Organisation Medical Research Council (MRC) 
Department Medical Research Foundation
Sector Charity/Non Profit
Country United Kingdom
Start 08/2020 
End 09/2022
 
Title Longitudinal proteomic profiling of high-risk patients with COVID-19 reveals markers of severity and predictors of fatal disease 
Description End-stage kidney disease (ESKD) patients are at high risk of severe COVID-19. We performed dense serial blood sampling in hospitalised and non-hospitalised ESKD patients with COVID-19 (n=256 samples from 55 patients) and used Olink immunoassays to measure 436 circulating proteins. Comparison to 51 non-infected ESKD patients revealed 221 proteins differentially expressed in COVID-19, of which 69.7% replicated in an independent cohort of 46 COVID-19 patients. 203 proteins were associated with clinical severity scores, including IL6, markers of monocyte recruitment (e.g. CCL2, CCL7), neutrophil activation (e.g proteinase-3) and epithelial injury (e.g. KRT19). Random Forests machine learning identified predictors of current or future severity such as KRT19, PARP1, PADI2, CCL7, and IL1RL1 (ST2). Survival analysis with joint models revealed 69 predictors of death including IL22RA1, CCL28, and the neutrophil-derived chemotaxin AZU1 (Azurocidin). Finally, longitudinal modelling with linear mixed models uncovered 32 proteins that display different temporal profiles in severe versus non-severe disease, including integrins and adhesion molecules. Our findings point to aberrant innate immune activation and leucocyte-endothelial interactions as central to the pathology of severe COVID-19. The data from this unique cohort of high-risk individuals provide a valuable resource for identifying drug targets in COVID-19. 
Type Of Material Database/Collection of data 
Year Produced 2020 
Provided To Others? Yes  
Impact Open access dataset providing proteomic data on blood samples from COVID19 patients 
URL http://datadryad.org/stash/dataset/doi:10.5061/dryad.6t1g1jwxj
 
Title Multi-omics identify LRRC15 as a COVID-19 severity predictor and persistent pro-thrombotic signals in convalescence 
Description RNA sequencing, SomaLogic proteomics and flow cytometry data were generated for two cohorts of end-stage kidney disease patients with COVID-19. The Wave 1 cohort consists of samples collected from patients during the first wave of COVID-19 in early 2020, while samples were collected for the Wave 2 cohort in the following year. This data deposition includes the RNA-seq counts, SomaScan proteomics, flow cytometry and clinical metadata associated with the study. For further information about the study and data, see the associated GitHub repository (https://github.com/jackgisby/covid-longitudinal-multi-omics) or our pre-print (https://doi.org/10.1101/2022.04.29.22274267). The repository also contains code to replicate our analysis of the data. The raw RNA-seq reads were processed using the nf-core RNA-seq v3.2 pipeline before htseq-count was used to generate a raw counts matrix, which is included in this deposition ( htseq_counts.csv). Three files make up the proteomics data: sample_technical_meta.csv, feature_meta.csv and soma_abundance.csv. The first two files contain metadata columns for the samples and protein features, respectively. The final file includes the unprocessed protein abundance data. The files general_panel.csv and t_cell_panel.csv contain the flow cytometry data, split into the general and T-cell panels, respectively. Finally, clinical metadata is available for the two cohorts described in this study ( w1_metadata.csv, w2_metadata.csv). The features in the clinical metadata include: Column Name Data Type Description sample_id Character Unique identifier for samples individual_id Character Unique identifier for individuals ethnicity Character The individual's ethnicity (asian, white, black or other) sex Character The individual's sex (M or F) calc_age Integer Age in years ihd Character Information on coronary heart disease previous_vte Character Whether individuals have had venous thromboembolism copd Character Whether individuals have chronic obstructive pulmonary disease diabetes Character Whether individuals have diabetes, and, if so, the type of diabetes smoking Character Smoking status cause_eskd Character Cause of ESKD WHO_severity Character The peak (WHO) severity for the patient over the disease course WHO_temp_severity Character The (WHO) severity at time of sampling fatal_disease Logical Whether the disease was fatal case_control Character Whether the individual was COVID-19 POSITIVE or NEGATIVE at time of sampling. Convalescent patients are denoted by the label RECOVERY radiology_evidence_covid Character Evidence of COVID-19 from radiology time_from_first_symptoms Integer The number of days since the individual first experienced COVID symptoms at time of sampling time_from_first_positive_swab Integer The number of days since the individual's first positive swab was taken at time of sampling 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
Impact 1) These data reveal prolonged gene expression abnormalities in immune cells for months post-COVID19, including up regulation of pro-thrombotic pathways, which may explain the prolonged increased risk of clotting since after COVID-19 infection. 2) These data identify falling plasma levels of LRRC15 as a poor prognostic sign in COVID-19. A number of independent groups have since published that this may be an alternative protective receptor against SARS-COV-2: e.g. see https://doi.org/10.1371/journal.pbio.3001959 and https://doi.org/10.1371/journal.pbio.3001967 . This finding has opened new research avenues into a previously uncharacterised aspect of host susceptibility to viral infection. 
URL https://zenodo.org/record/6497250
 
Title Multi-omics identify LRRC15 as a COVID-19 severity predictor and persistent pro-thrombotic signals in convalescence 
Description RNA sequencing, SomaLogic proteomics and flow cytometry data were generated for two cohorts of end-stage kidney disease patients with COVID-19. The Wave 1 cohort consists of samples collected from patients during the first wave of COVID-19 in early 2020, while samples were collected for the Wave 2 cohort in the following year. This data deposition includes the RNA-seq counts, SomaScan proteomics, flow cytometry and clinical metadata associated with the study. For further information about the study and data, see the associated GitHub repository (https://github.com/jackgisby/covid-longitudinal-multi-omics) or our pre-print (https://doi.org/10.1101/2022.04.29.22274267). The repository also contains code to replicate our analysis of the data. The raw RNA-seq reads were processed using the nf-core RNA-seq v3.2 pipeline before htseq-count was used to generate a raw counts matrix, which is included in this deposition ( htseq_counts.csv). Three files make up the proteomics data: sample_technical_meta.csv, feature_meta.csv and soma_abundance.csv. The first two files contain metadata columns for the samples and protein features, respectively. The final file includes the unprocessed protein abundance data. The files general_panel.csv and t_cell_panel.csv contain the flow cytometry data, split into the general and T-cell panels, respectively. Finally, clinical metadata is available for the two cohorts described in this study ( w1_metadata.csv, w2_metadata.csv). The features in the clinical metadata include: Column Name Data Type Description sample_id Character Unique identifier for samples individual_id Character Unique identifier for individuals ethnicity Character The individual's ethnicity (asian, white, black or other) sex Character The individual's sex (M or F) calc_age Integer Age in years ihd Character Information on coronary heart disease previous_vte Character Whether individuals have had venous thromboembolism copd Character Whether individuals have chronic obstructive pulmonary disease diabetes Character Whether individuals have diabetes, and, if so, the type of diabetes smoking Character Smoking status cause_eskd Character Cause of ESKD WHO_severity Character The peak (WHO) severity for the patient over the disease course WHO_temp_severity Character The (WHO) severity at time of sampling fatal_disease Logical Whether the disease was fatal case_control Character Whether the individual was COVID-19 POSITIVE or NEGATIVE at time of sampling. Convalescent patients are denoted by the label RECOVERY radiology_evidence_covid Character Evidence of COVID-19 from radiology time_from_first_symptoms Integer The number of days since the individual first experienced COVID symptoms at time of sampling time_from_first_positive_swab Integer The number of days since the individual's first positive swab was taken at time of sampling 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
Impact Gisby, J.S., Buang, N.B., Papadaki, A. et al. Multi-omics identify falling LRRC15 as a COVID-19 severity marker and persistent pro-thrombotic signals in convalescence. Nat Commun 13, 7775 (2022). https://doi.org/10.1038/s41467-022-35454-4 This paper identified LRRC15, a proposed alternative receptor for SARS-CoV2, as a key biomarker of COVID-19 severity. In addition, it demonstrated persistent abnormalities of gene expression in peripheral blood immune cells for months after clinical recovery from COVID-19, with activation of prothrombotic gene expression programmes. This may explain the prolonged increased risk of clots that occurs post COVID-19. 
URL https://zenodo.org/record/6497251
 
Title Proteomic dataset from broncho-alveolar lavage fluid and blood of post-COVID19 patients 
Description Proteomic dataset from broncho-alveolar lavage fluid and blood (plasma) of post-COVID19 patients and controls. Proteins were measured using Olink immunoassays. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
Impact One of the first papers describing the role of ongoing immune-inflammation in the lungs of patients with pulmonary post-COVID19 syndromes ("long COVID"). 
URL https://doi.org/10.5061/dryad.2ngf1vhq3
 
Title Raw RNA-seq COVID-19 ESKD dataset 
Description This dataset contains paired RNA sequencing data for end-stage kidney disease (ESKD) patients on dialysis. There are two cohorts. The first includes 179 samples from 51 COVID-19 patients recruited during the initial phase of the COVID-19 pandemic (April-May 2020) and 55 non-infected ESKD patients as controls. 17 patients initially recruited as controls as part of the Wave 1 cohort were later infected with COVID-19 in January-March 2021. We acquired a total of 90 samples during the acute infection and convalescent samples for 12 of the 17 patients following the acute COVID-19 episode. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
Impact Gisby, J.S., Buang, N.B., Papadaki, A. et al. Multi-omics identify falling LRRC15 as a COVID-19 severity marker and persistent pro-thrombotic signals in convalescence. Nat Commun 13, 7775 (2022). https://doi.org/10.1038/s41467-022-35454-4 
URL https://ega-archive.org/studies/EGAS00001006778
 
Description Large vessel vasculitis collaboration 
Organisation University of Leeds
Department Leeds School of Medicine
Country United Kingdom 
Sector Academic/University 
PI Contribution Contribution of patient samples. Contribution to running assays and analysis..
Collaborator Contribution Contribution of patient samples. Contribution to running assays and analysis.
Impact Not yet
Start Year 2020
 
Description Lupus collaboration 
Organisation King's College London
Country United Kingdom 
Sector Academic/University 
PI Contribution Contribution of patient samples, intellectual input and data analysis
Collaborator Contribution Contribution of patient samples, intellectual input and data analysis
Impact Successful award of approx. 3 million USD from the Lupus Research Alliance (Global Team Science Award).
Start Year 2020
 
Description Lupus collaboration 
Organisation Monash University
Country Australia 
Sector Academic/University 
PI Contribution Contribution of patient samples, intellectual input and data analysis
Collaborator Contribution Contribution of patient samples, intellectual input and data analysis
Impact Successful award of approx. 3 million USD from the Lupus Research Alliance (Global Team Science Award).
Start Year 2020
 
Description Mendelian randomisation of proteins in severe COVID-19 
Organisation Medical Research Council (MRC)
Department MRC Human Genetics Unit
Country United Kingdom 
Sector Academic/University 
PI Contribution We conceived the study and contributed data that fed into the research question. Jointly with Prof Jim Wilson's group (University of Edinburgh/MRC Human Genetics Unit), we performed data analysis, data interpretation, writing the manuscript.
Collaborator Contribution Prof Jim Wilson's group (University of Edinburgh/MRC Human Genetics Unit) contributed protein quantitative trait locus data and performed, jointly with my group, data analysis, data interpretation, and writing the manuscript. They also contributed resources in terms of high performance computing clusters.
Impact Klaric et al. Preprint on MedRxiv 2021. https://doi.org/10.1101/2021.04.01.21254789
Start Year 2020
 
Title ???????????????????????? 
Description Provided are a classification method, kit, and biomarker that can be utilized to classify a coronavirus-infected patient. This classification method for a coronavirus-infected patient according to one embodiment of the present invention includes a step for detecting at least one protein selected from the group consisting of the GDF15 protein, the WFDC2 protein, the CHI3L1 protein, and the KRT19 protein in a specimen collected from a subject. A classification method for a systemic inflammatory response syndrome patient according to another embodiment includes a step for detecting the GDF15 protein in a specimen collected from a subject. 
IP Reference JPWO2023063021 
Protection Patent / Patent application
Year Protection Granted 2023
Licensed No
Impact As above
 
Title METHOD OF PROGNOSIS 
Description Disclosed herein are methods and uses for identifying if a patient is predisposed to experiencing a severe episode of a disease and/ or at risk of experiencing a severe episode of a disease, comprising detecting the presence of a biosignature comprising at least the biomarkers hepatocyte growth factor (HGF) and Spondin-1 (SPON-1) and optionally other biomarkers as described herein. Also disclosed are methods of selecting medical interventions, evaluating the effectiveness of medical interventions, and of preventing and/or treating a severe episode of a disease, comprising detecting the presence of a biosignature comprising biomarkers as described herein and identifying if a patient is predisposed to experiencing a severe episode of a disease and/or at risk of experiencing a severe episode of a disease. Also disclosed are kits for identifying if a patient is predisposed to experiencing a severe episode of a disease and/ or at risk of experiencing a severe episode of a disease, comprising a means for detecting the presence of a biosignature comprising biomarkers as described herein. 
IP Reference WO2022243703 
Protection Patent / Patent application
Year Protection Granted 2022
Licensed No
Impact As above
 
Description Collaborative Covid Immunology: A UK-CIC Conference 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact Several hundred people attended this online conference. My PhD student presented our work in ePoster format.
Year(s) Of Engagement Activity 2021
 
Description Presentation at the SCALLOP consortium meeting 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact I presented our COVID-19 proteomic study to the SCALLOP consortium meeting. This resulted in useful feedback and a new research collaboration.
Year(s) Of Engagement Activity 2020
URL https://www.olink.com/scallop/
 
Description Presentation at the UK Coronavirus Immunology Consortium (UK-CIC) meeting 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact Leading national experts/PIs in Immunology from 20 centres of excellence attended the meeting. I gave one of two talks. The talk sparked great interest in biological insights into severe COVID-19 and potential opportunities for therapeutic intervention, and also around proteomic techniques.
Year(s) Of Engagement Activity 2020
 
Description Press release re our paper profiling the immune cells and proteins in the lungs of patients with pulmonary post-COVID19 syndromes 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Media (as a channel to the public)
Results and Impact Press release describing the findings of our study. In summary, we showed that patients with persistent breathlessness months after COVID-19 infection have an ongoing immune 'attack' in their lungs. This may have implications for potential treatments e.g. with drugs that suppress inflammation.
This was also featured in the Imperial College internal newsletter which will have reached >1000 people.
Year(s) Of Engagement Activity 2022
 
Description Social media dissemination of our study on post-COVID19 pulmonary syndromes 
Form Of Engagement Activity Engagement focused website, blog or social media channel
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Twitter post advertising our paper in Immunity https://doi.org/10.1016/j.immuni.2022.01.017
57 retweets, 186 likes.
Year(s) Of Engagement Activity 2022
URL https://twitter.com/HarkerImmuno/status/1486369117162586116?cxt=HHwWiICymci10qApAAAA
 
Description Wellcome Genome Campus Longitudinal Studies Conference 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This virtual conference was attended by several hundred people (primarily a research audience, but also clinicians and policy makers). My PhD student gave a talk on our work in longitudinal proteomic profiling and took part in a panel Q&A.
Year(s) Of Engagement Activity 2021
URL https://coursesandconferences.wellcomeconnectingscience.org/event/longitudinal-studies-virtual-confe...