Using data to improve public health: COVID-19 secondment
Lead Research Organisation:
Plymouth University
Department Name: Sch of Computing, Electronics & Maths
Abstract
The COVID-19 pandemic has seen the importance of using data to understand the spread of the Coronavirus, predict future trends of the pandemic in space and time, forecast challenges for healthcare delivery and inform government management policies. The availability of national electronic health records and longitudinal cohort studies provides unique opportunities for using big data to better understand the impact of COVID-19 on health, society and economics.
The aim of this fellowship is to develop well-designed statistical analyses which make the best use of data from multiple sources in order to understand the impacts and management of the COVID-19 pandemic to improve public health.
The objectives of this fellowship are to:
1. Identify risk factors associated with Long COVID.
2. Evaluate the effectiveness and safety of vaccines.
3. Quantify the healthcare disruptions during different waves of the pandemic.
4. Assess the effects of COVID-19 infection on other health outcomes.
A key focus of this fellowship will be for me to advance my research career from an independent researcher to become a leading expert in big data enabled health research. The proposed research will allow me to:
a) Conduct COVID-19 research through engaging in the Longitudinal Health and Wellbeing National Core Study.
The fellowship will allow me to apply and advance my expertise in big data, statistical modelling and epidemiology to develop statistical analyses for addressing important research questions which will have an impact on public health.
b) Grow a research team for COVID-19 research and health data science.
This will be achieved through bringing together researchers to support this fellowship, and by making funding applications to support the team's activities and to train early-career researchers beyond this fellowship.
c) Advance my expertise in using big data for health research.
I have an intrinsic passion for, and continually develop my experience in, using real-world big data to address important research questions, especially within the clinical decision-making framework. This fellowship will allow me to extend my expertise in this exciting area that has a great potential to influence healthcare policy leading to better population health.
d) Disseminate research outputs widely.
I will disseminate the outputs from this research in peer-reviewed journal publications and conference presentations. Furthermore, I will create a website to communicate the key findings of my work to a wider audience including the general public. I will also host seminars or a stakeholder workshop related to COVID-19, to generate further interests, collaborations and impact of the research.
Achieving these fellowship objectives will enable me to reach my career goal to be a leading expert in big data approaches to health, heading up a research team that develops quantitative methods to address research questions related to COVID-19 public health and beyond. Big data approaches will allow us to provide better insights, more robust conclusions, and better recommendations for health policy, which will ultimately benefit the population health.
The aim of this fellowship is to develop well-designed statistical analyses which make the best use of data from multiple sources in order to understand the impacts and management of the COVID-19 pandemic to improve public health.
The objectives of this fellowship are to:
1. Identify risk factors associated with Long COVID.
2. Evaluate the effectiveness and safety of vaccines.
3. Quantify the healthcare disruptions during different waves of the pandemic.
4. Assess the effects of COVID-19 infection on other health outcomes.
A key focus of this fellowship will be for me to advance my research career from an independent researcher to become a leading expert in big data enabled health research. The proposed research will allow me to:
a) Conduct COVID-19 research through engaging in the Longitudinal Health and Wellbeing National Core Study.
The fellowship will allow me to apply and advance my expertise in big data, statistical modelling and epidemiology to develop statistical analyses for addressing important research questions which will have an impact on public health.
b) Grow a research team for COVID-19 research and health data science.
This will be achieved through bringing together researchers to support this fellowship, and by making funding applications to support the team's activities and to train early-career researchers beyond this fellowship.
c) Advance my expertise in using big data for health research.
I have an intrinsic passion for, and continually develop my experience in, using real-world big data to address important research questions, especially within the clinical decision-making framework. This fellowship will allow me to extend my expertise in this exciting area that has a great potential to influence healthcare policy leading to better population health.
d) Disseminate research outputs widely.
I will disseminate the outputs from this research in peer-reviewed journal publications and conference presentations. Furthermore, I will create a website to communicate the key findings of my work to a wider audience including the general public. I will also host seminars or a stakeholder workshop related to COVID-19, to generate further interests, collaborations and impact of the research.
Achieving these fellowship objectives will enable me to reach my career goal to be a leading expert in big data approaches to health, heading up a research team that develops quantitative methods to address research questions related to COVID-19 public health and beyond. Big data approaches will allow us to provide better insights, more robust conclusions, and better recommendations for health policy, which will ultimately benefit the population health.
Technical Summary
The fellowship will focus on making the best use of data from multiple sources to better understand the impact of COVID-19 on population health in the UK.
The proposed research includes,
1. Identifying risk factors associated with Long COVID.
o Use data from multiple sources, such as longitudinal cohort studies and/or electronic health records, to develop a prediction model to classify patients into groups with and without Long COVID.
o Use unsupervised learning to further identify and define the subgroups of patients with Long COVID.
2. Evaluating vaccine effectiveness and safety using real-world (out of trial) data.
o Evaluate and compare the effectiveness of different methods of vaccine offering: one dose, two doses of the same brand, and mixed brands of vaccine.
o Evaluate vaccine safety, such as possible blood clots and heart inflammation, by using regression models for rare adverse events to better identify subgroups of patients at high risk.
3. Quantifying healthcare disruptions during different waves of the pandemic.
o Analyse the data related to COVID-19 healthcare activity, to identify how the healthcare burdens change in different waves of the pandemic.
o Investigate what factors are associated with healthcare burdens, what are the contributing factors in heterogeneity, and identify good practices in mitigating healthcare disruptions.
4. Assessing the effects of COVID-19 infection on other health outcomes.
o Use electronic health records and/or multiple longitudinal cohort studies, to examine how COVID-19 infection impacts on other health outcomes.
o Quantify how the effects of COVID-19 infection on health outcomes change with time.
o Identify subgroups of the population who are more vulnerable to adverse health outcomes associated with COVID-19 infection.
Well documented statistical code will be produced to allow for future updated analysis when new data become available, and for reproducible research.
The proposed research includes,
1. Identifying risk factors associated with Long COVID.
o Use data from multiple sources, such as longitudinal cohort studies and/or electronic health records, to develop a prediction model to classify patients into groups with and without Long COVID.
o Use unsupervised learning to further identify and define the subgroups of patients with Long COVID.
2. Evaluating vaccine effectiveness and safety using real-world (out of trial) data.
o Evaluate and compare the effectiveness of different methods of vaccine offering: one dose, two doses of the same brand, and mixed brands of vaccine.
o Evaluate vaccine safety, such as possible blood clots and heart inflammation, by using regression models for rare adverse events to better identify subgroups of patients at high risk.
3. Quantifying healthcare disruptions during different waves of the pandemic.
o Analyse the data related to COVID-19 healthcare activity, to identify how the healthcare burdens change in different waves of the pandemic.
o Investigate what factors are associated with healthcare burdens, what are the contributing factors in heterogeneity, and identify good practices in mitigating healthcare disruptions.
4. Assessing the effects of COVID-19 infection on other health outcomes.
o Use electronic health records and/or multiple longitudinal cohort studies, to examine how COVID-19 infection impacts on other health outcomes.
o Quantify how the effects of COVID-19 infection on health outcomes change with time.
o Identify subgroups of the population who are more vulnerable to adverse health outcomes associated with COVID-19 infection.
Well documented statistical code will be produced to allow for future updated analysis when new data become available, and for reproducible research.
Publications
Cezard GI
(2024)
Impact of vaccination on the association of COVID-19 with cardiovascular diseases: An OpenSAFELY cohort study.
in Nature communications
Horne EMF
(2024)
CHALLENGES IN ESTIMATING THE EFFECTIVENESS OF 2 DOSES OF COVID-19 VACCINE BEYOND 6 MONTHS IN ENGLAND.
in American journal of epidemiology
Prestige E
(2022)
Covid lockdowns in the UK: Estimating their effects on transmission.
in Significance (Oxford, England)
Wei Y
(2023)
Bivariate copula regression models for semi-competing risks
in Statistical Methods in Medical Research
Description | Enhancing Understanding of Long COVID Using Novel Mathematical Clustering Techniques |
Amount | £79,506 (GBP) |
Funding ID | 2738361 |
Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
Sector | Public |
Country | United Kingdom |
Start | 08/2022 |
End | 03/2026 |
Description | NIHR Pre-Doctoral Fellowship |
Amount | £89,205 (GBP) |
Funding ID | NIHR302739 |
Organisation | National Institute for Health Research |
Sector | Public |
Country | United Kingdom |
Start | 08/2022 |
End | 09/2024 |
Description | NIHR Pre-Doctoral Fellowship |
Amount | £66,150 (GBP) |
Funding ID | NIHR303372 |
Organisation | National Institute for Health Research |
Sector | Public |
Country | United Kingdom |
Start | 09/2023 |
End | 03/2025 |
Description | Electronic Health Records Analysis |
Organisation | University of Bristol |
Country | United Kingdom |
Sector | Academic/University |
PI Contribution | Led the study design, analysis, interpretation and write up on determining patient characteristics associated with clinically coded long COVID. Conducted analysis using big data from linked electronic health records via the OpenSAFELY secure analytics platform. Intellectual input in study design, data curation, analysis, validation and interpretation on investigating health outcomes following COVID-19 diagnosis. |
Collaborator Contribution | Intellectual input in study design, data curation, analysis, validation and interpretation. |
Impact | [1] Challenges in estimating waning effectiveness of two doses of BNT162b2 and ChAdOx1 COVID-19 vaccines beyond six months: an OpenSAFELY cohort study using linked electronic health records. https://doi.org/10.1101/2023.01.04.22283762 [2].Patient characteristics associated with clinically coded long COVID: an OpenSAFELY study using electronic health records. https://www.medrxiv.org/content/10.1101/2023.06.23.23291776v1 [3[ Diabetes following SARS-CoV-2 infection: Incidence, persistence, and implications of COVID-19 vaccination. A cohort study of fifteen million people. https://www.medrxiv.org/content/10.1101/2023.08.07.23293778.abstract [4] Impact of COVID-19 on mental illness in vaccinated and unvaccinated people: a population-based cohort study in OpenSAFELY. https://doi.org/10.1101/2023.12.06.23299602 [5] Impact of vaccination on the association of COVID-19 with arterial and venous thrombotic diseases: an OpenSAFELY cohort study using linked electronic health records. https://doi.org/10.21203/rs.3.rs-3168263/v1 |
Start Year | 2021 |
Description | Y Wei Presentation at Royal Statistical Society 2022 International Conference - Sep 2022 |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Yinghui Wei presented 'Risk factors for Long COVID: big data analytics using OpenSAFELY' at the Royal Statistical Society 2022 International Conference |
Year(s) Of Engagement Activity | 2022 |
URL | https://virtual.oxfordabstracts.com/#/event/2726/submission/233 |
Description | Y Wei organised invited session on "Health data science for COVID-19 research" at Royal Statistical Society 2022 International Conference - Sep 2022 |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Yinghui Wei organised an invited session on 'Health data science for COVID-19 research' at Royal Statistical Society 2022 International Conference. |
Year(s) Of Engagement Activity | 2022 |
URL | https://virtual.oxfordabstracts.com/#/event/2726/session/36394 |
Description | Y Wei presentation at Epidemiology Seminar at Institut für Epidemiologie und Sozialmedizin, University Munster |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Invited seminar on "Patient characteristics associated with clinically coded long COVID: an OpenSAFELY study using electronic health records". |
Year(s) Of Engagement Activity | 2023 |