The association of wearable sensor measures of time use with cardiovascular disease
Lead Research Organisation:
University of Oxford
Department Name: Population Health
Abstract
Keywords: Physical activity- Accelerometer- Cardiovascular disease- Statistical methods- Big Data- Genomics
Background
Cardiovascular disease (CVD) is the leading cause of mortality, globally and in the UK1. CVD prevention is therefore a public health priority. Physical inactivity is associated with adverse health outcomes (e.g. higher all-cause mortality, CVD rates and Type 2 diabetes rates)2; moderate-to-vigorous intensity physical activity seems to reduce CVD risk3,4. However, evidence is limited by the use of self-reported questionnaire data which is crude and prone to measurement error. Therefore, more research is needed to understand quantitatively the change in risk conferred by different physical activity (PA) profiles, including the role of light intensity PA.
Measurement of PA using accelerometers, which are devices worn on the wrist, produces vast amounts of data that can be challenging to analyse as it is compositional. Statistical methods developed in different contexts may be helpful e.g. compositional data analysis and isotemporal substitution10-12. Accelerometer data also comes as a time series: functional data analysis and ARIMA modelling are candidate methods to capture its particular structure13. There is limited literature applying these methods to PA data, so research and development is needed to understand which methods are most useful in creating PA profiles for epidemiology.
Until recently, there was little understanding of genetic influences on PA behaviours8. Understanding genetic influences on PA behaviours allows better understanding of its underlying biology and causal inference of its potential association with CVD outcomes.
Aims
This project aims to improve understanding of PA and its association with CVD, using objectively measured PA data and genetic epidemiology. The objectives are:
1. To identify and develop methods to characterise activity profiles from time-series device data.
2. To perform epidemiological investigations into the association between PA patterns and risk of CVD outcomes.
3. To perform genome-wide association studies (GWAS) to identify genetic variants associated with PA profiles and to apply Mendelian Randomization to assess causal relevance of PA profiles with CVD outcomes.
Data and Data Preparation
UK Biobank is a prospective study of 500,000 individuals, aged 40-69 at recruitment5. Participants have given various health-related measurements and have been genotyped6. Linking with health registries allows participants' health outcomes to be tracked. Over 100,000 participants wore wrist-worn accelerometers for a week7,9.
Methods
A review of statistical methods potentially relevant to PA data analysis will be performed. Methods developed for analysis of compositional and/or time series data in different fields will be assessed and developed for application in this epidemiological context.
Prospective epidemiological analysis of PA profiles with CVD outcomes will be carried out. Cox regression analyses will be conducted in order to assess the association of the newly derived PA metrics with ischaemic heart disease and stroke with adjustment for age, sex, income, education, alcohol, smoking and sedentary behaviour in the first instance.
A genome-wide association study (GWAS) will be performed on the newly derived PA metrics. The goal is to apply genomic understanding in analysing the potential causal role of PA in disease aetiology via Mendelian Randomisation.
References
1. Murray CJL et al. Lancet. 2012;380(9859):2197-2223.
2. Lee I-Min et al. Lancet. 2012;380(9838):219-229.
3. Bennett DA et al. JAMA Cardiol. 2017:1-10.
4. Stewart J et al. JRSM Cardiovasc Dis. 2017;6:204800401668721.
5. Sudlow C et al. PLoS Medicine. 2015: 12(3), p.e1001779.
6. Bycroft C et al. Nature. 2018: 562(7726), p.203.
7. Doherty A et al. PloS one. 2017: 12(2), p.e0169649.
8. Bauman AE et al. Lancet. 2012;380(12):258-271.
9. Willetts M et al. bioRxiv. 2017:187625.
10. Ch
Background
Cardiovascular disease (CVD) is the leading cause of mortality, globally and in the UK1. CVD prevention is therefore a public health priority. Physical inactivity is associated with adverse health outcomes (e.g. higher all-cause mortality, CVD rates and Type 2 diabetes rates)2; moderate-to-vigorous intensity physical activity seems to reduce CVD risk3,4. However, evidence is limited by the use of self-reported questionnaire data which is crude and prone to measurement error. Therefore, more research is needed to understand quantitatively the change in risk conferred by different physical activity (PA) profiles, including the role of light intensity PA.
Measurement of PA using accelerometers, which are devices worn on the wrist, produces vast amounts of data that can be challenging to analyse as it is compositional. Statistical methods developed in different contexts may be helpful e.g. compositional data analysis and isotemporal substitution10-12. Accelerometer data also comes as a time series: functional data analysis and ARIMA modelling are candidate methods to capture its particular structure13. There is limited literature applying these methods to PA data, so research and development is needed to understand which methods are most useful in creating PA profiles for epidemiology.
Until recently, there was little understanding of genetic influences on PA behaviours8. Understanding genetic influences on PA behaviours allows better understanding of its underlying biology and causal inference of its potential association with CVD outcomes.
Aims
This project aims to improve understanding of PA and its association with CVD, using objectively measured PA data and genetic epidemiology. The objectives are:
1. To identify and develop methods to characterise activity profiles from time-series device data.
2. To perform epidemiological investigations into the association between PA patterns and risk of CVD outcomes.
3. To perform genome-wide association studies (GWAS) to identify genetic variants associated with PA profiles and to apply Mendelian Randomization to assess causal relevance of PA profiles with CVD outcomes.
Data and Data Preparation
UK Biobank is a prospective study of 500,000 individuals, aged 40-69 at recruitment5. Participants have given various health-related measurements and have been genotyped6. Linking with health registries allows participants' health outcomes to be tracked. Over 100,000 participants wore wrist-worn accelerometers for a week7,9.
Methods
A review of statistical methods potentially relevant to PA data analysis will be performed. Methods developed for analysis of compositional and/or time series data in different fields will be assessed and developed for application in this epidemiological context.
Prospective epidemiological analysis of PA profiles with CVD outcomes will be carried out. Cox regression analyses will be conducted in order to assess the association of the newly derived PA metrics with ischaemic heart disease and stroke with adjustment for age, sex, income, education, alcohol, smoking and sedentary behaviour in the first instance.
A genome-wide association study (GWAS) will be performed on the newly derived PA metrics. The goal is to apply genomic understanding in analysing the potential causal role of PA in disease aetiology via Mendelian Randomisation.
References
1. Murray CJL et al. Lancet. 2012;380(9859):2197-2223.
2. Lee I-Min et al. Lancet. 2012;380(9838):219-229.
3. Bennett DA et al. JAMA Cardiol. 2017:1-10.
4. Stewart J et al. JRSM Cardiovasc Dis. 2017;6:204800401668721.
5. Sudlow C et al. PLoS Medicine. 2015: 12(3), p.e1001779.
6. Bycroft C et al. Nature. 2018: 562(7726), p.203.
7. Doherty A et al. PloS one. 2017: 12(2), p.e0169649.
8. Bauman AE et al. Lancet. 2012;380(12):258-271.
9. Willetts M et al. bioRxiv. 2017:187625.
10. Ch
Organisations
People |
ORCID iD |
Aiden Doherty (Primary Supervisor) | |
Rosemary Walmsley (Student) |
Studentship Projects
Project Reference | Relationship | Related To | Start | End | Student Name |
---|---|---|---|---|---|
MR/S502509/1 | 01/10/2018 | 30/06/2022 | |||
2107664 | Studentship | MR/S502509/1 | 01/10/2018 | 30/06/2022 | Rosemary Walmsley |
Description | Contribution to MSc course |
Geographic Reach | Local/Municipal/Regional |
Policy Influence Type | Influenced training of practitioners or researchers |
Impact | Increased awareness of reproducible research methods. |
Description | Contribution to workshop on consumer wearables to understand cardiovascular disease. |
Geographic Reach | National |
Policy Influence Type | Contribution to a national consultation/review |
Description | Course material for CDT Health Data Science |
Geographic Reach | Local/Municipal/Regional |
Policy Influence Type | Influenced training of practitioners or researchers |
Impact | This activity increased trainee researchers' skills in using wearable device data. |
URL | https://activitymonitoring.github.io/cdtWearablesHealth/index.html |
Title | Capture-24: Activity tracker dataset for human activity recognition |
Description | Dataset contains accelerometer data alongside ground-truth activity labels derived from wearable camera images and time use diaries. The data was collected from 151 participants in free-living. It enables the development of methods for activity recognition in accelerometer data. |
Type Of Material | Database/Collection of data |
Year Produced | 2021 |
Provided To Others? | Yes |
Impact | This dataset has been used to develop methods for activity recognition in accelerometer data, which have subsequently been used in epidemiological studies. |
URL | https://ora.ox.ac.uk/objects/uuid:99d7c092-d865-4a19-b096-cc16440cd001 |
Title | biobankAccelerometerAnalysis |
Description | A tool for extracting health information from large-scale accelerometer datasets. |
Type Of Technology | Software |
Year Produced | 2014 |
Open Source License? | Yes |
Impact | Used for accelerometer data processing in several health studies. |
URL | https://github.com/activityMonitoring/biobankAccelerometerAnalysis |
Title | epicoda |
Description | epicoda is an R package designed to support epidemiological analyses using compositional exposure variables. It provides wrappers for common epidemiological use cases. Simulated data (simdata) can be used to try out the functions, and a vignette illustrates the steps to carrying out an epidemiological analysis with a Compositional Data Analysis approach to the exposure. |
Type Of Technology | Software |
Year Produced | 2020 |
Open Source License? | Yes |
Impact | This software has been used in a preprint (https://www.medrxiv.org/content/10.1101/2020.11.10.20227769v3). It has also been used in teaching. |
URL | https://github.com/activityMonitoring/epicoda |
Title | ukb_download_and_prep_template |
Description | This tool facilitates analyses using UK Biobank data by automating lots of the data preparation steps. |
Type Of Technology | Software |
Year Produced | 2020 |
Open Source License? | Yes |
Impact | This software has been used to facilitate data pre-processing. |
URL | https://github.com/activityMonitoring/ukb_download_and_prep_template |
Description | News article about paper |
Form Of Engagement Activity | A press release, press conference or response to a media enquiry/interview |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Public/other audiences |
Results and Impact | News article about research paper (https://bjsm.bmj.com/content/early/2022/02/15/bjsports-2021-104050) to disseminate results to a wider audience. |
Year(s) Of Engagement Activity | 2021 |
URL | https://www.ndph.ox.ac.uk/news/new-machine-learning-approaches-could-help-reveal-the-best-daily-acti... |
Description | Open Doors Talk |
Form Of Engagement Activity | Participation in an open day or visit at my research institution |
Part Of Official Scheme? | No |
Geographic Reach | Regional |
Primary Audience | Public/other audiences |
Results and Impact | An 'Open Doors' open day event was held at our organisation. At this event, I gave a short talk about my research and how it makes use of 'big data'. |
Year(s) Of Engagement Activity | 2019 |
URL | https://www.bdi.ox.ac.uk/upcoming-events/oxford-open-doors/oxford-open-doors-2019-programme-for-the-... |
Description | Presentation for UK Biobank Scientific Conference |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Study participants or study members |
Results and Impact | I prepared a video 'Thesis in Three Minutes' presentation on my research for the UK Biobank Scientific Conference. It was intended to share this research with a broader audience. This was disseminated on UK Biobank social media channels, aimed both at researchers and at study participants. |
Year(s) Of Engagement Activity | 2021 |
URL | https://www.youtube.com/watch?v=bwSLvT8bPLc |
Description | School Visit |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | Regional |
Primary Audience | Schools |
Results and Impact | I visited a local secondary school to give a talk about my research and about careers in science. The school reported that the students had found it interesting, especially to learn that science does not always happen in a laboratory. |
Year(s) Of Engagement Activity | 2020 |
Description | School Visit (Big Data Institute) |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | Regional |
Primary Audience | Schools |
Results and Impact | A group of school students (14 - 15 years) participating in a week to learn more about science and scientific careers visited our organisation. I gave a talk on my work and on day-to-day life as a postgraduate researcher. The students' teachers reported that the students enjoyed the visit and were very interested in the range of scientific careers available. |
Year(s) Of Engagement Activity | 2020 |
URL | https://scienceoxford.com/science-oxfords-stem-insight-week-a-first-byte-into-big-data/ |