📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

A statistical framework for the apportionment of particulate contaminants and their health effect determination

Lead Research Organisation: Imperial College London
Department Name: School of Public Health

Abstract

Air pollution is a complex mixture of diverse substances from anthropogenic and natural sources. These sources in combination with factors such as meteorology and chemical/biological transformations, determine the air pollution concentration and the variation in the physiochemical components across space and time. The identification of these sources is a key element for developing effective and efficient strategies to control and reduce pollution through targeted actions. In addition, air pollution is a major public health concern, being increasingly associated with risk of morbidity and mortality of human populations. Recent evidence points out that mixture of particles from different sources can have a different detrimental contribution on health; this makes the understanding of pollutant sources even more important in order for air quality managers to fully understand the potential health outcomes of pollutant mixtures. While knowledge of the main sources of pollution can be effectively obtained on temporal and/or geographical localised setting, the modelling and the understanding of some aspects of the dynamic and physiochemical processes remain a substantial challenge.

This project focuses primarily upon: (i) the development of a methodological approach for particle matter (PM) source apportionment (SA) which uses nonparametric processes with dependence on dynamic factors (e.g. meteorology) to model the underlying spatial or temporal structure and the distribution of contaminants to identify sources; (ii) the quantification of the impact of apportioned air contaminants upon vulnerable populations; and (iii) the translation of this methodological approach to real-life decision making through the predictions of the health outcomes under changing scenarios of pollution mix as a result of potential policy implementations.
The proposed approach will be tested against the state-of-the-art tools for SA of air pollution, using simulated examples. Evaluation of the adverse responses associated with air particulate sources is reached by comparing two-stage procedures vs joint models for SA and health-effect assessment. We will consider two real case studies: (i) to identify time-varying sources of particles (PM2.5) in Greater London and evaluate their acute effects on respiratory hospital admissions in vulnerable populations (0-14 years, 65+) in a time-series framework; (ii) to disentangle spatially-varying sources of particles (PM2.5) in South East England and evaluate their respiratory chronic effects in the same region in a small-area framework.

By using rigorous and innovative methodologies, we believe that the proposed research (i) will provide scientific evidence of the differential harmful effect of PM chemical components, (ii) will help understand the sources that can be controlled, and (iii) will have the potential to inform air pollution policy implementation and regulation to improve UK population health.

Technical Summary

Particulate matter (PM) is a complex mixture of diverse substances from anthropogenic and natural sources, which vary widely across space and time. Despite being a major public health concern, most studies have linked total PM to health outcomes, while it is now understood that contaminant mixtures deriving from different sources may have differential health effects.
We propose a Bayesian nonparametric approach that, starting from measurements of a wide range of components, estimate sources of PM and evaluate their health effects, through probabilistic clustering. We will account for spatial or temporal and covariate dependency (e.g. meteorology) in the cluster allocation; the link between sources and health outcomes will be evaluated using (i) a two-stage model where the sources identification is separated from the epidemiological model and (ii) a joint model, allowing for the outcome to influence the apportionment of the sources.
Our approach will provide competitive advantages over standard receptor methods, as it incorporates: (i) temporal dynamics in time-series speciation data, (ii) spatial dependence structure in multi-site contaminant data, (iii) missing data accommodation, (iv) quantification of uncertainty associated with source profiles, (v) nonlinear relationship between sources and health outcomes.
We will build an extensive simulation study and consider two real case studies: (i) to identify time-varying sources of PM in Greater London and evaluate their acute effects on respiratory hospital admissions in vulnerable populations; (ii) to disentangle spatially-varying sources of PM in South East England and evaluate their respiratory chronic effects. We believe that the proposed research (i) will provide new scientific evidence of the differential health impact of air pollutant mixtures, (ii) will help understand the sources that can be controlled and (iii) will have the potential to inform policy implementation for improving UK population health.

Planned Impact

The proposed research will develop a novel statistical approach to apportion chemical components of particulate pollution into sources and to evaluate their health effects. It will provide methodological advances: i) in the field of environmental exposure, providing competitive advantages over standard receptor methods as it explicitly accounts for the spatial or temporal dependences present in the source allocation; ii) in the field of environmental epidemiology, shifting the paradigm of the evaluation of health effects from total particulate matter (PM) to their sources, which will be more directly linked to policy implementations. The flexibility of the approach will ensure that it can be adapted to increased complexity in the data, as it could incorporate possible nonlinear relationship in the source profile exposure and health outcomes and distributed lags; a main characteristic of the framework regards the quantification of uncertainty associated with source profiles and its propagation into the health model.

The dynamicity (in space or time) of our approach will allow to disentangle the potential differential effect of sources at different locations and time points, which has never been attempted before. For instance, we will be able to identify point sources (e.g. factories) or diffuse contributions (e.g. urban traffic), which will lead to discussions about permits, fines, policies (e.g. ultra low emission zone) to directly control those sources. It is important to note that there might be large uncertainties about the extent and timing of pollution; a main advantage of our approach is the ability to account for and propagate these uncertainties throughout the model. We will communicate the level of uncertainty in the estimates visually through plots and maps.

We will build a toolbox with a friendly interface (using the R shiny web interface) which will allow users to upload their data, choose the model to run (depending on the space or time structure of the data) and to get the results in the form of a report with maps/plots and tables. Through the toolbox the research from the project will be freely available to a wide range of applied researchers, thus with the potential of becoming the state-of-the-art method to carry out source apportionment analyses and to link sources with health outcomes in epidemiological studies. We will advertise the toolbox on the MRC Centre for Environment and Health website (http://www.environment-health.ac.uk/) as well as on our personal webpages.

We will disseminate the results from our framework through the university press offices which will allow reaching a wide audience and being in contact with the media. Through the involvement of Public Health England in our research team and through the advisory board we will have direct access to policy makers across environmental and public health field. We have devoted a work package (WP5) to the translation of our methods and results to decision-making and will run two workshops (as outlined in the case for support and in the Communication plan) targeting specific stakeholders and policy makers, as well as the wider scientific community, to present the results and to discuss ways to inform policy implementation.
Finally, the developed framework will serve as a methodological platform which will be then transferable to the investigation of other environmental exposures. For instance, we could use the developed methodology for pollution in water or soil, or even for estimating probabilistic clusters of green and blue spaces and to evaluate their association with mental wellbeing.

Publications

10 25 50

publication icon
Martínez-Hernández I (2025) Modelling particle number size distribution: a continuous approach in Journal of the Royal Statistical Society Series C: Applied Statistics

publication icon
Martínez-Hernández I. (2025) Modelling particle number size distribution: a continuous approach in Journal of the Royal Statistical Society Series C

 
Title Model Code 
Description Code for pre-processing and model development for the source apportionment methods in Baerenbold et al. 2022 
Type Of Material Computer model/algorithm 
Year Produced 2022 
Provided To Others? Yes  
Impact We had several researchers reaching out to ask details about the published code as they wanted to use this for their research 
URL https://github.com/OBaerenbold/UFP-TIES/tree/v0
 
Description Conference presentation - Monica Pirani 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Other audiences
Results and Impact Monica was invited to give a presentation at the annual conference of the Italian Statistical Society. It was well attended by a mixture of early career (MSc, PhD) researchers, as well as more senior researchers (mostly statisticians), and policymakers.
Year(s) Of Engagement Activity 2022
 
Description Interview for an talian national newspaper 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Media (as a channel to the public)
Results and Impact Interview with a journalist for a piece on a study related to air pollution modelling and its health impact
Year(s) Of Engagement Activity 2023
URL https://www.repubblica.it/green-and-blue/2023/08/17/news/covid_effetto_lockdown_meno_morti_inquiname...
 
Description Panel on the health effects of climate change 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Professional Practitioners
Results and Impact Participated in an expert panel on the impact of climate change on health at the Grantham Institute (Imperial College). The audience included about 50 professionals/specialists in different aspects of climate change, but not necessarily on its health effects. The lively debate initiated a working group on this topic, and more specifically around biodiversity
Year(s) Of Engagement Activity 2025
 
Description Panellist at a session within the Turing AI conference - Marta Blangiardo 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other audiences
Results and Impact This was a panel discussion as part of the Turing Artificial Intelligence conference. It was an online event and we had more than 100 people who joined. The presentation was very well received with several questions and requests for follow up.
Year(s) Of Engagement Activity 2022
 
Description School visit (Hersham - Surrey) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Schools
Results and Impact Talk to year 6 children at Cardinal Newman Primary School, Hersham on how to use statistics for climate change research (including how to model air pollution). 60 children were present and were very engaging, we had 30 minutes of questions after the talk, with several questions about how to become a scientist and a statistician.
Year(s) Of Engagement Activity 2023
 
Description Short course at AIMS Rwanda 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Short course on the use of Bayesian spatio-temporal models for environmental and health research was run in Kigali (Rwanda) on the 26th Nov-1st Dec 2023. It involved 35 postgraduate students (MSc and PhD) and some policy makers from Rwanda Biomedical Centre. The course sparked additional collaborations in the form of grant proposals as well as short secondments.
Year(s) Of Engagement Activity 2023