Optimisation-centric Generalisations of Bayesian Inference
Lead Research Organisation:
UNIVERSITY COLLEGE LONDON
Department Name: Statistical Science
Abstract
Large scale black box statistical models are ubiquitous in modern society; and aimed at providing a way to examine the behaviour of complex systems. For example, Improbable has helped design such models as part of the RAMP initiative to help the UK government predict the spread of the COVID-19 virus. In engineering, so-called 'digital twins' of real-world physical phenomena or assets are commonly used to conduct virtual stress tests and predict the behaviour of critical systems in the presence of exogenous shocks.
An important concern for these models is the nature of our uncertainty about their predictions and recommendations. Unlike for more traditional statistical analysis, the underlying models are often highly complex, not immediately interpretable, and often misspecified. As a consequence, standard Bayesian methods of uncertainty quantification derived under the assumptions of the traditional paradigm for statistical analysis are often inappropriate. More specifically, they often result in over-confidence and a lack of robustness.
To tackle this issue, generalised forms of Bayesian uncertainty quantification have recently been developed. Such methods can ensure robustness and reduce the computational burden relative to standard Bayesian methods. This makes them ideal for applications in simulation-based modelling scenarios---such as COVID-19 modelling or digital twins. Yet, to date they have not been used in this context and primarily enjoyed success in time-ordered problems (such as on-line learning, changepoint detection, or filtering and smoothing) as well as in Bayesian Deep Learning applications (such as Bayesian neural networks or deep Gaussian Processes). In spite of their promise however, both their foundational theoretical properties as well as their computation are under-explored topics of research.
In this fellowship, I will advance the theory, methodology, and application of generalised Bayesian posteriors that are defined implicitly through an optimisation problem. While such generalised Bayesian methods have shown great promise, a thorough investigation of this kind will be required if they are to be adopted more widely. As part of this, I will investigate the fundamental question of how one should choose between different generalised posteriors. Complementing this, I will devise methodology for Bayesian computation geared towards the special properties of these posteriors. I will then leverage the advances made as part of this research to apply them on two classes of high-impact problems that traditional Bayesian methods struggle with: models revolving around intractable likelihoods, and simulator-based inference. For the applied component of this research programme, I will draw on the expertise of my project partners and use generalised posteriors for better uncertainty quantification in 'digital twins', as well as applications of importance for national security---such as modelling the COVID-19 pandemic.
An important concern for these models is the nature of our uncertainty about their predictions and recommendations. Unlike for more traditional statistical analysis, the underlying models are often highly complex, not immediately interpretable, and often misspecified. As a consequence, standard Bayesian methods of uncertainty quantification derived under the assumptions of the traditional paradigm for statistical analysis are often inappropriate. More specifically, they often result in over-confidence and a lack of robustness.
To tackle this issue, generalised forms of Bayesian uncertainty quantification have recently been developed. Such methods can ensure robustness and reduce the computational burden relative to standard Bayesian methods. This makes them ideal for applications in simulation-based modelling scenarios---such as COVID-19 modelling or digital twins. Yet, to date they have not been used in this context and primarily enjoyed success in time-ordered problems (such as on-line learning, changepoint detection, or filtering and smoothing) as well as in Bayesian Deep Learning applications (such as Bayesian neural networks or deep Gaussian Processes). In spite of their promise however, both their foundational theoretical properties as well as their computation are under-explored topics of research.
In this fellowship, I will advance the theory, methodology, and application of generalised Bayesian posteriors that are defined implicitly through an optimisation problem. While such generalised Bayesian methods have shown great promise, a thorough investigation of this kind will be required if they are to be adopted more widely. As part of this, I will investigate the fundamental question of how one should choose between different generalised posteriors. Complementing this, I will devise methodology for Bayesian computation geared towards the special properties of these posteriors. I will then leverage the advances made as part of this research to apply them on two classes of high-impact problems that traditional Bayesian methods struggle with: models revolving around intractable likelihoods, and simulator-based inference. For the applied component of this research programme, I will draw on the expertise of my project partners and use generalised posteriors for better uncertainty quantification in 'digital twins', as well as applications of importance for national security---such as modelling the COVID-19 pandemic.
Organisations
- UNIVERSITY COLLEGE LONDON (Lead Research Organisation)
- Duke University (Collaboration, Project Partner)
- Newcastle University (Collaboration, Project Partner)
- University of Hong Kong (Collaboration)
- University College London (Collaboration)
- University of Adelaide (Collaboration)
- Monash University (Collaboration)
- Improbable Worlds Ltd (Project Partner)
- The Alan Turing Institute (Project Partner)
- University of Warwick (Project Partner)
- RIKEN (Project Partner)
People |
ORCID iD |
| Jeremias Knoblauch (Principal Investigator / Fellow) |
Publications
Altamirano M
(2023)
Robust and Scalable Bayesian Online Changepoint Detection
Altamirano M.
(2024)
Robust and Conjugate Gaussian Process Regression
Altamirano M.
(2023)
Robust and Scalable Bayesian Online Changepoint Detection
in Proceedings of Machine Learning Research
Charita Dellaporta
(2022)
Robust Bayesian inference for simulator-based models via the MMD
posterior bootstrap
in Proceedings of Machine Learning Research
Duran-Martin G.
(2024)
Outlier-robust Kalman Filtering through Generalised Bayes
Frazier T. D.
(2024)
The Impact of Loss Estimation on Gibbs Measures
H Husain
(2022)
Adversarial Interpretation of Bayesian Inference
in Proceedings of The 33rd International Conference on Algorithmic Learning Theory
Jaskari J
(2022)
Uncertainty-Aware Deep Learning Methods for Robust Diabetic Retinopathy Classification
in IEEE Access
Jaskari, J.
(2022)
Uncertainty-aware deep learning methods for robust diabetic retinopathy classification
in arXiv
| Description | Bayesian inference has been thoroughly modernised & advanced, and made fit for purpose for a range of machine learning applications |
| Exploitation Route | Some outcomes of this grant are already powering production code of tech companies, e.g. Meta's Bayesian Optimisation team and MTU's quality control team which is involved with building jet engines. Our methods have also been used by Idoven (for cardiac healthcare) and Matterhorn Studios (for self-driving lab design). There are also many other ways to apply this work. |
| Sectors | Aerospace Defence and Marine Digital/Communication/Information Technologies (including Software) Healthcare |
| Description | This is already elaboraetd upon in key findings. Used in various companies' live production code. |
| Sector | Aerospace, Defence and Marine,Digital/Communication/Information Technologies (including Software),Healthcare |
| Description | 2017 Enrichment Scheme |
| Amount | £1 (GBP) |
| Funding ID | TU/D/000009 |
| Organisation | Alan Turing Institute |
| Sector | Academic/University |
| Country | United Kingdom |
| Start | 09/2017 |
| End | 09/2018 |
| Description | Bloomberg PhD Fellowship |
| Amount | $150,000 (USD) |
| Organisation | Bloomberg |
| Sector | Private |
| Country | United States |
| Start | 09/2024 |
| End | 10/2025 |
| Title | Generalised Bayesian Inference for Discrete Intractable Likelihood |
| Description | Discrete state spaces represent a major computational challenge to statistical inference, since the computation of normalisation constants requires summation over large or possibly infinite sets, which can be impractical. This paper addresses this computational challenge through the development of a novel generalised Bayesian inference procedure suitable for discrete intractable likelihood. Inspired by recent methodological advances for continuous data, the main idea is to update beliefs about model parameters using a discrete Fisher divergence, in lieu of the problematic intractable likelihood. The result is a generalised posterior that can be sampled from using standard computational tools, such as Markov chain Monte Carlo, circumventing the intractable normalising constant. The statistical properties of the generalised posterior are analysed, with sufficient conditions for posterior consistency and asymptotic normality established. In addition, a novel and general approach to calibration of generalised posteriors is proposed. Applications are presented on lattice models for discrete spatial data and on multivariate models for count data, where in each case the methodology facilitates generalised Bayesian inference at low computational cost. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| URL | https://tandf.figshare.com/articles/dataset/Generalised_Bayesian_Inference_for_Discrete_Intractable_... |
| Title | Generalised Bayesian Inference for Discrete Intractable Likelihood |
| Description | Discrete state spaces represent a major computational challenge to statistical inference, since the computation of normalisation constants requires summation over large or possibly infinite sets, which can be impractical. This paper addresses this computational challenge through the development of a novel generalised Bayesian inference procedure suitable for discrete intractable likelihood. Inspired by recent methodological advances for continuous data, the main idea is to update beliefs about model parameters using a discrete Fisher divergence, in lieu of the problematic intractable likelihood. The result is a generalised posterior that can be sampled from using standard computational tools, such as Markov chain Monte Carlo, circumventing the intractable normalising constant. The statistical properties of the generalised posterior are analysed, with sufficient conditions for posterior consistency and asymptotic normality established. In addition, a novel and general approach to calibration of generalised posteriors is proposed. Applications are presented on lattice models for discrete spatial data and on multivariate models for count data, where in each case the methodology facilitates generalised Bayesian inference at low computational cost. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| URL | https://tandf.figshare.com/articles/dataset/Generalised_Bayesian_Inference_for_Discrete_Intractable_... |
| Description | Chris Oates / Takuo Matsuara |
| Organisation | Newcastle University |
| Country | United Kingdom |
| PI Contribution | theory and development of methodology (focus on generalised Bayes) |
| Collaborator Contribution | theory and development of methodology (focus on kernel methods) |
| Impact | JRSSB publication JASA publication |
| Start Year | 2021 |
| Description | David Dunson |
| Organisation | Duke University |
| Country | United States |
| Sector | Academic/University |
| PI Contribution | expertise on generalised Bayes |
| Collaborator Contribution | resources; expertise on statistical sciences |
| Impact | Publication at the Journal of American Statistical Association in 2025 ('Robustifying likelihoods by optimistically re-weighting data') |
| Start Year | 2023 |
| Description | David Frazier |
| Organisation | Monash University |
| Country | Australia |
| Sector | Academic/University |
| PI Contribution | expertise in generalised Bayesian methodology |
| Collaborator Contribution | expertise in Bayesian asymptotics |
| Impact | Paper on asymptotics of generalised Bayesian methods with estimated losses |
| Start Year | 2023 |
| Description | Dino Sejdinovic |
| Organisation | University of Adelaide |
| Country | Australia |
| Sector | Academic/University |
| PI Contribution | expertise on generalised / post-Bayesian methodology |
| Collaborator Contribution | PhD student (Veit Wild), resources (travel expenses paid), and expertise on kernel methods / functional analysis |
| Impact | NeurIPS oral paper led by Veit Wild JMLR submission led by Veit Wild |
| Start Year | 2023 |
| Description | Edwin Fong |
| Organisation | University of Hong Kong |
| Country | Hong Kong |
| Sector | Academic/University |
| PI Contribution | Expertise in generalised Bayesian inference |
| Collaborator Contribution | Expertise in nonparametric Bayesian statistics & Martingale posteriors |
| Impact | Biometrika submission on the predictive influence of the so-called learning rate parameter |
| Start Year | 2023 |
| Description | FX |
| Organisation | University College London |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | Expertise in generalised Bayesian methods |
| Collaborator Contribution | Expertise on kernel methods and Gaussian process |
| Impact | 6 papers to date |
| Start Year | 2021 |
| Description | Creation of Post-Bayesian seminar series |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Professional Practitioners |
| Results and Impact | fortnightly international remote seminar series on the advances in post-Bayesian analysis. High profile lectures by leaders in the field.(https://postbayes.github.io/seminar/) |
| Year(s) Of Engagement Activity | 2025 |
| URL | https://postbayes.github.io/seminar/ |
| Description | Inaugural post-Bayesian workshop @ UCL |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Professional Practitioners |
| Results and Impact | first workshop on post-Bayesian methods; 2-day event organised 15./16. of May 2025. |
| Year(s) Of Engagement Activity | 2025 |
| URL | https://postbayes.github.io/workshop2025/ |
| Description | Workshop on robust Bayesian methods @ BayesComp Singapore |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Professional Practitioners |
| Results and Impact | 2-day satellite workshop on robust Bayesian methods at BayesComp, the biannual meeting of the Section on Bayesian Computation of the International Soceity of Bayesian analysis |
| Year(s) Of Engagement Activity | 2025 |