Robust and scalable Markov chain Monte Carlo for heterogeneous models

Lead Research Organisation: University College London

Department Name: Statistical Science

Abstract

A large proportion of statistical inference tasks can be framed as either an optimisation or an integration problem. Markov chain Monte Carlo (MCMC) algorithms can be used to solve both, but are most commonly used for the latter. They have been successful in such important and diverse settings as the observation of gravitational waves, modelling the spread of infectious diseases, and predicting the results of elections from political polling data. MCMC algorithms are also popular outside of statistical inference, in particular their use is widespread for molecular dynamics simulations in statistical physics.

Despite their numerous successes, current MCMC algorithms has some known drawbacks. A prominent example is their performance when model parameters vary over very different scales and exhibit multiple levels of inter-dependence (heterogeneous models). There is an increasingly urgent need to improve this performance as high volume and highly heterogeneous datasets become more and more available, and as researchers begin to ask progressively more nuanced questions from their data for which heterogenous models are needed.

The standard approach to MCMC for heterogeneous models is through adaptive pre-conditioning of algorithms. Doing this naively in a high-dimensional setting comes at a significant cost (the required number of operations per algorithm step is often cubic in the number of model parameters, and the number of algorithmic tuning parameters to learn is quadratic). In addition, current state of the art algorithms such as Hamiltonian and Langevin Monte Carlo work particularly poorly in combination with the technique, as has recently been shown both theoretically and experimentally by myself and others. In this proposal I will attack this problem on two fronts.

In the first work package I will develop and study a new suite of MCMC algorithms that are specifically tailored to heterogeneous models. I will do this by designing algorithms based on the recently derived class of Markov processes termed 'locally-balanced', for which there is considerable evidence of improved robustness to model heterogeneity. I will provide a rigorous foundation for this class of Markov processes, establish key theoretical properties on convergence to equilibrium and optimality, and then design new algorithms based on this class of processes, each tailored towards specific application areas of known interest.

In the second work package I will develop new theoretically grounded methodology for scalable adaptive pre-conditioning of algorithms. I will do this in part by taking inspiration from the literature on sparse estimation of covariance matrices for high-dimensional datasets. I will design methods that are both scalable to high-dimensional settings and for which theoretical guarantees can be established, to provide a clear indication of expected performance gains. This should improve the applicability of existing state of the art methods such as Hamiltonian Monte Carlo to the high-dimensional and heterogeneous model setting.

There is a keen focus on integrating new methods within widely used statistical software within the proposal. To this end, I have planned collaborations with the founding developers of the 'Stan' statistical programming language, which has over 100,00 users, as well as detailed plans to create bespoke open source packages in software such as R and Python. I also outline plans to work closely with data scientists to apply the new methodology in many prominent application areas.

Funded Value:

£207,950

Funded Period:

Jul 22 - Oct 24

Funder:

EPSRC

Project Status:

Active

Project Category:

Research Grant

Project Reference:

EP/V055380/1

Principal Investigator:

Samuel Livingstone

Research Subject:

Mathematical sciences (100%)

Research Topic:

Statistics & Appl. Probability (100%)

Organisations

People	ORCID iD
Samuel Livingstone (Principal Investigator)	http://orcid.org/0000-0002-7277-086X

Publications

Author Name

Title Publication Date Published

10 25 50

Liang X (2022) Adaptive random neighbourhood informed Markov chain Monte Carlo for high-dimensional Bayesian variable selection in Statistics and Computing

Vogrinc J (2023) Optimal design of the Barker proposal and other locally balanced Metropolis-Hastings algorithms in Biometrika

Collaboration


Description	Visit to Flatiron institute New York
Organisation	Simons Foundation
Department	Flatiron Institute
Country	United States
Sector	Academic/University
PI Contribution	I visited the institute for 2 weeks, gave a seminar, attended a workshop for 3 days, and discussed several research ideas with the staff. We are currently exploring projects related to these.
Collaborator Contribution	Sharing of research ideas, knowledge exchange.
Impact	No outputs yet, collaborations still ongoing.
Start Year	2022

Abstract

Organisations

People

ORCID iD

Publications