New developments in non-reversible Markov chain Monte Carlo

Lead Research Organisation: Lancaster University
Department Name: Mathematics and Statistics

Abstract

The exploration performed by a Markov chain Monte Carlo (MCMC) algorithm can be likened to the exploration of some interesting terrain. Traditional MCMC is `reversible': the simplicity of this condition has facilitated the huge number of extensions and variations on the standard MCMC algorithm that are available today; however reversibility also implies that on relatively flat terrain (and in real, high-dimensional applications only one direction is `uphill', with all other directions relatively flat), an MCMC `walker' loses their sense of direction so that their path becomes erratic and the exploration slow. By contrast, non-reversible MCMC keeps a sense of direction even over flat terrain. Current non-reversible algorithms come in two main flavours: one imagines a drone flying in a straight line above the terrain and occasionally changing direction so as to keep above the higher regions; the other inverts the terrain and imagines kicking a ball along it in a random direction. Both of these methods have great potential, but also practical problems that limit their usability. Drawing on both methods, this project will create new non-reversible algorithms which are much more efficient than standard, reversible, MCMC and can be applied across a wide variety of contexts; it will also create easy-to-use software for statistical practitioners.

MCMC is used for the statistical analysis of complex data sets across a huge range of applications, from finance and fraud detection, through understanding, predicting and intervening in the spread of infectious diseases, to understanding the location of dark matter in the universe, and our work will benefit anyone analysing complex datasets in these and many other areas.

Planned Impact

Markov chain Monte Carlo (MCMC) is the tool of choice for statistical inference across a huge range of applications, such as those mentioned in the summary. The results of the analyses are used, for example, in deciding upon interventions to prevent the spread of infectious diseases or interventions to prevent further banking fraud, or in understanding molecular pathways and designing molecules. The more iterations of the MCMC algorithm that are performed, the more accurate and reliable the final answer, but the actual number of iterations required for `sufficient' accuracy depends on the efficiency of the underlying algorithm. All applications of MCMC are limited by finite time and finite computing resources, and for many modern, complex applications using currently-available algorithms this translates into unacceptable levels of uncertainty. Our algorithms will lead to orders of magnitude increases in the efficiency of MCMC across a wide range of applications and our easy-to-use software will enable statistical practitioners to provide clear, timely evidence to facilitate policy decisions.

Our "pathways to impact" describes a range of mechanisms for maximising the impact from this grant. Most important is the development of semi-automatic software, as mentioned above. In addition, in the final year of the grant we will also hold a workshop that will mix researchers working on non-reversible MCMC with statistical practitioners who could benefit from the efficiency of our algorithms and software, and whose specific needs for efficiency gains could also inform the final work on the grant. The research will also be disseminated at conferences and in publications. Finally, by the end of the project the PDRA will be ready to take their place working in a key national-need area, having been trained in the collaborative development of statistical methodology and in publishing related articles; in the development and documentation of associated, user-friendly statistical software; and in interaction with end users, including scientists and applied statisticians.

Publications

10 25 50
publication icon
C. Sherlock (2023) The Apogee to Apogee Path Sampler in Journal of Computational and Graphical Statistics

publication icon
Sherlock C (2023) The Apogee to Apogee Path Sampler in Journal of Computational and Graphical Statistics

publication icon
Sherlock C (2022) A discrete bouncy particle sampler in Biometrika

 
Description Three papers have been published in statistics journals:
C.Sherlock and A.Thiery; A discrete bouncy particle sampler. Biometrika (2021).
M.Ludkin and C.Sherlock; Hug and Hop: a discrete-time, non-reversible Markov chain Monte Carlo algorithm. Published in Biometrika (2022).
C. Sherlock, S.Urbas and M.Ludkin; The Apogee to Apogee Path Sampler. Accepted for publication in the Journal of Computational and Graphical Statistics

In August 2021, a week-long workshop (online because of Covid) on the grant topic was held at the Lorentz Centre, Netherlands. It was organised by the PI and three other academics (at Bristol, Delft and Viginia). There were 65 attendees from across the world and across physics/probability/statistics. Both from feedback and my own observations, the workshop went extremely well.
Exploitation Route The algorithms could be adopted by researchers who need to use MCMC-tools to tackle complex inference problems in many areas such as physics, econometrics and health.
Sectors Environment,Other

 
Description Workshop at the Lorenz Centre, Leiden, Netherlands 
Organisation Lorentz Centre
Country Netherlands 
Sector Academic/University 
PI Contribution Christophe Andrieu (Bristol, UK), Joris Bierkens (VU Amsterdam, Netherlands), Matt Ludkin (Lancaster, UK, RA on this project) and Marija Vucelja (Virginia, USA) and I submitted a successful bid for a full-week workshop on Non-reversible Markovian Monte Carlo at the Lorenz Centre.
Collaborator Contribution The workshop will take place July 27-31 2020. The Lorenz Centre estimates the value of the venue for the week, and the administrative support before during and after the event at around 25000 Euros.
Impact At time of writing the workshop details are not yet online at the above URL, but they will appear soon. The workshop is multidisciplinary, it will bring together statisticians, applied probabilists, physicists and practitioners.
Start Year 2020
 
Description Workshop on Non-reversible Markovian Monte Carlo at the Lorentz Centre, Netherlands 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact The workshop ran from the 2nd-6th August 2021. It was intended to be in-person at the Lorentz Centre in the Netherlands but, because of Covid-19, was online, though still hosted by the Lorentz Centre. The organisers were myself, Christophe Andrieu (Bristol), Joris Bierkens (TU Delft) and Marija Vucelja (Virginia).

Historically, research into non-reversible Markovian Monte Carlo has been conducted by small, relatively isolated groups of probabilists, statisticians and physicists dotted around the world. There has been relatively little contact between groups and between disciplines. Indeed the three disciplines have very different jargon, notation and intuitions for the same fundamental concepts. This workshop brought together researchers from the different groups and disciplines to share their knowledge and understanding. The aim was to boost communication and foster new cross-disciplinary collaborations, thus providing new impetus to the development of the field.
Year(s) Of Engagement Activity 2021
URL https://www.lorentzcenter.nl/non-reversible-markovian-monte-carlo-2021.html