📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

DMS-EPSRC: Fast martingales, large deviations and randomised gradients for heavy-tailed target distributions

Lead Research Organisation: University of Warwick
Department Name: Statistics

Abstract

Markov chain is a mathematical object representing a random evolution with the following property: if we know the present state of the chain, its past and future are independent (i.e. information about the past does not alter the distribution of its future states). Markov chain models are fundamental across sciences and engineering. At the centre of this project are Markov chains on multi-dimensional state spaces that arise in randomised algorithms used in statistics and machine learning. This proposal is focused on the theoretical analysis of chains arising in applications in the case when their limiting distribution has heavy tails. The analysis of the heavy-tailed phenomena is crucial for the future success of randomised algorithms for two reasons: (a) they arise naturally in many applied problems and (b) are least well understood as they violate standard assumptions made in the existing theory (e.g. asymptotic linearity of the potential of the limit distribution at infinity).

(a) Heavy-tailed limiting distributions arise naturally in many applications. For example, if the errors in a regression model are distributed according to a Cauchy distribution, the posterior density has polynomial tails. Perhaps a more startling fact is that heavy tails can arise in the posterior even though a heavy-tailed distribution does not appear in the definition of a model. If the errors in a data set are heteroscedastic, meaning that the variance of the error term varies with each observation, it is necessary to use the so-called robust regression (based on e.g. Lasso-type penalisation) in order to reduce the effect of the outliers. Again the posterior has heavy tails.

(b) The presence of a spectral gap is known to be equivalent to geometric convergence of a Markov chain. However, as pointed out recently in the queueing literature, under geometric convergence ergodic estimators may still exhibit large deviation behaviour of the heavy-tailed type. Conversely, Markov chains with heavy tailed stationary measures typically do not have a spectral gap but might nevertheless exhibit good convergence properties. The EPSRC-NSF Lead Agency agreement presents a unique opportunity to combine the US expertise in theoretical Operations Research with the UK's capability in Computational Statistics, resulting in novel methodology for the analysis of the convergence of Markov chains with heavy-tailed targets, the main focus for this project.

Our main goal is to fill the gap in the literature, best illustrated by the following baseline algorithm from applications: a random-scan Metropolis-within-Gibbs chain picks randomly a coordinate of a target distribution and moves it by a one-dimensional Metropolis step based on the conditional of the target. It is possible to prove that if ANY one-dimensional marginal of the target has heavy tails, the random-scan chain is NOT geometrically ergodic. The main goal of this proposal is to lay the theoretical foundations for the analysis of the stability of Markov chains with heavy-tailed targets, focusing on the processes that underpin many randomised algorithms used in practice. In time, this work is expected to have impact far beyond applied probability in a number of sub-areas of computational statistics and machine learning where heavy-tailed targets arise.

Publications

10 25 50
 
Description New research directions have been identified and significant progress made.

1. Levy processes (LPs) are arguably the most fundamental class of stochastic processes with heavy tails as they are continuous-time analogues of random walks. LPs were introduced in the first half of the twentieth century. They have been extensively studied since and have played a major role in many application areas in the sciences and beyond. In this project we made a surprising discovery of a new representation of the law of a Levy process, based on a probabilistic structure (called the "stick-breaking construction"). This new way of looking at LPs greatly simplified the classical theory, yielding much more directly deep results that previously required 150 pages of a monograph to establish. It also enabled us to prove new results in this classical field. Our main motivation was simulation of certain path statistics of Levy processes with heavy tails, that arise in applications, and it was precisely this angle, which is the focus of this grant, that made these advances possible.

2. High-dimensional heavy-tailed invariant measures and rates of convergence. The convergence of Markov processes with high-dimensional stationary distributions, especially those with heavy tails, is known to be slow in practice and notoriously difficult to analyse. We developed a criterion for establishing lower bounds on the convergence rate in such cases. The key feature of this advance is that it is applicable in practice (in the same way the classical theory for upper bounds on convergence are). It is thus expected to be useful in the design of simulation algorithms, as it will for the first time allow a direct comparison of convergence rates, which is not feasible if only upper bounds are available. A further contribution highlight of the project is a new algorithmic approach for MCMC samplers that maps the original high-dimensional problem in Euclidean space onto a sphere thus remedying the notorious mixing problems for heavy-tailed target distributions. The proposed samplers may enjoy the "blessings of dimensionality", where convergence is faster in higher dimensions.

3. New limit theorems have been established, quantifying the rate of convergence in the stable domain of attraction (in the context of 1 above) and the rate of convergence in stationarity for Markov processes and randomised algorithms (in the context of 2 above). Both sets of results are novel and surprising, with potential for use in applications in applied probability.
Exploitation Route The work in this project, while theoretical in nature, can be taken forward to advance the simulation methods for Levy processes used in practice as well as help provide novel approaches to simulation problems within Machine Learning that require sampling from heavy-tailed target distributions. We are still at a relatively early stage of the theoretical developments and expect to report on the implications of this research in more detail in the years to come.
Sectors Digital/Communication/Information Technologies (including Software)

Energy

Financial Services

and Management Consultancy

Healthcare

Security and Diplomacy

URL https://www.youtube.com/@prob-am7844
 
Description Competing diffusive particle systems; short graduate course delivered by Professor Ioannis Karatzas (Columbia University)
Geographic Reach National 
Policy Influence Type Influenced training of practitioners or researchers
URL https://warwick.ac.uk/fac/sci/statistics/research/probability-at-warwick/karatzas/
 
Description Long time behavior of the infinite Atlas model: local stability, ergodicity, and equilibrium fluctuations; short graduate course delivered by Professor Amarjit Budhiraja (University of North Carolina at Chapel Hill)
Geographic Reach Local/Municipal/Regional 
Policy Influence Type Influenced training of practitioners or researchers
URL https://warwick.ac.uk/fac/sci/statistics/research/probability-at-warwick/budhiraja/
 
Description HEAVY TAILS IN MACHINE LEARNING - 1 month INI Satellite Programme on applied probability and machine learning, to take place at The Alan Turing Institute
Amount £75,000 (GBP)
Organisation Isaac Newton Institute for Mathematical Sciences 
Sector Academic/University
Country United Kingdom
Start 05/2024 
End 07/2024
 
Description Competing diffusive particle systems; short graduate course delivered by Professor Ioannis Karatzas (Columbia University) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Postgraduate students
Results and Impact This was a 5 hour lecture course by a world leading expert in the area of stochastic modelling. The lectures were very well attended and made an impact on numerous graduate students' research agendas.

Abstract:

We introduce and study stable multidimensional diffusions interacting through their ranks. The interactions give rise to invariant measures in broad agreement with stability properties of large equity markets observed over long time-periods. The probabilistic models we present assign drifts and variances that depend on both the name (identity) and the rank (according to capitalization) of each individual asset; they are able realistically to capture critical features of the stability in capital distribution, yet are simple enough to allow rather detailed analytical study.

The methodologies in this study touch upon the question of triple points for systems of competing diffusive particles; in particular, some choices of parameters may permit triple (or higher-order) collisions to occur. We show, however, that such multiple collisions have no effect on any of the stability properties of the resulting system. This is accomplished through a detailed analysis of collision local times.

The models have connections with the analysis of Queueing Networks in heavy traffic, with multi-dimensional diffusions reflected off the faces of the positive orthant, and with competing particle systems in Statistical Mechanics (e.g., Sherrington- Kirkpatrick model for spin-glasses). Their hydrodynamic-limit behavior is governed by generalized porous medium equations with convection, and the fluctuations around these limits by appropriate linear stochastic partial differential equations of parabolic type with additive noise. Whereas, limits of a different kind display phase transitions and are governed by Poisson-Dirichlet distributions. We survey progress on some of these fronts, and suggest open problems for further study.

Most of the models have invariant probability densities of the simple finite-sum-of-products-of exponentials type. We study also systems of interacting particles in which some of the variances-by-rank are allowed to degenerate, and for which the invariant measures of interest are not at all straightforward to compute. We report very recent work on such degenerate three-particle systems, for which very explicit invariant measure computations are possible; via appropriate Carleman-type boundary value problems for the associated Laplace transforms, and via Jacobi-type theta functions for the invariant densities themselves. The full extent and scope of these methodologies are yet to be explored.

(Joint work with Drs. E. Robert Fernholz, Tomoyuki Ichiba, Mykhaylo Shkolnikov, Vilmos Prokaj, Andrei Sarantsev, Sandro Francheschi and Killlian Rachel.)
Year(s) Of Engagement Activity 2024
URL https://warwick.ac.uk/fac/sci/statistics/research/probability-at-warwick/karatzas/
 
Description Living Proof Podcast - Exploring anomalous diffusion: An Interview with Aleks Mijatovic and Codina Cotar 
Form Of Engagement Activity A broadcast e.g. TV/radio/film/podcast (other than news/press)
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Living Proof Podcast - Exploring anomalous diffusion: An Interview with Aleks Mijatovic and Codina Cotar

In this episode of Living Proof, Dan Aspel speaks to Aleks Mijatovic (Warwick, Alan Turing Institute) and Codina Cotar (UCL) about the Stochastic systems for anomalous diffusion programme.
Year(s) Of Engagement Activity 2024
URL https://www.newton.ac.uk/media/podcasts/post/63-exploring-anomalous-diffusion-an-interview-with-alek...
 
Description Long time behavior of the infinite Atlas model: local stability, ergodicity, and equilibrium fluctuations; short graduate course delivered by Professor Amarjit Budhiraja (University of North Carolina at Chapel Hill) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Postgraduate students
Results and Impact This is a short course for PhD students on a very active area in probability theory, delivered by one of the world leading experts in the field.

Title: Long time behavior of the infinite Atlas model: local stability, ergodicity, and equilibrium fluctuations
Abstract:
The infinite Atlas model describes a countable system of competing Brownian particles where the lowest particle gets a unit upward drift and the rest evolve as standard Brownian motions. In this mini-course I will give an overview of recent developments in the study of long-time behavior of this infinite dimensional Markov process. Topics that I plan to cover include, ergodic invariant measures of the infinite Atlas model, domain of attraction properties, and a class of stochastic partial differential equations describing equilibrium fluctuations. This is based on joint work with Sayan Banerjee and Peter Rudzis.
Year(s) Of Engagement Activity 2025
URL https://warwick.ac.uk/fac/sci/statistics/research/probability-at-warwick/budhiraja/
 
Description Prob-AM, a YouTube channel for dissemination of research outputs 
Form Of Engagement Activity Engagement focused website, blog or social media channel
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Short YouTube videos aimed at researchers and students (graduate and undergraduate). The main purpose is to extend awareness of my research by lowering the time/effort barrier for understanding and using the research output.
Year(s) Of Engagement Activity 2023,2024,2025
URL https://www.youtube.com/channel/UCXSoLS_uKebYZ9GzgAF0ZsA