Probabilistic coupling: co-adaptedness, maximality and efficiency

Lead Research Organisation: University of York
Department Name: Mathematics

Abstract

My proposed research programme concerns the idea of coupling of Markov chains. Markov chains are random processes which satisfy the appealing quality that predictions about the future behaviour of the chain only depend upon its present state - knowing extra information about where the chain has been in the past does not help us to better predict the future. Markov chains have been intensely studied during the past hundred years, and are now used in a multitude of applications including weather forecasting, the analysis of card-shuffling techniques, and the PageRank method used by Google.

This investigation concerns the problem of constructing two copies of the same chain, started from different locations, so that once the two processes meet one another they stay together from that time onwards: this is known as a 'coupling' of the two chains. There is a range of possible coupling constructions available to us, and so a natural question to ask is how quickly it is possible to make the two chains meet. As well as being an interesting mathematical question, answers to this problem have significant implications for problems in theoretical computer science and statistical physics. The fastest couplings are known as 'maximal' couplings: we know that these couplings exist for a wide range of processes, but they are almost always difficult (if not impossible) to compute explicitly. This problem usually arises because the maximal coupling is often not co-adapted - loosely speaking, one chain has to use information about the future behaviour of the other in order to determine the next step of its trajectory. This makes co-adapted couplings (ones where no such 'cheating by looking into the future' is allowed) much more widely used in applications, but little is known about how good such couplings can be.

My proposed plan of research will investigate this problem, looking first of all at a particular class of Markov chains called random walks on groups. These random walks have a lot of nice structure, which can often be exploited to enable explicit computations to be performed. I will look at how good co-adapted couplings can be for these types of Markov chain, and how this is related to the underlying structure of the state-space. Another major objective of the project to is to look at maximal couplings in a more general setting, and try to understand when such couplings can be constructed in a co-adapted manner. The final aim of the proposal is to investigate the implications of the above results for 'perfect simulation' algorithms, many of which make use of co-adapted couplings to produce random samples from the exact stationary distribution of a Markov chain.

Planned Impact

The research to be undertaken in this project will be innovative and timely, and will benefit other mathematicians and the wider scientific community; there is also a high probability that some of the results will be of interest to the general public.

The proposed research deals with a theoretical area of probability, and so the people who will benefit the most from this work are fellow mathematicians, especially those working in the fields of coupling theory, Markov processes and perfect simulation algorithms. However, I also anticipate a great deal of interest in the work from scientists in other related fields.

Objective 1 of the proposal deals with the problem of constructing optimal co-adapted couplings for random walks on groups, and how the existence of such couplings is related to the structure of the underlying state space. There are already a lot of deep results concerning random walks on groups, which provide interesting links between probability, algebra and analysis: accordingly, it is to be expected that there will be a large audience for my work from pure mathematicians as well as fellow probabilists. In addition, Goal 1.1 of the proposal focuses on random walks on the symmetric group; these walks can be thought of as methods for shuffling a pack of playing cards, and results in this area are hence likely to be of interest to the general public.

Objectives 2 and 3 concern an investigation into the relationships between optimal co-adapted, efficient and maximal couplings. Co-adapted couplings are often used by those working in probability and theoretical computer science to establish convergence rates of Markov processes; for example, the concept of path-coupling has played an important role in analysing algorithms for approximating #P-hard problems such as counting graph colourings. A better understanding of how fast co-adapted couplings can be for any given process will thus be of great benefit to researchers tackling such problems.

Finally, Objective 4 seeks to understand the implications of the results from the first three objectives for perfect simulation algorithms. These algorithms are similar to the much-used method of Markov chain Monte Carlo (MCMC), in that they return a sample from the stationary distribution of an ergodic Markov chain. Unlike with MCMC however, the output from a perfect simulation algorithm is from the exact stationary distribution (rather than an approximation, hence the name), at the cost of a random run-time. These algorithms have been successfully used in a wide number of applications by probabilists, applied statisticians, computer scientists and statistical physicists during the past 15 years. It is therefore to be anticipated that any new results concerning the speed of such algorithms will be of benefit to a wide variety of scientists.

Publications

10 25 50
publication icon
Connor S (2019) Mixing times for exclusion processes on hypergraphs in Electronic Journal of Probability

publication icon
Connor S (2016) Perfect simulation of M/G/ c queues in Advances in Applied Probability

 
Description Perfect simulation methods can be used to understand complicated probability distributions which cannot be tackled theoretically. This is a topic that has received a considerable amount of attention over the past 20 years; recently a lot of effort has been focussed on simulation techniques for queueing systems. Consider a queueing system in which customers arrive at random times, each with some random amount of work to be completed by a team of servers. If the servers are on average completing the work faster than it is arriving then the number of customers will converge to some limiting distribution. But what does this look like? Very little can be calculated directly here, and so efficient simulation methods are called for. Understanding such systems is important as they are a common feature of many real-world scenarios. Example applications include people-management problems ("how should telephone enquiries to a call centre be routed to minimise the time that customers are on hold?"), and computer processor control ("how can multiple processors deal most efficiently with multiple tasks?"). In a recent paper ("Perfect simulation for M/G/c queues", Connor & Kendall, Advances in Applied Probability, 47, 2015), Kendall and I showed how to produce a perfect sample from the workload distribution of a stable multi-server queue under Poisson arrivals. This generalised previous work by Sigman, who had shown how this could be achieved under the much stronger assumption of super-stability (where a single server could deal with all customers without the queue length blowing up). Our algorithms make minimal assumptions about the distribution of service times, and our theoretical results are backed up by a variety of practical simulations.
Exploitation Route My work on perfect simulation algorithms for M/G/c queues has already been used by other authors in the development of similar algorithms for other types of queueing system. There remains potential for the work to be used in further queueing applications, both in theory and in practice, such as networks of queues. It may also be possible for the methods developed in my work to be generalised to other application areas where perfect simulation is suspected to be a practical method of simulation. In addition, I have since shown how to modify our perfect simulation algorithm to create an "omnithermal" algorithm, capable of sampling simultaneously from queues with differing numbers of servers.
Sectors Digital/Communication/Information Technologies (including Software)

Retail

 
Description LMS partial funding for "Developments in Coupling" Workshop
Amount £5,340 (GBP)
Funding ID WS-1112-01 
Organisation London Mathematical Society 
Sector Academic/University
Country United Kingdom
Start 08/2012 
End 09/2012