New approaches to Gibbs measures at the interface of probability and computational complexity

Lead Research Organisation: University of Birmingham
Department Name: School of Mathematics

Abstract

The concept of a "Gibbs distribution" - developed during the rise of statistical mechanics in the 19th century by Clausius, Maxwell, Boltzmann, Hamilton, and Gibbs - defines a probability distribution over configurations of particles based on an energy function governed by local interactions. Not only have Gibbs distributions proved invaluable in describing the behaviour of physical systems, but the simple framework of Gibbs distributions has become ubiquitous in fields far from statistical physics under the name "probabilistic graphical models": these fields include Bayesian statistics, optimisation, machine learning, mathematical biology, artificial intelligence, and many others.

Gibbs distributions are remarkably effective because they respect the underlying structure of a complex system: probabilistic dependencies are specified by a simple network structure, corresponding to the network indicating which components of the system interact.

Although the specification of a Gibbs distribution is very simple, the resulting probability distribution on configurations can be staggeringly complex. Typically the number of possible configurations grows as an exponential function of the system size, and so it is computationally hopeless to enumerate and calculate exact statistics of the system. Nevertheless, it is sometimes possible to understand all of the relevant global information about the system - the macroscopic `observables' - with simple and efficient algorithms, and this is why probabilistic graphical models arise so often in practical applications.

This research proposal aims to develop new rigorous analytic and computational methods for understanding Gibbs distributions, and in particular, the physical heuristics that underly two associated algorithms, Markov Chain Monte Carlo and Belief Propagation.

The proposal involves three interrelated goals.

The first is to make rigorous an detailed family of predictions from statistical physics about Gibbs distributions on random, or "mean-field", networks. Such random networks are used as both an approximation of physical systems and as a model of real-life networks, but have the advantage of being more amenable to analysis.

The second goal is to prove mathematically that the original mathematical model of a fluid, the "hard sphere model", exhibits a freezing transition from liquid to solid. The model, which is purely geometric, without any forces between molecules except for excluded volume, dates back to at least Boltzmann's time but has proved extremely resistant to rigorous analysis.

The third and final goal is to study the extremes of Gibbs distributions: which networks maximise or minimise certain statistics? Besides delineating the boundaries of what is possible in a given Gibbs distribution, these questions also have connections to long-standing open problems in combinatorics.

Planned Impact

The proposed research promises significant long-term practical impact, based on advancing the understanding of two widely used algorithms: Belief Propagation and Markov Chain Monte Carlo. These are simple, very general algorithms which are used in many practical applications from engineering, to Bayesian statistics, to economics, to data mining. The various uses of these algorithms often fall under the name of "Machine Learning": automated procedures for computers to `learn' or infer underlying truths from large amounts of noisy data.

The algorithms are used to perform inference on and to sample from probabilistic graphical models: models in which dependencies between random variables follow the structure of a network. These models originated in theoretical physics long before computers were even invented, but have proved hugely important due to their simple formulation and the availability of simple algorithms (whose origins again lie in statistical physics) to perform inference.

What is needed is a greater understanding of the conditions under which these algorithms will perform as intended. In general, these algorithms are simply heuristics - used without any guarantees on their performance. By developing a deeper understanding of the fundamentals of the models from statistical physics, the research from this proposal can help classify the conditions under which efficient algorithmic inference is feasible or not.

The focus of Aim 1 of the proposal is on models generated at random (mean field models) which are important in the study of algorithms as a source of hard computational problems. By gaining a precise understanding of the relationship between various phase transitions in the models and the performance of message passing algorithms we can begin to understand the structures that impede algorithmic performance.

The focus of Aim 2 is on models with the geometric structure of Euclidean space - the geometry that arises in physical systems.

Finally, the focus of Aim 3 is on extremal structures: given a set of constraints, which graphical structures maximise or minimise certain important parameters. The practical use of such results is in providing a priori bounds on parameters in the course of algorithmic design and analysis.


To ensure that research from this proposal does in fact reach practitioners and impact real-world algorithms, I will submit papers to and present work at leading Machine Learning conferences (such as NIPS and COLT) at which both academics and practitioners interact, as well as conferences and journals in both mathematics and computer science.

One final important aspect of delivering impact from this proposal is the communication of ideas and results between these different scientific communities, each of whom have their own terminology and perspective on the same general models and problems. By actively pursuing interdisciplinary collaboration and continuing to speak in conferences and seminars in different fields, I will contribute to the exchange of ideas between computer science, machine learning, statistical physics, probability and combinatorics.

Publications

10 25 50
publication icon
Aubin B (2019) Storage capacity in symmetric binary perceptrons in Journal of Physics A: Mathematical and Theoretical

publication icon
Bernshteyn A (2019) A short nonalgorithmic proof of the containers theorem for hypergraphs in Proceedings of the American Mathematical Society

publication icon
Coja-Oghlan A (2018) Information-theoretic thresholds from the cavity method in Advances in Mathematics

publication icon
Coja-Oghlan A (2019) Bethe States of Random Factor Graphs in Communications in Mathematical Physics

publication icon
Davies E (2018) Extremes of the internal energy of the Potts model on cubic graphs in Random Structures & Algorithms

publication icon
Davies E (2018) Tight bounds on the coefficients of partition functions via stability in Journal of Combinatorial Theory, Series A

publication icon
Davies E (2017) On the average size of independent sets in triangle-free graphs in Proceedings of the American Mathematical Society

 
Description The research proposal consisted of three main parts: 1) developing rigorous mathematical tools to pursue the cavity method from statistical physics in the context of randomized computational problems 2) using the theory of Gibbs measures to prove theorems in extremal combinatorics 3) understanding the "hard sphere" model from statistical physics, a simple model of a gas. We have made significant progress in all three areas, as I will describe briefly. 1) with A. Coja-Oghlan we have verified one of the tenets of the physicists "replica symmetry breaking hypothesis"; the preprint is titled "Bethe states of random factor graphs" and describes how Gibbs measures on random factor graphs can be decomposed into a small number of probability measures that are simple local and globally. 2) with E. Davies, M. Jenssen, and B. Roberts we have developed our methods for applying Gibbs measures to extremal graph theory problems further - we have shown how to use the notion of "stability" - that any near optimizer of an extremal problem must be close in some sense to the unique optimizer - to prove exact extremal results on the coefficients of important graph polynomials (e.g. the independence polynomial and the matching polynomial) (preprint "Tight bounds on the coefficients of partition functions via stability"). 3) With M. Jenssen and F. Joos, we have used the statistical physics based methods from the research proposal to prove new results in high dimensional Euclidean geometry. First, a lower bound on the entropy of dense sphere packings (preprint "On the hard sphere model and sphere packings in high dimensions"), and then a new lower bound on the kissing number of spheres in high dimensions (preprint "On kissing numbers and spherical codes in high dimensions").
Exploitation Route We have developed some very general mathematical tools that we hope will be of use to other mathematicians, as well as computer scientists and statistical physicists. Two specific examples of such tools are 1) a canonical decomposition of probability measures on discrete cubes into "extremal measures" (joint with A. Coja-Oghlan). 2) a statistical-physics based proof technique for understanding sphere packings and error correcting codes (joint with M. Jenssen and F. Joos). One benefit of both of these tools is that they are fairly simple to understand and very broadly applicable, so we envision they will have many further uses beyond what we have used them for so far.
Sectors Digital/Communication/Information Technologies (including Software),Education

URL http://willperkins.org/
 
Description The methods that have been developed in this project have led to new approaches to combinatorial problems which utilise methods and results from statistical physics. It is usually the case that mathematics provide the language of physics. However, here we see the opposite happening: it is the physical theory which influences purely mathematical problems.
First Year Of Impact 2018
Sector Other