Stochastic Processes and Partial Differential Equations for problems in Statistics and Machine learning

Lead Research Organisation: Imperial College London
Department Name: Mathematics

Abstract

The project will investigate the application of continuous time stochastic processes for optimisation and sampling problems. The aim is to clarify links between optimisation and sampling and the long term behaviour of stochastic processes, such as accelerated or self interacting diffusions. We will consider the properties and performance of appropriate time discretisations when intended for contemporary applications from Statistics and Machine learning, such as learning parameters of deep neural networks, high dimensional regression or Bayesian variable selection.

Publications

10 25 50

publication icon
Chak M (2023) Optimal friction matrix for underdamped Langevin sampling in ESAIM: Mathematical Modelling and Numerical Analysis

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/N509486/1 30/09/2016 30/03/2022
2129618 Studentship EP/N509486/1 30/09/2018 29/06/2022 Martin Chak
EP/R513052/1 30/09/2018 29/09/2023
2129618 Studentship EP/R513052/1 30/09/2018 29/06/2022 Martin Chak
 
Description A commonly used paradigm for sampling and optimisation methods is the Langevin dynamics. There has been evidence that the second order methods that use momentum are more effective in exploring the state space than first order methods and in a certain case in optimisation, there is clear theoretical proof that the inclusion of momentum improves the rate of convergence. This award has allowed research that explores the use of higher order dynamics, in particular, of the generalised Langevin dynamics, that introduce an auxiliary variable and is equivalent to incorporating memory of past information over the trajectory of the dynamics into the time evolution. This has given promising results that show performance at the level of and above the second order methods in the case of nonconvex optimisation.
Exploitation Route Higher order dynamics may be used as a gradient-based tool in high dimensional sampling and optimisation problems. These include the computation of numerical approximations to expectations with respect to probability measures such as the macroscopic quantities considered in molecular dynamics and finding optimal parameters to a given objective function such as those considered in machine learning.
Sectors Digital/Communication/Information Technologies (including Software)

Financial Services

and Management Consultancy

Manufacturing

including Industrial Biotechology

URL https://arxiv.org/abs/2003.06448