Uncertainty Quantification for Numerical Models with two Regions of Solution

Lead Research Organisation: University of Exeter
Department Name: Engineering Computer Science and Maths

Abstract

The use of complex numerical models in science and policy is now ubiquitous. For practical application predictions from such models need to be accompanied by estimates of uncertainty. Supplying these estimates is a difficult problem. Gaussian process emulation has proved to be a very effective solution A Gaussian process emulator is a statistical model of the numerical code that is fast to run so can be used to calculate uncertainty estimates, for example by Monte Carlo methods that are computationally impossible with the full model. Emulators have been used across a wide variety of applications from climate and air pollution through engineering and geology to systems biology. However there are numerical models where there are two or more solutions separated by bifurcations or tipping points. A simple example from climate science is the Stommel model which has a different solution for when the overturning circulation is turned on or off. We have other examples from climate, oil reservoir modelling and biology. This PhD will look at this problem developing methods to identify the areas of model input space where each solution holds and how to build separate, but linked, emulators for each solutions. The inverse problem where we have real world observations of some of the model outputs and these are used to make inferences about the model inputs, in effect running the model 'backwards', will also be considered. In addition the student will look at the problem of experimental design, where best to run the model with a limited computer budget, developing new sequential methods. Although the student will develop a general methodology it will be applied to a number of real world applications throughout the PhD.

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/N509656/1 01/10/2016 30/09/2021
1783352 Studentship EP/N509656/1 01/10/2016 31/03/2020 Louise Kimpton
 
Description My main area of research has been in uncertainty quantification. Complex numerical models are used in science to represent real life physical systems, and I am particularly interested in models with two distinct regions in output space where classification is required. For example, a computer model may fail to complete for specific input regions, and we'd like to predict where to avoid running the model, or incorrectly running an emulator. My first result was to produce a latent Gaussian process model for correlated classification.

A common classification method is logistic regression, which produces a distribution for the predictive class membership of being in either region. When sampling from this to make predictions, current practice is to draw from an independent Bernoulli distribution; drawing marginally loses any correlation between data and can result in large numbers of misclassifications. If simulating chains or fields of 0's and 1's, it is hard to control the 'stickiness' of like symbols. My current research is aimed at generating a correlated Bernoulli process using de Bruijn graphs to create chains of 0's and 1's, for which like symbols cluster together. De Bruijn Graphs are a generalisation of Markov chains, where the 'word' length controls the number of states that each individual state is dependent on, hence increasing correlation over a wider area.
Exploitation Route I am keen to publish my current work and hopefully one day to apply for funding for a postdoc or fellowship to carry on the work.
Sectors Digital/Communication/Information Technologies (including Software),Environment,Other