Uncertainty Estimation and Assessment in Machine Learning

Lead Research Organisation: University of Cambridge

Department Name: Engineering

Abstract

This project relates to EPSRC key research areas "Statistics and applied probability" and "Artificial intelligence technologies". The goal is to explore rigorous ways of assessing uncertainty estimates, and to improve current tools with respect to these metrics. From credit scoring to collision prevention in self-driving cars, reliable uncertainty estimation is crucially important for ethical and safe deployment of machine learning algorithms.

Design of flexible and scalable uncertainty estimators in machine learning has recently received much attention. Unfortunately, most effort concentrated on improving properties of the estimators which are only indirectly related to calibration of the uncertainty estimates. Moreover, the problem of assessing quality of uncertainty estimates has been largely avoided by the machine learning community, but for a few exceptions like Vovk et al., (2006). Contrarily, statisticians have studied this area for several decades (Neyman, 1937; Szabo et al., 2014); we expect their ideas to be indispensable for further progress. Substantial conceptual and algorithmic advances are needed to develop a practical approach applicable to large modern models and datasets. We plan to explore ways of assessing quality of uncetainty estimates provided by modern approximate inference techniques, and use these to design more reliable and safe algorithms. Early signs that this might be feasible have appeared in recent publications (Chwialkowski et al., 2016; Jing et al., 2016).

V. Vovk, A. Gammerman, G. Shafer (2006) Algorithmic Learning in a Random World. Springer.
J. Neyman (1937) Outline of a Theory of Statistical Estimation Based on the Classical Theory of Probability. Philosophical Transactions of the Royal Society of London A, 236: 333-380.
B. Szabo, A. W. van der Vaart, J. H. Zanten (2014) Frequentist coverage of adaptive nonparametric Bayesian credible sets. The Annals of Statistics 43.4: 1391-1428.
K. Chwialkowski, H. Strathmann, A. Gretton (2016) A kernel test of goodness of fit. In Proceedings of the International Conference on Machine Learning (ICML).
L. Jing, M. G'Sell, A. Rinaldo, R. J. Tibshirani, L. Wasserman (2016) Distribution-Free Predictive Inference For Regression. arXiv preprint arXiv:1604.04173.

Student:

Jiri Hron

Period of Study:

Oct 17 - Sep 22

Funder:

EPSRC

Project Status:

Closed

Project Category:

Studentship

Project Reference:

1949878

Research Topic:

Unclassified

Organisations

People	ORCID iD
Zoubin Ghahramani (Primary Supervisor)
Jiri Hron (Student)

Publications

Author Name

Title Publication Date Published

10 25 50

Gal Y (2017) Concrete dropout

Hron J (2018) Variational Bayesian dropout: pitfalls and fixes

Janz D (2019) Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning

Matthews AGG (2018) Gaussian process behaviour in wide deep neural networks

Novak R (2020) Neural Tangents: Fast and Easy Infinite Neural Networks in Python

Novak R (2019) Bayesian Deep Convolutional Networks with Many Channels are Gaussian Processes

Rowland M (2019) Orthogonal Estimation of Wasserstein Distances

Studentship Projects

Project Reference	Relationship	Related To	Start	End	Student Name
EP/R511870/1			01/10/2017	30/09/2023
1949878	Studentship	EP/R511870/1	01/10/2017	30/09/2022	Jiri Hron

Abstract

Organisations

People

ORCID iD

Publications

Studentship Projects