Concept-based explanations for medical imaging

Lead Research Organisation: University of Oxford

Abstract

The black-box nature of deep learning models limits their potential clinical use, because it is difficult to validate, and therefore trust, their predictions. Interpretability methods aim to address this by explaining or presenting model predictions in terms a human can understand. Explanations of erroneous predictions can suggest improvements and aid algorithm debugging. They also provide a way to check whether a model conforms to ethical standards, for example by testing for bias with respect to concepts such as race.
Recent work in machine learning and computer vision has used concept-based explanations, in which concepts are defined in the activation space of the neural network. These methods explore model sensitivity to those concepts rather than to individual pixels in the input space, as common techniques such as saliency maps do. However, to date these techniques have not been widely applied to medical imaging, where, particularly in medical ultrasound, semantically different concepts can be similar in appearance. By providing interpretable deep learning, we aim to increase the uptake of deep learning in medical imaging, as clinicians are more likely to trust and use the technology.
In this doctoral research, we propose to use concept-based interpretability methods, such as testing with concept activation vectors (TCAV), to explore which concepts a deep convolutional neural network uses in its predictions. Using an initial exemplar problem of gestational age (GA) estimation of a fetus from ultrasound scans, we aim to explore whether the presence or shape of specific structures in the fetal brain is important for predicting GA, as these are concepts a clinician would use. Several large datasets are available for this research, including the INTERGROWTH-21st dataset, which contains 121,000 images approximately uniformly distributed across 13-42 weeks' gestation. First, we will use TCAV to test whether a GA prediction model uses similar concepts to clinicians when making a prediction, which would support trust in its use. We plan to explore different methods of concept discovery and validation to make them suitable for this imaging modality. The research may require us to develop new metrics that measure how faithful the method is to the model's underlying behaviour and how well the concepts are represented within the network. New metrics may give us a better understanding of when and where these methods succeed and fail, and can drive further method development. Additional image- and video-based prediction tasks may be considered as the research progresses, to expand the "vocabulary" of explanations that can be offered.
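As a rough illustration of the TCAV idea described above, the sketch below fits a concept activation vector (CAV) as the normal of a linear classifier separating "concept" activations from random activations, then reports the fraction of inputs whose prediction gradient points along that vector. Everything here is an assumption made for illustration: the activations are synthetic, the toy prediction head stands in for a real GA model, the least-squares classifier is one simple choice of linear classifier, and the helper names `fit_cav` and `tcav_score` are hypothetical. The statistical testing over multiple random sets used in TCAV is omitted.

```python
import numpy as np


def fit_cav(concept_acts, random_acts):
    """Return a unit-norm CAV: the normal of a least-squares linear
    classifier separating concept activations (+1) from random ones (-1)."""
    X = np.vstack([concept_acts, random_acts])
    y = np.concatenate([np.ones(len(concept_acts)), -np.ones(len(random_acts))])
    X1 = np.hstack([X, np.ones((len(X), 1))])  # append a bias column
    w, *_ = np.linalg.lstsq(X1, y, rcond=None)
    cav = w[:-1]  # drop the bias term
    return cav / np.linalg.norm(cav)


def tcav_score(grads, cav):
    """Fraction of inputs whose directional derivative along the CAV is positive."""
    return float(np.mean(grads @ cav > 0))


# --- Synthetic demonstration (all data below is made up) ---
rng = np.random.default_rng(0)
dim = 8

# Assume the concept shifts layer activations along the first axis.
concept_direction = np.zeros(dim)
concept_direction[0] = 1.0
concept_acts = rng.normal(size=(100, dim)) + 3.0 * concept_direction
random_acts = rng.normal(size=(100, dim))
cav = fit_cav(concept_acts, random_acts)

# Toy prediction head f(a) = sum(head * tanh(a)); its gradient with
# respect to the activations is head * (1 - tanh(a)^2).
head = rng.normal(scale=0.1, size=dim)
head[0] = 2.0
test_acts = rng.normal(size=(200, dim))
grads = (1.0 - np.tanh(test_acts) ** 2) * head

score = tcav_score(grads, cav)
print(f"TCAV score: {score:.2f}")
```

In a real setting the activations and gradients would come from a chosen layer of the trained network, and a score near 0.5 would suggest the prediction is not consistently sensitive to the concept.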
The project will apply, compare and improve concept-based deep learning interpretability methods for medical imaging. These methods will provide a useful way to explore a model's behaviour and a valuable tool for increasing the uptake of deep learning image- and video-based prediction models in clinical care.
This project falls within the EPSRC Healthcare Technologies, ICT, and Artificial Intelligence and Robotics research areas.

Planned Impact

In the same way that bioinformatics has transformed genomic research and clinical practice, health data science will have a dramatic and lasting impact upon the broader fields of medical research, population health, and healthcare delivery. The beneficiaries of the proposed training programme, and of the research that it delivers and enables, will include academia, industry, healthcare, and the broader UK economy.

Academia: Graduates of the training programme will be well placed to start their post-doctoral careers in leading academic institutions, engaging in high-impact multi-disciplinary research, helping to build training and research capacity, and sharing their experience within the wider academic community.

Industry: Partner organisations will benefit from close collaboration with leading researchers, from the joint exploration of research priorities, and from the commercialisation of arising intellectual property. Other organisations will benefit from the availability of highly-qualified graduates with skills in big health data analytics.

Healthcare: Healthcare organisations and patients will benefit from the results of enabled and accelerated health research, leading to new treatments and technologies, and an improved ability to identify and evaluate potential improvements in practice through the analysis of real-world health data.

Economy: The life sciences sector is a key component of the UK economy. The programme will provide partner companies with direct access to leading-edge research. Graduates of the programme will be well-qualified to contribute to economic growth - supporting health research and the development of new products and services - and will be able to inform policy and decision making at organisational, regional, and national levels.

Student:

Angus Nicolson

Period of Study:

Oct 20 - Sep 24

Funder:

EPSRC

Project Status:

Active

Project Category:

Studentship

Project Reference:

2432736

Research Topic:

Unclassified

Organisations

University of Oxford (Lead Research Organisation)

People	ORCID iD
Alison Noble (Primary Supervisor)
Jim Davies (Primary Supervisor)
Yarin Gal (Primary Supervisor)
Angus Nicolson (Student)

Studentship Projects

Project Reference	Relationship	Related To	Start	End	Student Name
EP/S02428X/1			01/04/2019	30/09/2027
2432736	Studentship	EP/S02428X/1	01/10/2020	30/09/2024	Angus Nicolson