Using artificial intelligence to mitigate scanner bias in brain disorders

Lead Research Organisation: King's College London
Department Name: Psychosis Studies

Abstract

Brain-based disorders represent 14% of the global burden of disease. Artificial Intelligence (AI) applications to neuroimaging have the potential to improve the detection and treatment of these disorders. This potential has been demonstrated in controlled research setups, but the real-world clinical implementation of these techniques is limited by differences in scanning equipment and protocols. These differences, known as scanner bias, means that diagnostic and prognostic AI applications developed using images from a certain machine tend to perform poorly when applied to images from different machines. There is preliminary evidence that some AI-based approaches can mitigate this problem, however, they are limited to healthy subjects. We will adapt and validate Neuroharmony, an AI-based solution for this problem, using data from clinical populations with psychiatric or neurological disorders obtained across multiple machines. The results obtained here will improve Neuroharmony, validate it for clinical application and make it available to the research and clinical communities. This tool can help bridge the gap between AI-based research and clinical practice.

Technical Summary

In 2017 brain-based disorders represented 14% of the global burden of disease. Imaging-based analyses have the potential to provide reliable diagnostic and prognostic markers for brain-related disorders. A number of successful applications to brain disorders have been reported. However, a critical challenge in bringing these machine learning applications to clinical practice is that the use of different scanners and acquisition protocols results in inconsistent measures of the brain. In some cases, scanner-related variability can blur or even eliminate clinically relevant information from the scans. A number of methods for mitigating inter-scanner bias, commonly known as data harmonisation, have been proposed. However, the consolidated harmonisation methods have limited translational value as they do not allow the harmonisation of individual images coming from a new unseen scanner, which is essential for a real-world application of machine learning. They typically need a statistically significant sample as a reference, therefore they cannot be implemented in a cross-validation design. To overcome this challenge, we developed Neuroharmony, a tool that can harmonise single images from unseen/unknown scanners based on a set of image quality metrics (e.g. intrinsic characteristics like signal-to-noise ratio, contrast-to-noise ratio, etc.) which can be extracted from individual images without requiring a statistically representative sample from each new scanner of interest. Neuroharmony was proven effective in reducing systematic scanner-related bias from the distribution of brain volumes of healthy controls. However, its generalizability to patient data is yet to be examined. In this project, we aim to demonstrate that Neuroharmony can be used to harmonise data collected from clinical samples while preserving disease-related variability.

Publications

10 25 50
 
Title Neuroharmony 
Description We have developed the open-source tool 'Neuroharmony' for scanner harmonization in structural Magnetic Resonance Imaging (MRI), first published by Garcia-Dias et al. (2020). Neuroharmony uses the regional volumes of an MRI scan, intrinsic image quality metrics generated by MRIQC (Esteban et al., 2017) (https://github.com/nipreps/mriqc), and basic demographic information to predict the ComBat (Fortin et al., 2018) (https://github.com/Jfortin1/ComBatHarmonization) corrections of a new scan. The main advantage of Neuroharmony over existing harmonization tools is that it requires no prior knowledge about the way the MRI scan was acquired, so it can be applied to single MRI scans. In 2022, we were awarded an MRC grant to address three aims: (1) the evaluation of Neuroharmony on clinical samples to assess if disease-related variability can be preserved during harmonisation, (2) the exploration of other machine learning approaches, and (3) revision of the online code documentation to make the tool more user-friendly. The results are summarized below. (1) For the clinical evaluation, we trained a Neuroharmony model (using random forest regression, as published) on 4,903 healthy controls from 56 datasets and tested it on 701 subjects with first-episode psychosis (FEP) and 714 healthy controls from the same five scanners. In these independent test sets, scanner bias was reduced not only in healthy controls but also in patients. (2) We trained a Neuroharmony model using deep autoencoders to compare it to the published approach of random forest regression. The new model was faster to train and led to similar results as the original model. We plan to make this improved model publicly available in a new release this year. (3) The Neuroharmony documentation is available on the Read the Docs platform (https://neuroharmony.readthedocs.io) and is linked to its public GitHub repository (https://github.com/garciadias/Neuroharmony). We updated the documentation with more information, such as Frequently Asked Questions, and will publish further updates in the next release. References Esteban et al. (2017). MRIQC: Advancing the automatic prediction of image quality in MRI from unseen sites. PLOS ONE, 12(9), e0184661. https://doi.org/10.1371/journal.pone.0184661 Fortin et al. (2018). Harmonization of cortical thickness measurements across scanners and sites. NeuroImage, 167, 104-120. https://doi.org/10.1016/j.neuroimage.2017.11.024 Garcia-Dias et al. (2020). Neuroharmony: A new tool for harmonizing volumetric MRI data from unseen scanners. NeuroImage, 220. https://doi.org/10.1016/j.neuroimage.2020.117127 
Type Of Technology Webtool/Application 
Year Produced 2022 
Impact At this stage we do not have data on number of downloads and number of publications from external researchers. 
URL https://neuroharmony.readthedocs.io