Environment and Listener Optimised Speech Processing for Hearing Enhancement in Real Situations (ELO-SPHERES)

Lead Research Organisation: Imperial College London

Department Name: Electrical and Electronic Engineering

Abstract

Abstracts are not currently available in GtR for all funded research. This is normally because the abstract was not required at the time of proposal submission, but may be because it included sensitive information such as personal details.

Funded Value:

£587,008

Funded Period:

Sep 19 - Mar 23

Funder:

EPSRC

Project Status:

Closed

Project Category:

Research Grant

Project Reference:

EP/S035842/1

Principal Investigator:

Patrick Naylor

Research Subject:

Info. & commun. Technol. (50%)

Medical & health interface (50%)

Research Topic:

Artificial Intelligence (10%)

Biomechanics & Rehabilitation (50%)

Computer Graphics & Visual. (10%)

Music & Acoustic Technology (15%)

Vision & Senses - ICT appl. (15%)

Organisations

People	ORCID iD
Patrick Naylor (Principal Investigator)
David Brookes (Co-Investigator)	http://orcid.org/0000-0001-7105-4936
Alastair Moore (Researcher)

Publications

Author Name

Title Publication Date Published

|< < 1 2 3 4 > >|

10 25 50

D'Olne E (2022) Frame-Based Space-Time Covariance Matrix Estimation for Polynomial Eigenvalue Decomposition-Based Speech Enhancement

D'Olne E (2021) Model-based Beamforming for Wearable Microphone Arrays

D'Olne E. (2022) Speech Enhancement in Distributed Microphone Arrays Using Polynomial Eigenvalue Decomposition in European Signal Processing Conference

Green T (2022) Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement. in Trends in hearing

Grinstein E (2023) Dual input neural networks for positional sound source localization in EURASIP Journal on Audio, Speech, and Music Processing

Guiraud P (2022) Using a Single-Channel Reference with the Mbstoi Binaural Intelligibility Metric in SSRN Electronic Journal

Guiraud P (2023) Using a single-channel reference with the MBSTOI binaural intelligibility metric in Speech Communication

Guiraud P (2022) Machine Learning for Parameter Estimation in the MBSTOI Binaural Intelligibility Metric

Guiraud P. (2023) The MBSTOI Binaural Intelligibility Metric using a Close-Talking Microphone Reference

Hafezi S (2021) Narrowband multi-source direction-of-arrival estimation in the spherical harmonic domain. in The Journal of the Acoustical Society of America

Artistic and Creative Products
Key Findings
Impact Summary
Research Databases and Models
Collaboration
Engagement Activities


Title	Materials used in "Measuring audio-visual speech intelligibility under dynamic listening conditions using virtual reality"
Description	The materials in this record are the audio and video files, together with various configuration files, used by the "SEAT" software in the study described in Moore, Green, Brookes & Naylor (2022) "Measuring audio-visual speech intelligibility under dynamic listening conditions using virtual reality" They are shared in this form so that the experiment may be reproduced. For any other use please contact the authors to obtain the original database(s) from which these materials are derived. The materials were created to be compatible with v0.3 of SEAT, which is available from GitHub. Note that the materials must be placed at `C:\seat_experiments\cafe_AV.`
Type Of Art	Film/Video/Animation
Year Produced	2022
URL	https://zenodo.org/record/6889159


Description	The intelligibility of binaural speech can be estimated using a machine learning model with an accuracy comparable to, and in some cases better than, the classical metric.
Exploitation Route	Dissemination of our results has been done and, together with sharing our software, will enable others to estimate measures of binaural intelligibility better than before.
Sectors	Communities and Social Services/Policy Digital/Communication/Information Technologies (including Software) Electronics Healthcare


Description	A subsequent collaboration was established between Imperial College and Meta for research on hearing devices linked to augmented reality. The principal contact at Meta is Thomas Lunner.
First Year Of Impact	2022
Sector	Digital/Communication/Information Technologies (including Software)
Impact Types	Societal Economic


Title	HearVR - Virtual Reality Video database for audiovisual speech intelligibility assessment
Description	British English matrix sentences were recorded in our anechoic chamber against a green screen, using a 360° camera and a high-quality condenser microphone. These sentences contain 5 slots with 10 choices in each slot. Different sets of 200 sentences were recorded by 5 male and 5 female talkers sitting at a table. The individual talkers have been cropped from the videos and the green screen and table replaced by a transparent background to allow compositing of speakers in new 360° scenes. The individual matrix sentences have been extracted as 11s videos together with level-normalised monophonic audio. We have developed scripts that build multi-talker videos from these elements in a format that can be played on VR Headsets such as the Oculus 2. We have also developed a Unity application for the headsets, which will play the 360° composite videos and create spatialised sound sources for the talkers together with background noise delivered from multiple virtual loudspeakers.
Type Of Material	Database/Collection of data
Year Produced	2023
Provided To Others?	Yes
Impact	This database facilitates experiments into audiovisual listening behaviour with multiple speakers in realistic environments involving spatialised audio.
URL	https://speechandhearing.net/hearvr/


Title	Materials used in "Measuring audio-visual speech intelligibility under dynamic listening conditions using virtual reality"
Description	The materials in this record are the audio and video files, together with various configuration files, used by the "SEAT" software in the study described in Moore, Green, Brookes & Naylor (2022) "Measuring audio-visual speech intelligibility under dynamic listening conditions using virtual reality" They are shared in this form so that the experiment may be reproduced. For any other use please contact the authors to obtain the original database(s) from which these materials are derived. The materials were created to be compatible with v0.3 of SEAT, which is available from GitHub. Note that the materials must be placed at `C:\seat_experiments\cafe_AV.`
Type Of Material	Database/Collection of data
Year Produced	2022
Provided To Others?	Yes
Impact	These data were used in the publication: "Moore, Green, Brookes & Naylor (2022) "Measuring audio-visual speech intelligibility under dynamic listening conditions using virtual reality""
URL	https://zenodo.org/record/6889160


Title	Moore, Alastair H; Green, Tim The materials in this record are the audio and video files, together with various configuration files, used by the "SEAT" software in the study described in Moore, Green, Brookes & Naylor (2022) "Measuring audio-visual
Description	Materials of data and methodology
Type Of Material	Database/Collection of data
Year Produced	2022
Provided To Others?	Yes
Impact	Research paper
URL	https://zenodo.org/record/6889160#.Y_za6C-l3GA


Title	eBrIRD - ELOSPHERES binaural room impulse response database
Description	The ELOSPHERES binaural room impulse response database (eBrIRD) is a resource for generating audio for binaural hearing-aid (HA) experiments. It allows testing the performance of new and existing audio processing algorithms under ideal as well as real-life auditory scenes in different environments: an anechoic chamber, a restaurant, a kitchen and a car cabin. The database consists of a collection of binaural room impulse responses (BRIRs) measured with six microphones. Two microphones represent the listener's eardrums. Four microphones are located at the front and back of two behind-the-ear hearing aids placed over the listener's pinnae. The database allows simulations of head movement in the transverse plane. [Updated July 2021 to include Car cabin BRIR].
Type Of Material	Database/Collection of data
Year Produced	2021
Provided To Others?	Yes
Impact	Materials to use for construction of listening tests exploiting spatial audio.
URL	https://www.phon.ucl.ac.uk/resource/ebrird/


Description	Clarity Challenges for Machine Learning for Hearing Devices
Organisation	University of Nottingham
Department	School of Medicine
Country	United Kingdom
Sector	Academic/University
PI Contribution	The ELO-SPHERES project team are taking part in the Clarity Enhancement and Prediction Challenges.
Collaborator Contribution	The Clarity challenges are a series of machine learning challenges to enhance hearing-aid signal processing and to better predict how people perceive speech-in-noise. The ELO-SPHERES project has contributed an enhancement system to the first enhancement challenge and is taking part in future challenges.
Impact	Moore, A, Sina Hafezi, Rebecca Vos, Mike Brookes, Patrick A. Naylor, Mark Huckvale, Stuart Rosen... Gaston Hilkhuysen. (2021). A binaural MVDR beamformer for the 2021 Clarity Enhancement Challenge: ELO-SPHERES consortium system description.
Start Year	2021


Description	Clarity Challenges for Machine Learning for Hearing Devices
Organisation	University of Nottingham
Department	School of Medicine
Country	United Kingdom
Sector	Academic/University
PI Contribution	The ELO-SPHERES project team are taking part in the Clarity Enhancement and Prediction Challenges.
Collaborator Contribution	The Clarity challenges are a series of machine learning challenges to enhance hearing-aid signal processing and to better predict how people perceive speech-in-noise. The ELO-SPHERES project has contributed an enhancement system to the first enhancement challenge and is taking part in future challenges.
Impact	Moore, A, Sina Hafezi, Rebecca Vos, Mike Brookes, Patrick A. Naylor, Mark Huckvale, Stuart Rosen... Gaston Hilkhuysen. (2021). A binaural MVDR beamformer for the 2021 Clarity Enhancement Challenge: ELO-SPHERES consortium system description.
Start Year	2021


Description	Clarity Challenges for Machine Learning for Hearing Devices
Organisation	University of Salford
Country	United Kingdom
Sector	Academic/University
PI Contribution	The ELO-SPHERES project team are taking part in the Clarity Enhancement and Prediction Challenges.
Collaborator Contribution	The Clarity challenges are a series of machine learning challenges to enhance hearing-aid signal processing and to better predict how people perceive speech-in-noise. The ELO-SPHERES project has contributed an enhancement system to the first enhancement challenge and is taking part in future challenges.
Impact	Moore, A, Sina Hafezi, Rebecca Vos, Mike Brookes, Patrick A. Naylor, Mark Huckvale, Stuart Rosen... Gaston Hilkhuysen. (2021). A binaural MVDR beamformer for the 2021 Clarity Enhancement Challenge: ELO-SPHERES consortium system description.
Start Year	2021


Description	Clarity Challenges for Machine Learning for Hearing Devices
Organisation	University of Salford
Country	United Kingdom
Sector	Academic/University
PI Contribution	The ELO-SPHERES project team are taking part in the Clarity Enhancement and Prediction Challenges.
Collaborator Contribution	The Clarity challenges are a series of machine learning challenges to enhance hearing-aid signal processing and to better predict how people perceive speech-in-noise. The ELO-SPHERES project has contributed an enhancement system to the first enhancement challenge and is taking part in future challenges.
Impact	Moore, A, Sina Hafezi, Rebecca Vos, Mike Brookes, Patrick A. Naylor, Mark Huckvale, Stuart Rosen... Gaston Hilkhuysen. (2021). A binaural MVDR beamformer for the 2021 Clarity Enhancement Challenge: ELO-SPHERES consortium system description.
Start Year	2021


Description	Clarity Challenges for Machine Learning for Hearing Devices
Organisation	University of Sheffield
Department	Department of Computer Science
Country	United Kingdom
Sector	Academic/University
PI Contribution	The ELO-SPHERES project team are taking part in the Clarity Enhancement and Prediction Challenges.
Collaborator Contribution	The Clarity challenges are a series of machine learning challenges to enhance hearing-aid signal processing and to better predict how people perceive speech-in-noise. The ELO-SPHERES project has contributed an enhancement system to the first enhancement challenge and is taking part in future challenges.
Impact	Moore, A, Sina Hafezi, Rebecca Vos, Mike Brookes, Patrick A. Naylor, Mark Huckvale, Stuart Rosen... Gaston Hilkhuysen. (2021). A binaural MVDR beamformer for the 2021 Clarity Enhancement Challenge: ELO-SPHERES consortium system description.
Start Year	2021


Description	Clarity Challenges for Machine Learning for Hearing Devices
Organisation	University of Sheffield
Department	Department of Computer Science
Country	United Kingdom
Sector	Academic/University
PI Contribution	The ELO-SPHERES project team are taking part in the Clarity Enhancement and Prediction Challenges.
Collaborator Contribution	The Clarity challenges are a series of machine learning challenges to enhance hearing-aid signal processing and to better predict how people perceive speech-in-noise. The ELO-SPHERES project has contributed an enhancement system to the first enhancement challenge and is taking part in future challenges.
Impact	Moore, A, Sina Hafezi, Rebecca Vos, Mike Brookes, Patrick A. Naylor, Mark Huckvale, Stuart Rosen... Gaston Hilkhuysen. (2021). A binaural MVDR beamformer for the 2021 Clarity Enhancement Challenge: ELO-SPHERES consortium system description.
Start Year	2021


Description	Thomas Simm Littler Lecture to the British Society of Audiology (BSA)
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	National
Primary Audience	Professional Practitioners
Results and Impact	In October 2020, Stuart Rosen gave the Thomas Simm Littler Lectureship to the British Society of Audiology (BSA) This is a biennial prize awarded 'in recognition of a sustained academic contribution to hearing science and audiology. This is the BSA's most prestigious award and consists of a certificate and honorarium ...'. Title: 'How do background sounds interfere with speech?'
Year(s) Of Engagement Activity	2020

Abstract

Organisations

People

ORCID iD

Publications