Environment-aware Listener-Optimized Binaural Enhancement of Speech (E-LOBES)

Lead Research Organisation: Imperial College London
Department Name: Electrical and Electronic Engineering

Abstract

Age-related hearing loss affects over half the UK population aged over 60. Hearing loss makes communication difficult and so has severe negative consequences for quality of life. The most common treatment for mild-to-moderate hearing loss is the use of hearing aids. However even with aids, hearing impaired listeners are worse at understanding speech in noisy environments because their auditory system is less good at separating wanted speech from unwanted noise. One solution for this is to use speech enhancement algorithms to amplify the desired speech signals selectively while attenuating the unwanted background noise.

It is well known that normal hearing listeners can better understand speech in noise when listening with two ears rather than with only one. Differences between the signals at the two ears allow the speech and noise to be separated based on their spatial locations resulting in improved intelligibility. Technological advances now make feasible the use of two hearing aids that are able to share information via a wireless link. By sharing information in this way, it becomes possible for the speech enhancement algorithms within the hearing aids to localize sound sources more accurately and, by jointly processing the signals for both ears, to ensure that the spatial cues that are present in the acoustic signals are retained. It is the goal of this project to exploit these binaural advantages by developing speech enhancement algorithms that jointly enhance the speech received by the two ears.

Most current speech enhancement techniques have evolved from the telecommunications industry and are designed to act only on monaural signals. Many of the techniques can improve the perceived quality of already intelligible speech but binary masking is one of the few techniques that has been shown to improve the intelligibility of noisy speech for both normal and hearing impaired listeners. In the binary masking approach regions of the time-frequency domain that contain significant speech energy are left unchanged while regions that contain little speech energy are muted. In this project we will extend existing monaural binary masking techniques to provide binaural speech enhancement while preserving the inter-aural time and level differences that are critical for the spatial separation of sound sources.

To train and tune our binaural speech enhancement algorithm we will also develop within the project an intelligibility metric that is able to predict the intelligibility of a speech signal for a binaural listener with normal or impaired hearing in the presence of competing noise sources. This metric is the key to finding automatically the optimum settings an individual listener's hearing aids in a particular environment.

The final evaluation and development of the binaural enhancement algorithm assess speech perception in noise in a panel of hearing-impaired listeners who will also be asked to assess the quality of the enhanced speech signals.

Planned Impact

The immediate beneficiaries from this research will be hearing aid manufacturers and, through them, hearing aid users. We expect that the techniques and algorithms developed in this research will make it substantially easier for hearing aid users to distinguish the individual speakers that are present especially when in a noisy listening environment.

However there are other circumstances in which listeners must use headphones or earphones when operating in an environment with high ambient noise. The techniques developed here can also benefit such users by allowing them to communicate intelligibly with others in a noisy environment.

The project will develop methods of characterizing the listening context and the acoustic properties of the listening environment from the received microphone signals and, from this information, identifying the listening context automatically. This is potentially of considerable benefit to designers of mobile phones or other mobile devices who will be able to adapt the operation of their systems to the context in which it is being used.

Publications

10 25 50
 
Description Recent technological developments in hearing aid design have made possible binaural processing in which the aids in the left and right ears communicate to improve the listening experience of the hearing-impaired user. This project was concerned with how the opportunities provided by binaural processing could best be exploited to address a common complaint of hearing impaired listeners, namely the intelligibility of speech in noisy situations. Binaural processing is particularly valuable when the target and interfering sound sources are at different spatial locations. Therefore, we have developed engineering algorithms that can characterise the acoustic environment of a hearing-aid user by estimating the spectral properties of interfering noise sources as well as their directions and spatial distribution. To exploit this knowledge of the environment, we developed new methods of binaural beamforming that can allow aids to have stable directional sensitivity even in situations where the listener's head is moving. We investigated novel binaural speech signal enhancement methods that are able to reduce unwanted background noise while preserving the spatial cues that allow a listener to distinguish sound sources based on their directions. These new approaches were then evaluated using a series of listening experiments involving both normally-hearing and hearing-impaired listeners. The results of these experiments were then exploited to help develop and evaluate mathematical models of speech intelligibility under binaural listening conditions. These models may be used to predict the binaural intelligibility of speech in a particular listening situation for a specific hearing-impaired listener. We expect that such metrics will be useful in the future to guide the optimisation of the performance of binaural hearing aids matched to the listener and to the environment.
Exploitation Route The primary focus of this project has been on exploiting the opportunities provided by recent binaural hearing-aids that incorporate a communications link between the left and right aids. The achievements of this project are of direct relevance not only to hearing-aid manufacturers and to other researchers in the field but also to those working in the field of robot audition where similar issues are encountered. In the course of this project, we have collected two substantial databases. The first database, which is publicly available, provides binaural intelligibility measurements on tests subjects wearing hearing-aids whose output signals are generated using either personalized or generic listener characteristics; this database will allow hearing-aid manufacturers to assess the benefits of incorporating personalized signal processing algorithms into their devices. The second database combines psychoacoustic measurements with binaural intelligibility measurements on both normally-hearing and hearing-impaired test subjects in a range of acoustic situations. This database, which we intend to make public, will allow the development and evaluation of improved binaural intelligibility metrics that can be personalized to a specific listener. Such metrics will in turn facilitate improved personalized enhancement algorithms
Sectors Digital/Communication/Information Technologies (including Software)

Healthcare

URL http://www.phon.ucl.ac.uk/project/elobes
 
Description This project focussed on exploiting the opportunities provided by current binaural hearing-aid designs that incorporate a communications link between the left and right aids. Within the project we developed techniques for characterising the acoustic environment and for performing binaural beamforming in order to permit aids to have a stable directional sensitivity even in situations where the listener's head is moving. The publicly available OTIMP database, which was collected as part of the project, is designed to allow researchers and hearing-aid manufacturers to assess the benefits of incorporating personalized signal processing algorithms into their devices. As of March 2022, this database has been downloaded over 300 times. The project led to two studies, sponsored by a consumer's association, that evaluated the performance of existing hearing aids. These studies found no consistent difference in terms of speech quality or intelligibility between the performance of hearing aids that were provided through public health insurance rather than by private companies. The challenges faced by the designers of hearing aids are closely related to those that arise in other applications involving head-worn microphones. The results from this project have led directly to a collaboration with Facebook Reality Labs, begun in September 2021, for a programme of fundamental research into multichannel acoustic signal processing that will extend the work done in this project into the domain of augmented reality devices.
First Year Of Impact 2021
Sector Digital/Communication/Information Technologies (including Software),Leisure Activities, including Sports, Recreation and Tourism
Impact Types Economic

 
Description Environment and Listener Optimised Speech Processing for Hearing Enhancement in Real Situations (ELO-SPHERES)
Amount £554,976 (GBP)
Funding ID EP/S03580X/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 09/2019 
End 09/2023
 
Description Environment and Listener Optimised Speech Processing for Hearing Enhancement in Real Situations (ELO-SPHERES)
Amount £587,008 (GBP)
Funding ID EP/S035842/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 09/2019 
End 09/2023
 
Title ELOBES evaluations 
Description This database combines psychoacoustic measurements with binaural intelligibility measurements on both normally-hearing and hearing-impaired test subjects in a range of acoustic situations. This database will allow the development and evaluation of improved binaural intelligibility metrics that can be personalized to a specific listener. Such metrics will in turn facilitate improved personalized enhancement algorithms. We plan to make this database publicly available once the related research papers are published. 
Type Of Material Database/Collection of data 
Year Produced 2019 
Provided To Others? No  
Impact We are currently using this database to evaluate binaural enhacement algorithms and binaural intelligibility prediction metrics. 
 
Title OTIMP: The Oticon-Imperial hearing aid impulse response database 
Description A database of acoustic impulse responses measured between a sphere of loudspeakers and hearing aids in a mildly reverberant listening room. The database includes measurements on 46 individuals and is particularly intended to allow the evaluation of hearing aid algorithms when applied to devices worn by different individuals. 
Type Of Material Database/Collection of data 
Year Produced 2019 
Provided To Others? Yes  
Impact This database is used within Oticon A/S for the development of hearing aids and has also been made freely available for other researchers to use. The database has been downloaded over 300 times (as of March 2022). 
URL https://doi.org/10.5281/zenodo.2660782
 
Title eBrIRD - ELOSPHERES binaural room impulse response database 
Description The ELOSPHERES binaural room impulse response database (eBrIRD) is a resource for generating audio for binaural hearing-aid (HA) experiments. It allows testing the performance of new and existing audio processing algorithms under ideal as well as real-life auditory scenes in different environments: an anechoic chamber, a restaurant, a kitchen and a car cabin. The database consists of a collection of binaural room impulse responses (BRIRs) measured with six microphones. Two microphones represent the listener's eardrums. Four microphones are located at the front and back of two behind-the-ear hearing aids placed over the listener's pinnae. The database allows simulations of head movement in the transverse plane. [Updated July 2021 to include Car cabin BRIR]. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
Impact Materials to use for construction of listening tests exploiting spatial audio. 
URL https://www.phon.ucl.ac.uk/resource/ebrird/
 
Description MRC Microphone Network 
Organisation Cardiff University
Department School of Psychology
Country United Kingdom 
Sector Academic/University 
PI Contribution Three members of the research team have participated in meetings of the MRC "Microphone" Network (MR/M025616/1) contributing intellectual input and expertise.
Collaborator Contribution Other members of the network have contributed intellectual input and expertise from a wide range of backgrounds.
Impact The network has provided an opportunity to meet others working in the field and it is expected that this will lead to further bilateral collaboration in the future.
Start Year 2015
 
Description MRC Over-Hear Network 
Organisation University College London
Country United Kingdom 
Sector Academic/University 
PI Contribution Two members of the research team have participated in meetings of the MRC "Over-Hear" Network (MR/M025659/1 ) contributing intellectual input and expertise.
Collaborator Contribution Other members of the network have contributed intellectual input and expertise from a wide range of backgrounds.
Impact The network has provided an opportunity to meet others working in the field.
Start Year 2015
 
Title Audio3D - Spatial Audio Simulation System 
Description Audio3D is a program to simulate simple virtual audio environments. Multiple monophonic sound sources can be placed in a virtual room and are combined using room response and head-related transfer functions into a real-time binaural audio signal that can be listened to over headphones. Audio3D also supports a head-tracker such that the sound sources appear at a constant room position even if the header orientation changes. A server version can be used to run listening experiments in virtual rooms. The server runs the real time binaural rendering and head tracking, while the experiment can be controlled by external clients written in Javascript or MATLAB. 
Type Of Technology Software 
Year Produced 2016 
Impact The Audio3D system has been used to simulate listening environments for speech-in-noise intelligibility experiments. We have shown that intelligibility scores in a virtual environment match scores measured in a physical environment. In future we will use the system with hearing-impaired listeners to investigate the effects of novel spatial signal processing algorithms on the intelligibility of speech in noise. 
URL http://www.phon.ucl.ac.uk/resource/audio3d/
 
Title psycest - estimate psychometric functions 
Description The software efficiently estimates one or more psychometric functions such as, for example, the intelligibility of speech as a function of signal-to-noise ratio 
Type Of Technology Software 
Year Produced 2016 
Open Source License? Yes  
Impact The software is open source but no usage or download statistics are available at present. 
URL http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/mdoc/v_mfiles/v_psycest.html
 
Description Institute of Acoustics/Audio Eng Soc - invited talk by Alastair Moore 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Professional Practitioners
Results and Impact This invited talk was presented to professional practitioners in audio and acoustics. Its purpose was to broaden knowledge and to raise awareness of techniques for combating unwanted reverberation.
Year(s) Of Engagement Activity 2016
URL https://ioa.org.uk/civicrm/event/info?id=210&reset=1
 
Description Invited Talk to Facebook Reality Labs 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Industry/Business
Results and Impact This invited talk to Facebook Reality Labs has resulted in discussions for future collaboration.
Year(s) Of Engagement Activity 2020
 
Description Invited talk (Oticon A/S, Copenhagen, Denmark) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Industry/Business
Results and Impact I presented some research research to an audience at a seminar given at Oticon's premises in Copenhagen. The audience was primarily research and development engineers working on hearing aids in the company. The secondary audience was audiology practitioners who work directly with the hearing impaired. The talk was simultaneously broadcast to researchers at the company's research facility in Eriksholm and to collaborators in Sennheiser. The talk gave rise to several follow-up discussions and an invitation to visit the research centre at Eriksholm. The slides from the talk were requested and have been posted within the company's intranet.
Year(s) Of Engagement Activity 2017
 
Description Invited talk at Cirrus Logic, London, UK 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Industry/Business
Results and Impact Around 8 people attended this technical talk from the company plus a graduate student. During and immediately following the talk there were numerous discussion arising out of audience questions.
Year(s) Of Engagement Activity 2018
 
Description Invited talk at Massachusetts Institute of Technology, Cambridge, MA, USA 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Around 50 people attended this technical talk include several staff members of MIT Lincoln labs as well as staff and graduate students from MIT. During and immediately following the talk there were numerous discussion arising out of audience questions.
Year(s) Of Engagement Activity 2018
 
Description Invited talk at Toshiba Cambridge Research Lab., Cambridge, UK 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Industry/Business
Results and Impact Around 12 people attended this technical talk, predominately industry-based researchers and managers from the Toshiba company. During and immediately following the talk there were numerous discussion arising out of audience questions. The convener of this talk has become a visiting professor at Imperial College London and participates in related research and teaching activities.
Year(s) Of Engagement Activity 2018
 
Description Presentation at Winter School, Aalborg Univ, Denmark 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This presentation was part of a week-long Winter School on Signal Processing for Hearing Assistive Devices for an audience who primarily came from the hearing aid manufacturing and fitting industries. The talk focussed on the signal processing techniques used in hearing aids and the effect that they have on intelligibility when listening to speech.
Year(s) Of Engagement Activity 2017
URL http://caspr.es.aau.dk/education/courses/
 
Description Research Presentation at Uni. Aalborg, Denmark 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact I presented a research seminar within the Centre for Acoustic Signal Processing Research (CASPR) at University of Aalborg. This talk covered recent research in acoustic signal processing related to hearing aids and was attended by a mixture of university professors and their PhD/UG students as well as representatives from industry. The talk made the front-page news item of their website: http://caspr.es.aau.dk at the time. Collaboration links and potential for student exchanges were established subsequently.
Year(s) Of Engagement Activity 2017
URL http://caspr.es.aau.dk
 
Description Talk at ISVR, Southampton 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Postgraduate students
Results and Impact Research seminar to the Institute of Sound and Vibration Research presenting new research and followed by an interactive technical discussion.
Year(s) Of Engagement Activity 2016
 
Description Thomas Simm Littler Lecture to the British Society of Audiology (BSA) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact In October 2020, Stuart Rosen gave the Thomas Simm Littler Lectureship to the British Society of Audiology (BSA) This is a biennial prize awarded 'in recognition of a sustained academic contribution to hearing science and audiology. This is the BSA's most prestigious award and consists of a certificate and honorarium ...'.

Title: 'How do background sounds interfere with speech?'
Year(s) Of Engagement Activity 2020
 
Description Workshop on Optimising Binaural Hearing for Environment and Listener (ELOBES2019), Ghent, Belgium 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact This one-day workshop was held as a satellite meeting of the annual Speech-in-Noise Workshop which brings together professional practitioners, commercial companies and academic researchers who are involved in the area of speech communication in noisy conditions. The workshop consisted of a series of invited talks and audience discussions about binaural listening to speech in noise. The workshop was well attended and the talks provoked lively discussions between the participants.
Year(s) Of Engagement Activity 2019
URL https://www.phon.ucl.ac.uk/events/elobes2019/