Audio-Visual Scene Recognition

Lead Research Organisation: University of Surrey
Department Name: Vision Speech and Signal Proc CVSSP

Abstract

Traditionally audio and visual recognition departments have operated independently, however with the emergence of new deep learning techniques there is a possibility to combine elements from both sets of research groups, in the audio and visual domain.
My proposal is to work on a novel technique that will allow a system to draw on image and speech recognition in order to describe a scene by exploring the recent research in Deep Learning, specifically, Cycle GANs. There are many potentially useful applications for such a system, such as YouTube videos or for companies such as IKEA for video instructions on constructing furniture.

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/N509772/1 01/10/2016 30/09/2021
2197029 Studentship EP/N509772/1 01/01/2019 31/08/2022 Andrew Bailey
EP/R513350/1 01/10/2018 30/09/2023
2197029 Studentship EP/R513350/1 01/01/2019 31/08/2022 Andrew Bailey