Audio-Visual Scene Recognition
Lead Research Organisation: University of Surrey
Department Name: Centre for Vision, Speech and Signal Processing (CVSSP)
Abstract
Traditionally, audio and visual recognition departments have operated independently. However, with the emergence of new deep learning techniques, there is now an opportunity to combine elements from both sets of research groups, across the audio and visual domains.
My proposal is to work on a novel technique that allows a system to draw on both image and speech recognition in order to describe a scene, building on recent research in deep learning, specifically CycleGANs. Such a system has many potentially useful applications, for example for YouTube videos, or for companies such as IKEA that provide video instructions for constructing furniture.
People | ORCID iD |
---|---|
Mark Plumbley (Primary Supervisor) | |
Andrew Bailey (Student) | |
Studentship Projects
Project Reference | Relationship | Related To | Start | End | Student Name |
---|---|---|---|---|---|
EP/N509772/1 | | | 01/10/2016 | 30/09/2021 | |
2197029 | Studentship | EP/N509772/1 | 01/01/2019 | 29/02/2024 | Andrew Bailey |
EP/R513350/1 | | | 01/10/2018 | 30/09/2023 | |
2197029 | Studentship | EP/R513350/1 | 01/01/2019 | 29/02/2024 | Andrew Bailey |
EP/T518050/1 | | | 01/10/2020 | 30/09/2025 | |
2197029 | Studentship | EP/T518050/1 | 01/01/2019 | 29/02/2024 | Andrew Bailey |