Automatic Extraction of Rich Metadata from Broadcast Speech

Lead Research Organisation: University of Edinburgh
Department Name: Sch of Informatics

Abstract

The research studentship will be concerned with automatically learning to extract rich metadata information from broadcast television recordings, using speech recognition and natural language processing techniques. We will build on recent advances in convolutional and recurrent neural networks, using architectures which learn representations jointly, considering both acoustic and textual data. The project will build on our current work in the rich transcription of broadcast speech using neural network based speech recognition systems, along with neural network approaches to machine reading and summarisation. In particular, we are interested in developing approaches to transcribing broadcast speech in a way appropriate to the particular context. This may include compression or distillation of the content (perhaps to fit in with the constraints of subtitling), transforming conversational speech into a form that is more easy to read as text, or transcribing broadcast speech in a way appropriate for a particular reading age.

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/P510488/1 01/10/2016 30/09/2021
2104504 Studentship EP/P510488/1 01/09/2018 31/08/2021 Van Quy Luu