WhisperX: Time-Accurate Speech Transcription of Long-Form Audio (2023)
Attributed to:
Visual AI: An Open World Interpretable Visual Transformer
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.21437/interspeech.2023-78
Publication URI: http://dx.doi.org/10.21437/interspeech.2023-78
Type: Conference/Paper/Proceeding/Abstract