WhisperX: Time-Accurate Speech Transcription of Long-Form Audio (2023)
Attributed to:
Visual AI: An Open World Interpretable Visual Transformer
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.48550/arxiv.2303.00747
Publication URI: https://arxiv.org/abs/2303.00747
Type: Preprint