WhisperX: Time-Accurate Speech Transcription of Long-Form Audio (2023)

First Author: Bain M

Attributed to: Visual AI: An Open World Interpretable Visual Transformer funded by EPSRC

Abstract

No abstract provided

Bibliographic Information

Digital Object Identifier: http://dx.doi.org/10.48550/arxiv.2303.00747

Publication URI: https://arxiv.org/abs/2303.00747

Type: Preprint