Leveraging Visual Supervision for Array-Based Active Speaker Detection and Localization (2024)
Attributed to:
BBC Prosperity Partnership: Future Personalised Object-Based Media Experiences Delivered at Scale Anywhere
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.1109/taslp.2023.3346643
Publication URI: http://dx.doi.org/10.1109/taslp.2023.3346643
Type: Journal Article/Review
Parent Publication: IEEE/ACM Transactions on Audio, Speech, and Language Processing