Leveraging Visual Supervision for Array-Based Active Speaker Detection and Localization (2024)
Attributed to:
S3A: Future Spatial Audio for an Immersive Listener Experience at Home
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.1109/taslp.2023.3346643
Publication URI: http://dx.doi.org/10.1109/taslp.2023.3346643
Type: Journal Article/Review
Parent Publication: IEEE/ACM Transactions on Audio, Speech, and Language Processing