Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection (2024)
Attributed to:
BBC Prosperity Partnership: Future Personalised Object-Based Media Experiences Delivered at Scale Anywhere
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Publication URI: https://arxiv.org/abs/2312.09034
Type: Conference/Paper/Proceeding/Abstract