Face, Body, Voice: Video Person-Clustering with Multiple Modalities (2021)
Attributed to:
Visual AI: An Open World Interpretable Visual Transformer
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.48550/arxiv.2105.09939
Publication URI: https://arxiv.org/abs/2105.09939
Type: Preprint