Multimodal Video Search by Examples (MVSE)
Lead Research Organisation:
University of Surrey
Department Name: Vision, Speech and Signal Processing (CVSSP)
Abstract
Abstracts are not currently available in GtR for all funded research. This is normally because the abstract was not required at the time of proposal submission, but may be because it included sensitive information such as personal details.
Publications
Liu D
(2024)
Importance Weighted Structure Learning for Scene Graph Generation.
in IEEE transactions on pattern analysis and machine intelligence
Ju L
(2023)
Keep an eye on faces: Robust face detection with heatmap-Assisted spatial attention and scale-Aware layer attention
in Pattern Recognition
Marikkar U
(2023)
LT-ViT: A Vision Transformer for Multi-Label Chest X-Ray Classification
Xu T
(2024)
Memory Prompt for Spatiotemporal Transformer Visual Object Tracking
in IEEE Transactions on Artificial Intelligence
Chen Z
(2025)
Multi-layer multi-level comprehensive learning for deep multi-view clustering
in Information Fusion
Dong W
(2024)
One-pass View-unaligned Clustering
in IEEE Transactions on Multimedia
Li R
(2024)
Perceiving Actions via Temporal Video Frame Pairs
in ACM Transactions on Intelligent Systems and Technology
| Description | Successfully demonstrated the ability to index the BBC image and video archive by face identity. |
| Exploitation Route | There are many potential applications, especially in policing and security. |
| Sectors | Digital/Communication/Information Technologies (including Software) |
| Description | The findings could be used for automating the retrieval of individuals from large image and video archives by face. |
| First Year Of Impact | 2023 |
| Sector | Culture, Heritage, Museums and Collections |
| Impact Types | Cultural Societal Economic |
| Description | BBC |
| Organisation | British Broadcasting Corporation (BBC) |
| Country | United Kingdom |
| Sector | Public |
| PI Contribution | This is a collaborative project involving Queen's University Belfast, the University of Cambridge and the BBC. Surrey is focusing on face indexing and retrieval, sound classification and scene analysis. |
| Collaborator Contribution | Problem specification, user advice and feedback, evaluation, and a development dataset. |
| Impact | Recorded in the publication section |
| Start Year | 2021 |
| Description | International Joint Laboratory for Pattern Recognition and Computational Intelligence |
| Organisation | Jiangnan University |
| Country | China |
| Sector | Academic/University |
| PI Contribution | Supervising visiting PhD students and researchers from Jiangnan University, and visiting Jiangnan University to collaborate with the local research team. |
| Collaborator Contribution | Working on joint problems in machine learning. |
| Impact | See the Publications section |
| Start Year | 2016 |
| Description | Queen's University of Belfast |
| Organisation | Queen's University Belfast |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | This is a collaborative project involving Queen's University Belfast, the University of Cambridge and the BBC. Surrey is focusing on face indexing and retrieval, sound classification and scene analysis. |
| Collaborator Contribution | Queen's University Belfast focuses on hashing and on developing the demonstrator of the retrieval system. |
| Impact | Recorded in the publication section |
| Start Year | 2021 |
| Description | University of Cambridge |
| Organisation | University of Cambridge |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | This is a collaborative project involving Queen's University Belfast, the University of Cambridge and the BBC. Surrey is focusing on face indexing and retrieval, sound classification and scene analysis. |
| Collaborator Contribution | Cambridge is focusing on speech understanding, speaker recognition and topic analysis. |
| Impact | Reported in the publications section |
| Start Year | 2021 |
