Developing Visual Data Understanding through Self-Similarity

Lead Research Organisation: University of Oxford

Abstract

EPSRC Research Areas: Image and Vision Computing, Artificial Intelligence Technologies

Research Context: Comparison between data of the same type (e.g. vectors, text files, images) is a critical task in information processing. In the context of visual data however (images and videos), computer vision methods still lack meaningful tools for comparison relative to human visual perception. For example, a child which sees a single picture of a zebra at the end an alphabet book, will often be able to recognise the animal in a zoo, despite inevitable differences in pose, appearance, scale and other quantities. In this case the child has learnt the similarity between two very different pieces of raw visual data.

Much of the current progress of deep neural networks has relied on large human-annotated datasets, which are expensive and labour-intensive to collect. Returning to the example, current methods will struggle to understand the character of a zebra from a single example which can then be applied immediately, as a child could.

Aims and Objectives: This research aims to leverage self-similarity present in natural images to solve multiple tasks in the supervised-learning framework and to develop representations of visual data which can be widely applied to multiple computer vision tasks.
Short-term objectives revolve around using self-similarity to solve the problem of class-agnostic counting and salient object subitizing, i.e. predicting the existence and the number of salient objects in an image. Provided an example of any object of interest, we aim to provide accurate counts in a collection of query images, with no/minimal fine-tuning. This could be applied in microbiology in the context of counting cell growth given different initial conditions or in zoology to count the number of a certain species at different times of the day.
Long-term research aims are to develop the current work of feature-learning using the self-supervised framework with a focus on techniques relying on self-similarity.

Novelty of the Research Methodology: With reference to the short-term objective of class-agnostic counting and salient object subitizing. The emphasis on no/minimal fine-tuning on a given domain is connected to zero/few-shot learning in the counting task. No previous model has been able to do this effectively without requiring significant training on a domain of interest.

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
NE/W502728/1 01/04/2021 31/03/2022
2076905 Studentship NE/W502728/1 01/10/2018 15/01/2023 Prannay Kaul