Learning about Activities from Video

Lead Research Organisation: University of Leeds
Department Name: Sch of Computing

Abstract

Imagine a system that could search the web for video clips containing an activity similar to one already highlighted (e.g. a car parking or two people having a conversation); that could participate in a card game after observing others play; and that could detect someone involved in an unfamiliar activity in a car park. Furthermore, suppose that it could do all of these things with no prior knowledge of the specific objects and activities depicted. All of these capabilities can be couched in terms of looking for similar or analogous activities in video clips.

A lot of work has been done over the past forty years on devising methods for finding objects and activities in pictures and video clips by hand-crafting computer models of what they are expected to look like. Ways have now been found to fully automate the creation of such models for objects (e.g. pedestrians) and simple movements (e.g. running) by learning from large sets of pictures and video clips. This should therefore make it possible to search for similar objects and movements with no prior knowledge of those things.

Some progress has been made recently on extending this level of automation to a limited range of more complex activities in very simple scenes. This has been achieved by first learning about the appearance of objects and then learning about the activities in which they are involved using logical induction. Unfortunately, there is not yet an easy way to ensure that the object categories produced are appropriate for the activities to be learnt. Our main aim is to resolve this problem by steering the search for object categories towards those that lead to the most coherent set of activities. A consequence of this could be to change the way we think about age-old problems in computer vision and logical reasoning.
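To make the final idea concrete, the following is a minimal, self-contained Python sketch of "steering the search for object categories towards those that lead to the most coherent set of activities". It is an illustration, not the project's method: the synthetic data, the tiny k-means categoriser, and the transition-entropy coherence score are all assumptions standing in for learned appearance models and logical induction over activities.

```python
# Illustrative sketch only: choose the object-category granularity whose
# induced activity model is most coherent. All names, data, and the
# entropy-based coherence score are assumptions made for this example.
import numpy as np

rng = np.random.default_rng(0)

def kmeans(points, k, iters=50):
    """Tiny k-means stand-in for an object-category learner."""
    centres = points[rng.choice(len(points), k, replace=False)]
    for _ in range(iters):
        labels = np.argmin(((points[:, None] - centres[None]) ** 2).sum(-1), axis=1)
        for j in range(k):
            if np.any(labels == j):
                centres[j] = points[labels == j].mean(axis=0)
    return labels

def activity_coherence(label_sequences, k):
    """Induce a first-order transition model over category labels and score
    coherence as negative mean transition entropy: predictable transitions
    suggest the categories support a coherent set of activities."""
    counts = np.ones((k, k))  # Laplace smoothing
    for seq in label_sequences:
        for a, b in zip(seq[:-1], seq[1:]):
            counts[a, b] += 1
    probs = counts / counts.sum(axis=1, keepdims=True)
    return -(-(probs * np.log(probs)).sum(axis=1).mean())

# Synthetic "video clips": per-frame appearance features for tracked objects,
# generated so that two underlying categories alternate within each clip.
true_centres = np.array([[0.0, 0.0], [5.0, 5.0]])
sequences = []
for _ in range(20):
    true_labels = np.tile([0, 1], 5)  # a simple alternating "activity"
    feats = true_centres[true_labels] + rng.normal(scale=0.3, size=(10, 2))
    sequences.append(feats)
points = np.concatenate(sequences)

# Steer the category search: try several granularities and keep the one
# whose induced activity model is most coherent.
best = None
for k in range(2, 6):
    labels = kmeans(points, k)
    label_seqs = np.split(labels, len(sequences))  # back into per-clip sequences
    score = activity_coherence(label_seqs, k)
    print(f"k={k}: coherence={score:.3f}")
    if best is None or score > best[0]:
        best = (score, k)

print("chosen number of object categories:", best[1])
```

With the alternating two-category clips above, k=2 yields nearly deterministic transitions and so the highest coherence, which is the sense in which the activity model feeds back to select the object categories.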
