Learning to Recognise Dynamic Visual Content from Broadcast Footage

Lead Research Organisation: University of Leeds
Department Name: Sch of Computing

Abstract

Abstracts are not currently available in GtR for all funded research. This is normally because the abstract was not required at the time of proposal submission, but may be because it included sensitive information such as personal details.
 
Description This grant (in part) relates to sign language recognition using large corpora of video data. One major Leeds contribution (the project is a collaboration with Oxford and Surrey) is the tracking of upper-body pose. Our efforts have led to personalised models that can be generated automatically for new video data sets. This means large corpora can be annotated automatically, and deep learning can then be applied to this massive volume of data to produce a state-of-the-art pose detector.
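
The core idea, using a personalised tracker's output as automatically generated training labels for a deep pose regressor, can be sketched as below. This is a minimal illustration only, not the project's code: the frame tensors, joint count and tiny network are placeholder assumptions (written here in PyTorch), and the actual systems are described in the publications listed under the Oxford collaboration.

    import torch
    import torch.nn as nn

    # Hypothetical setup: N video frame crops auto-annotated by a
    # personalised tracker, each with J upper-body joints as (x, y) pairs.
    N, J = 1024, 7                        # e.g. head, shoulders, elbows, wrists
    frames = torch.rand(N, 3, 64, 64)     # stand-in for real video crops
    pseudo_labels = torch.rand(N, J * 2)  # tracker output used as training targets

    # A deliberately tiny CNN regressor; the project's networks were far larger.
    model = nn.Sequential(
        nn.Conv2d(3, 16, 5, stride=2, padding=2), nn.ReLU(),   # 64x64 -> 32x32
        nn.Conv2d(16, 32, 5, stride=2, padding=2), nn.ReLU(),  # 32x32 -> 16x16
        nn.Flatten(),
        nn.Linear(32 * 16 * 16, J * 2),   # regress all joint coordinates at once
    )

    optimiser = torch.optim.Adam(model.parameters(), lr=1e-3)
    loss_fn = nn.MSELoss()

    for epoch in range(5):
        # Train against the automatic annotations: no manual labelling is
        # involved, which is what makes broadcast-scale corpora usable.
        pred = model(frames)
        loss = loss_fn(pred, pseudo_labels)
        optimiser.zero_grad()
        loss.backward()
        optimiser.step()
        print(f"epoch {epoch}: loss={loss.item():.4f}")

The point of the sketch is where the labels come from: the automatic tracker rather than human annotators, so the training set can grow with the size of the video corpus.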

Additionally, we have collated a number of large video data sets within the project, which are available to other researchers. The methods developed are also openly available.

http://www.robots.ox.ac.uk/~vgg/data/pose/
Exploitation Route Generic pose detection is useful in the chosen domain (sign language interpretation) and is being used by our project partners in Oxford. However, pose detection from a single monocular camera has wider applicability in areas such as security monitoring, entertainment and assisted living. Conventionally, special equipment (e.g. a Microsoft Kinect) is required for such applications.
Sectors Digital/Communication/Information Technologies (including Software)

URL http://www.robots.ox.ac.uk/~vgg/research/pose_track/index.html
 
Description The project has been working with providers of sign language augmentation to the BBC in order to determine the applicability of the developed technologies to real-world applications.
Sector Digital/Communication/Information Technologies (including Software), Education
 
Description Oxford 
Organisation University of Oxford
Country United Kingdom 
Sector Academic/University 
PI Contribution Joint research
Collaborator Contribution Code and Ideas
Impact See joint publications:
1) Charles, J., Pfister, T., Magee, D., Hogg, D. and Zisserman, A. Upper Body Pose Estimation with Temporal Sequential Forests. In British Machine Vision Conference (BMVC), 2014.
2) Pfister, T., Simonyan, K., Charles, J. and Zisserman, A. Deep Convolutional Neural Networks for Efficient Pose Estimation in Gesture Videos. In Asian Conference on Computer Vision (ACCV), 2014.
3) Pfister, T., Charles, J. and Zisserman, A. Domain-adaptive Discriminative One-shot Learning of Gestures. In European Conference on Computer Vision (ECCV), 2014.
4) Charles, J., Pfister, T., Everingham, M. and Zisserman, A. Automatic and Efficient Human Pose Estimation for Sign Language Videos. International Journal of Computer Vision (IJCV).
5) Charles, J., Pfister, T., Magee, D., Hogg, D. and Zisserman, A. Domain Adaptation for Upper Body Pose Tracking in Signed TV Broadcasts. In British Machine Vision Conference (BMVC), 2013.
6) Pfister, T., Charles, J. and Zisserman, A. Large-scale Learning of Sign Language by Watching TV (Using Co-occurrences). In British Machine Vision Conference (BMVC), 2013.
7) Pfister, T., Charles, J., Everingham, M. and Zisserman, A. Automatic and Efficient Long Term Arm and Hand Tracking for Continuous Sign Language TV Broadcasts. In British Machine Vision Conference (BMVC), 2012.
Start Year 2010