Deep Learning for Free-Viewpoint Video in Sports and Immersive VR Experiences

Lead Research Organisation: University of Surrey

Department Name: Vision Speech and Signal Proc CVSSP

Abstract

Abstracts are not currently available in GtR for all funded research. This is normally because the abstract was not required at the time of proposal submission, but may be because it included sensitive information such as personal details.

Student:

LEWIS BRIDGEMAN

Period of Study:

Sep 17 - Jun 21

Funder:

EPSRC

Project Status:

Closed

Project Category:

Studentship

Project Reference:

1976240

Research Topic:

Unclassified

Organisations

University of Surrey (Lead Research Organisation)

People	ORCID iD
Adrian Hilton (Primary Supervisor)
LEWIS BRIDGEMAN (Student)

Publications

Author Name Title Publication Date Published

10 25 50

Bridgeman L (2019) Full-body Performance Capture of Sports from Multi-view Video (Short Paper)

Bridgeman L (2019) Multi-Person 3D Pose Estimation and Tracking in Sports

Studentship Projects

Project Reference	Relationship	Related To	Start	End	Student Name
EP/N509772/1			30/09/2016	29/09/2021
1976240	Studentship	EP/N509772/1	30/09/2017	29/06/2021	LEWIS BRIDGEMAN

Key Findings


Description	We have identified that model-based reconstruction of people from multi-view video proves a lightweight, but realistic method of generating full-body reconstructions of people in constrained environments. We have produced FVV rendering results on single people in constrained studio environments that are able to be played back in virutal reality (VR) and augmented reality (AR). Model-based reconstruction is able to capture finer details (such as fingers, and facial details) where other reconstruction methods fail. However, there is still room for improvement in capturing details not present within the model, such as clothing. Our work in "Multi-person 3D Pose Estimation and Tracking in Sports" provides a stepping-stone to FVV of multiple people in sports scenes. This work focuses on sorting and tracking pose estimations of multiple people from multiple camera views in sports scenes; pose estimations are a critical component of the model-based reconstruction pipeline. This work provides a new method for: correcting errors in pose estimations using multi-view consensus; associating 2D pose estimations between camera viewpoints; and sorting associated poses between frames to generate tracked 3D skeletons. Our approach achieves a significant improvement in speed over the state-of-the-art. "Full-body Performance Capture of Sports from Multi-view Video" extends our previous work by using the sorted pose estimations in a model-based reconstruction of multiple people in sports environments. We demonstrate results for our method on a soccer dataset comprising over 20 subjects. These initial results show that model-based reconstruction has the potential to provide smooth, temporally consistent reconstructions of multiple people on challenging sports datasets.
Exploitation Route	Our intermediate work on multiple person 3D pose estimation could find applications in a range of fields: motion capture & animation; sports player analysis; or even gait analysis in healthcare. The extension of this work in multi-person reconstruction could prove useful in the creative industry: the method allows us to generate 4D reconstructions of real human performances from video cameras, which could help to save hours of animators' and digital artists' time.
Sectors	Creative Economy Digital/Communication/Information Technologies (including Software) Leisure Activities including Sports Recreation and Tourism

Abstract

Organisations

People

ORCID iD

Publications

Studentship Projects