Geometric Evaluation of Stereoscopic Video

Lead Research Organisation: Queen Mary University of London
Department Name: Sch of Electronic Eng & Computer Science

Abstract

3D films and games have become a popular part of digital entertainment. There are more and more consumer devices that can show 3D, including the new generation of head-mounted displays, which are reawakening the idea of virtual reality. When it works properly, 3D adds a fun and interesting new dimension to the viewing experience. But sometimes there are problems; for example, the scene may look flat, or distorted. Even worse, there may be too much depth, which can eventually cause eye-strain and headaches.

This project is about evaluating the amount of depth that is being shown, so that video producers and game designers can avoid audience discomfort. There are many rules of thumb for setting the right amount of depth, but it would be safer to take a scientific approach. This project will develop the theory of 3D content evaluation, based on the following principles.

In order to see 3D, the images shown to the left and right eye must be slightly different - just as they are when viewing a real scene. The differences between the two images, which are small horizontal offsets, are called binocular disparities. Roughly speaking, the amount of disparity is proportional to the perceived depth. But in order to fully understand the depth/disparity relationship, it is necessary to know something about the human visual system, as well as the 3D display. In particular, we need to know (or guess) where in the scene the viewer is likely to be looking. The project will develop a system that can predict the point of interest, at any given moment. This information can then be used, in conjunction with the binocular disparities, to quantify how much depth is being shown. The video producer or game designer will then be able to modify the content, so that it includes the right amount of 3D.

The project is at the intersection of several research areas, including geometry, signal processing, and visual perception. A prototype software tool will be developed, in conjunction with a UK visual effects company, for the automatic analysis of 3D content. The final system will be tested by asking ordinary people to evaluate a range of 3D videos; if the system can predict the human responses, then it will be useful to video producers and game designers.

Planned Impact

The ultimate impact of this project will be to help expand the range of 3D content, including films, games, visualizations, and virtual reality environments. This will be achieved by developing a new model of stereoscopic discomfort, which will be used by video producers and game designers, to ensure that 3D content is safe to view. This will benefit the general public, who will thereby have access to a wider range of digital entertainment. In particular, the tools developed in this project will help to bring 3D content to the new generation of head-mounted displays, including the Oculus Rift and Sony Morpheus.

Software tools, based on the outcomes of this project, will be developed by engineers and producers in the visual effects industry. This sector is very strong in the UK, and makes an important contribution to the national economy (see Case for support). Hence there will be a medium-term economic benefit to these companies, which are well-placed to engage with the project and its outcomes. Impact will be ensured by working directly with a UK visual effects company (see industrial Letter of support).

More generally, the project will have an impact on 3D cinema standards, and on broadcasting research and development. In addition, there will be an impact on specialist applications, such as stereoscopic medical visualization and security scanning. The impact of the project will be transmitted through presentations and publications of the work, tailored for the engineering, biological and industrial audiences. In addition, a software implementation of the new stereo model will be made freely available on the project website, so that other researchers can adapt and extend it.

In the long term, the model developed in this project will contribute to the understanding of human binocular vision. This has clinical implications for the treatment of oculomotor disorders. The pathway to clinical impact would be via extensive testing of the new model. This could be performed, in a future project, in conjunction with psychophysics and ophthalmology researchers. The UK has a very strong vision research community, which makes this a promising strategy for long-term clinical impact.

Publications

10 25 50
publication icon
Ahmed S (2018) Constrained Optimization for Plane-Based Stereo. in IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

publication icon
Terzic K (2016) Methods for reducing visual discomfort in stereoscopic 3D: A review in Signal Processing: Image Communication

publication icon
Terzic Kasim (2017) Causes of discomfort in stereoscopic content: a review in arXiv e-prints

publication icon
Vasilyev A (2019) Optimal Control of Eye Movements During Visual Search in IEEE Transactions on Cognitive and Developmental Systems

 
Description We have developed research software for analyzing 3DTV images. The system can automatically predict whether or not a particular image will be comfortable to view in 3D. We have also developed a range of software tools, which are used to prepare stereo images for display on 3DTVs.
Exploitation Route We may make our software and data publicly available, after the key papers have been published (still pending).
Sectors Creative Economy,Digital/Communication/Information Technologies (including Software)

 
Title Preprocessed stereoscopic content 
Description Collection of stereoscopic content, preprocessed for display on 3DTVs (rectified, cropped, and stacked). 
Type Of Material Database/Collection of data 
Provided To Others? No  
Impact This database was used in a psychophysical experiment, to measure visual discomfort. 
 
Title Statistical model of stereoscopic visual discomfort 
Description This model predicts subjective visual discomfort ratings, for stereoscopic content, based on the statistics of the associated disparity map. 
Type Of Material Computer model/algorithm 
Provided To Others? No  
Impact Currently being used to analyze experimental data. 
 
Title Software for stereo image analysis 
Description Software for processing and analyzing stereoscopic 3D content. Designed for statistical analysis of disparity maps, in the context of visual discomfort. Written in C99 and R. 
Type Of Technology Software 
Year Produced 2016 
Impact Currently being used to analyze data, for research publications.