Estimating the intrinsic characteristics of real images to aid analysis

Lead Research Organisation: University of Birmingham

Department Name: School of Psychology

Abstract

Humans find seeing things effortless and this hides the fact that making sense of the visual world is a very difficult problem. Vision is difficult because each image we see could have been made by an infinite number of object and lighting combinations. Think of the simplest image property - grey level. The grey level of each pixel in an image is determined by the amount of light falling onto each object and the amount of light that is reflected back from each object. Dark objects have lower grey levels than light ones but even light objects have low grey levels when in shadow. We cannot tell whether we are looking at a dark object in bright light or a light object in shadow just by measuring grey levels. Even worse, when grey levels are different in different parts of an image we cannot tell if this difference is due to there being two objects or a change in the amount of light. Despite this problem humans are very good at working out the reasons for grey level changes; we CAN tell objects from shadows.One reason why we are so good at working out what's going on in images maybe that we use other properties such as colour and pattern to tell us what the grey levels mean. This idea has led to the concept of 'intrinsic images'. An intrinsic image is an image that describes one property of the scene. So instead of having a single image that mixes up shadows and object reflectances we might produce two intrinsic images one each for shadows and reflectance. Scientists have already succeeded in producing intrinsic images like these by using colour changes to work out what the grey levels mean. But, there is more than one type of shadow and more than one type of reflection. We want to improve on the existing methods by producing four intrinsic images instead of two. Our first intrinsic image will contain the type of gentle shading that is found on undulating surfaces. Our second intrinsic image will contain the hard shadows that are produced when an object blocks the light. Our third intrinsic image will describe the reflectance of matte objects and our forth image the reflections from shiny objects. To separate out these four images we will need to use additional information beyond colour. We think that surface patterns (e.g. wood grain) will provide the necessary information.Extracting four intrinsic images will be very helpful to those engineers who try to make computers understand what's going on in an image. To take just one example, humans seem to be very good at is estimating the shape of undulations on a surface from the way that it is shaded. We are so good at this that we do it automatically and the people who write computer software can trick us into thinking that their 'buttons' stand out from the screen just by adding a some highlights and shading to the edges. There are many computer programs that try to interpret shape-from-shading. While many of these programs work well they tend to assume that all changes in grey level are due to shading which is in tern due to surface undulations. We know that this assumption is not true in real pictures and these programs tend to do badly when looking at such images. But if we can produce shading only images from real images then these programs may work better.To test our ideas and decide on the best way achieve our desired results we will collect a large number of photographs of objects whose shape we either already know or can work out. We will calibrate these pictures very carefully and then use them to workout what information is conveyed by colour and pattern that can help us to workout the meaning of each grey level change. We will also test humans to see which cues they might be using. We will make our images available on the Internet so that others can try out their ideas too. We intend to work with a software company who will take the best of our ideas and implement them in a computer program that can automatically design embossed jewellery from photographs.

Funded Value:

£381,159

Funded Period:

Apr 08 - Oct 11

Funder:

EPSRC

Project Status:

Closed

Project Category:

Research Grant

Project Reference:

EP/F026269/1

Principal Investigator:

Andrew Schofield

Research Subject:

Info. & commun. Technol. (100%)

Research Topic:

Vision & Senses - ICT appl. (100%)

Organisations

People	ORCID iD
Andrew Schofield (Principal Investigator)
Jeremy Wyatt (Co-Investigator)
Michael Spann (Researcher)

Publications

Author Name

Title Publication Date Published

10 25 50

Dövencioglu D (2013) Perceptual integration for qualitatively different 3-D cues in the human brain. in Journal of cognitive neuroscience

Dövencioglu DN (2010) Perceptual learning for second-order cues in a shape-from-shading task in Perception

Dövencioglu DN (2013) Perceptual learning of second order cues for layer decomposition. in Vision research

Georgeson MA (2016) Binocular functional architecture for detection of contrast-modulated gratings. in Vision research

Jiang X (2010) Layer segmentation using hue, texture and luminance amplitude in a steerable filter framework in Perception

Jiang X (2010) Computer Vision - ECCV 2010 - 11th European Conference on Computer Vision, Heraklion, Crete, Greece, September 5-11, 2010, Proceedings, Part IV

Jiang X. (2010) Layer segmentation using hue, texture and luminance amplitude in a steerable filter framework in PERCEPTION

Mazzilli G (2012) The effects of lighting direction and elevation on judgements of shape-from-shading. in Journal of Vision

Mazzilli G (2013) A cue-free method to probe human lighting biases. in Perception

Schofield A (2010) The role of second-order vision in discriminating shading versus material changes in Journal of Vision

Schofield A (2013) Shape Perception in Human and Computer Vision - An Interdisciplinary Perspective

Schofield A. J. (2010) The Birmingham Object Lighting Database: a stereoscopic database of objects, faces, surfaces and outdoor scenes captured under characterised lighting conditions in PERCEPTION

Schofield AJ (2011) Sun and sky: Does human vision assume a mixture of point and diffuse illumination when interpreting shape-from-shading? in Vision research

Schofield AJ (2014) Texture variations suppress suprathreshold brightness and colour variations. in PloS one

Schofield AJ (2010) What is second-order vision for? Discriminating illumination versus material changes. in Journal of vision

Schofield AJ (2010) The Birmingham Object Lighting Database: a stereoscopic database of objects, faces, surfaces and outdoor scenes captured under characterised lighting in Perception

Sun P (2011) The efficacy of local luminance amplitude in disambiguating the origin of luminance signals depends on carrier frequency: further evidence for the active role of second-order vision in layer decomposition. in Vision research

Sun P (2012) Two operational modes in the perception of shape from shading revealed by the effects of edge information in slant settings. in Journal of vision

Sun P (2009) Using texture to recover shading and reflectance information from real images in Perception

Sun Peng (2009) Using texture to recover shading and reflectance information from real images in PERCEPTION

Key Findings
Research Databases and Models
Engagement Activities


Description	We developed and tested a novel algorithm for separating shadows and illumination changes from material changes in natural images. We developed a new model for how humans might perform the same operation. We studies the way that humans use lighting cues in shape-from-shading judgement. we developed a stereoscopic database of objects, faces and natural environments under specified lighting conditions.
Exploitation Route	intrinsic image separation method could be used for shadow removal by the creative arts and software industries.
Sectors	Creative Economy,Digital/Communication/Information Technologies (including Software)


Title	Birmingham Object Lighting Database
Description	Database of stereoscopic image pairs for objects, faces and outdoor scenes for a range of specified lighting conditions.
Type Of Material	Database/Collection of data
Year Produced	2011
Provided To Others?	Yes
Impact	The database has been accessed by other research groups
URL	http://www.bold.bham.ac.uk


Description	Meet the Scientist
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	Regional
Primary Audience	Public/other audiences
Results and Impact	Demonstrations sparked questions. None
Year(s) Of Engagement Activity	2012

Abstract

Organisations

People

ORCID iD

Publications