Image statistics in the visual system

Lead Research Organisation: City, University of London
Department Name: Optometry and Visual Science

Abstract

The overall aim of our project is discover and describe statistical calculations performed by the human visual system.Our perceptions are thought to result from the combination of prior knowledge with data we gather from the environment. Of course, the visual system cannot afford to be too meticulous a scientist and measure everything. Real environments are just too complex and dynamic. Instead, the visual system must settle for being a clever statistician and quickly estimate various visual features.Exactly which visual features can be quickly estimated is a matter of some debate, but there is no question that spatial orientation (i.e. tilt) is one of them. The primary evidence for orientation's special status in the visual system comes from search experiments, in which observers must find a target region, having a unique orientation, amongst distractors, having different orientations. When the average orientation of the target region is sufficiently different from that of the distractors, then the number of distractors will be irrelevant; the target just pops out. Thus, we know that the visual system can compute mean orientation without too much conscious effort. What about orientation variance? That is, would a region with lots of different orientations pop out from regions with largely similar orientations? If the visual system proves capable of this type of statistical analysis, can it do something similar with other visual features, like colour and size?

Publications

10 25 50

publication icon
Nam JH (2009) Coherent plaids are preattentively more than the sum of their parts. in Attention, perception & psychophysics

publication icon
Solomon JA (2009) The history of dipper functions. in Attention, perception & psychophysics

publication icon
Tomassini A (2010) Orientation uncertainty reduces perceived obliquity. in Vision research

 
Description The overall aim of our project was to discover and describe statistical calculations performed by the human visual system.

Our perceptions are thought to result from the combination of prior knowledge with data we gather from the environment. Of course, the visual system cannot afford to be too meticulous a scientist and measure everything. Real environments are just too complex and dynamic. Instead, the visual system must settle for being a clever statistician and quickly estimate various visual features.

Exactly which visual features can be quickly estimated remains a matter of some debate, but there is no question that spatial orientation (i.e. tilt) is one of them. For example, the observer may infer a forest scene if there are more than 8 or 9 trees in the picture. If most of those trees aren't within a few degrees of vertical, the observer may infer that the camera was tilted.

We have now determined that people use relatively few trees (or lines) when quickly computing average tilt. They are much more efficient when asked to make a judgement regarding the range of tilts. Futhermore, we have discovered that estimates of average and range are not independent. When the range of tilts is large, estimates of the average tend to be closer to vertical than when the range is small. This latter result is consistent with a stable prior, or prejudice, for those orientations most prevalent in natural scenes.

The primary evidence for orientation's special status in the visual system comes from search experiments, in which observers must find a "target region," having a unique orientation, amongst "distractors," having different orientations. When the average orientation of the target region is sufficiently different from that of the distractors, then the number of distractors will be irrelevant; the target just "pops out."

Consider the plus sign (+), which has two perpendicular orientations at the same position. Our observers were much faster when asked to find a plus sign amongst minuses (-) and slashes (/,\) than they were when asked to find one of the latter (e.g. a minus sign) amongst pluses. This asymmetry further suggests that the range or multiplicity of orientations in different parts of a scene can be can be accurately and simultaneously estimated at a glance.
 
Description EPSRC
Amount £541,000 (GBP)
Funding ID Efficiencies 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start  
 
Description EPSRC
Amount £541,000 (GBP)
Funding ID Efficiencies 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start