Computational modelling visual perception in a biologically realistic neural network: Developing rich, hierarchical representations of visual scenes
Lead Research Organisation:
University of Oxford
Department Name: Experimental Psychology
Abstract
The task of visual object recognition has been solved by many computational neural networks; however, this alone is insufficient for making sense of the visual world. It is also necessary to form hierarchical representations of the features and objects within a scene, encoding their semantic relations to one another (e.g. "wheel" is part of "car"). This is the binding problem in psychology: when multiple objects are present, how do we know which features are part of which objects?
A potential solution to the binding problem has recently been proposed, in the form of a model network which incorporates four key biological features that set it apart from previous computer vision networks: (i) top-down and lateral connections, (ii) spiking dynamics, (iii) randomly distributed axonal delays, and (iv) spike-timing dependent plasticity. This network has the potential to encode stimuli in the form of groups of neurons which fire in a temporally precise sequence (polychronous neural groups), and has been shown to develop neurons that fire only if a low-level feature representation is driving a high-level feature representation. So far, however, only very limited stimulus sets have been used for training and relatively surface-level analyses employed.
I propose to train this network on a more ecologically valid stimulus set than that previously used, including images of 3D objects and multi-object scenes, and carry out an in-depth analysis of the representations that develop through the layers of the network, encoded both by individual neurons and polychronous groups. Such a project would take steps towards the creation of a biologically- realistic computer network that can truly make sense of the visual world. This would be of much practical use, and provide insight into how the primate visual system develops hierarchical representations in which visual features are correctly bound: a long-standing and crucial problem in visual psychology.
A potential solution to the binding problem has recently been proposed, in the form of a model network which incorporates four key biological features that set it apart from previous computer vision networks: (i) top-down and lateral connections, (ii) spiking dynamics, (iii) randomly distributed axonal delays, and (iv) spike-timing dependent plasticity. This network has the potential to encode stimuli in the form of groups of neurons which fire in a temporally precise sequence (polychronous neural groups), and has been shown to develop neurons that fire only if a low-level feature representation is driving a high-level feature representation. So far, however, only very limited stimulus sets have been used for training and relatively surface-level analyses employed.
I propose to train this network on a more ecologically valid stimulus set than that previously used, including images of 3D objects and multi-object scenes, and carry out an in-depth analysis of the representations that develop through the layers of the network, encoded both by individual neurons and polychronous groups. Such a project would take steps towards the creation of a biologically- realistic computer network that can truly make sense of the visual world. This would be of much practical use, and provide insight into how the primate visual system develops hierarchical representations in which visual features are correctly bound: a long-standing and crucial problem in visual psychology.
Organisations
People |
ORCID iD |
Simon Stringer (Primary Supervisor) | |
Elliot Smith (Student) |
Studentship Projects
Project Reference | Relationship | Related To | Start | End | Student Name |
---|---|---|---|---|---|
MR/S502509/1 | 01/10/2018 | 30/06/2022 | |||
2108388 | Studentship | MR/S502509/1 | 01/10/2018 | 19/02/2021 | Elliot Smith |