Paper
8 September 1993 Behavioral model of visual perception and recognition
Ilya A. Rybak, Alexander V. Golovan, Valentina I. Gusakova
Author Affiliations +
Proceedings Volume 1913, Human Vision, Visual Processing, and Digital Display IV; (1993) https://doi.org/10.1117/12.152729
Event: IS&T/SPIE's Symposium on Electronic Imaging: Science and Technology, 1993, San Jose, CA, United States
Abstract
In the processes of visual perception and recognition human eyes actively select essential information by way of successive fixations at the most informative points of the image. A behavioral program defining a scanpath of the image is formed at the stage of learning (object memorizing) and consists of sequential motor actions, which are shifts of attention from one to another point of fixation, and sensory signals expected to arrive in response to each shift of attention. In the modern view of the problem, invariant object recognition is provided by the following: (1) separated processing of `what' (object features) and `where' (spatial features) information at high levels of the visual system; (2) mechanisms of visual attention using `where' information; (3) representation of `what' information in an object-based frame of reference (OFR). However, most recent models of vision based on OFR have demonstrated the ability of invariant recognition of only simple objects like letters or binary objects without background, i.e. objects to which a frame of reference is easily attached. In contrast, we use not OFR, but a feature-based frame of reference (FFR), connected with the basic feature (edge) at the fixation point. This has provided for our model, the ability for invariant representation of complex objects in gray-level images, but demands realization of behavioral aspects of vision described above. The developed model contains a neural network subsystem of low-level vision which extracts a set of primary features (edges) in each fixation, and high- level subsystem consisting of `what' (Sensory Memory) and `where' (Motor Memory) modules. The resolution of primary features extraction decreases with distances from the point of fixation. FFR provides both the invariant representation of object features in Sensor Memory and shifts of attention in Motor Memory. Object recognition consists in successive recall (from Motor Memory) and execution of shifts of attention and successive verification of the expected sets of features (stored in Sensory Memory). The model shows the ability of recognition of complex objects (such as faces) in gray-level images invariant with respect to shift, rotation, and scale.
© (1993) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Ilya A. Rybak, Alexander V. Golovan, and Valentina I. Gusakova "Behavioral model of visual perception and recognition", Proc. SPIE 1913, Human Vision, Visual Processing, and Digital Display IV, (8 September 1993); https://doi.org/10.1117/12.152729
Lens.org Logo
CITATIONS
Cited by 7 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image processing

Image segmentation

Visual process modeling

Visualization

Sensors

Feature extraction

Image sensors

RELATED CONTENT

Refractory neural nets and vision
Proceedings of SPIE (February 25 2014)
Data acquisition from cemetery headstones
Proceedings of SPIE (February 04 2013)
Image segmentation through Gabor-based neural networks
Proceedings of SPIE (September 16 1992)
Palm print image processing with PCNN
Proceedings of SPIE (August 20 2010)

Back to Top