Paper
A bio-inspired system for spatio-temporal recognition in static and video imagery
27 April 2007
Deepak Khosla, Christopher K. Moore, Suhas Chelian
Abstract
This paper presents a bio-inspired method for spatio-temporal recognition in static and video imagery. It builds upon and extends our previous work on a bio-inspired Visual Attention and object Recognition System (VARS), which locates and recognizes objects in a single frame. This work presents two extensions of VARS. The first extension is a Scene Recognition Engine (SCE) that learns to recognize the spatial relationships between the objects that compose a particular scene category in static imagery. It could be used to recognize the category of a scene, e.g., an office vs. a kitchen scene. The second extension is the Event Recognition Engine (ERE), which recognizes spatio-temporal sequences, or events, in image sequences. This extension uses a working memory model to recognize events and behaviors in video imagery by maintaining and recognizing ordered spatio-temporal sequences. The working memory model is based on an ARTSTORE [1] neural network that combines an ART-based neural network with a cascade of sustained temporal order recurrent (STORE) [1] neural networks. A series of Default ARTMAP classifiers ascribes event labels to these sequences. Our preliminary studies have shown that this extension is robust to variations in an object's motion profile. We evaluated the performance of the SCE and ERE on real datasets. The SCE module was tested on a visual scene classification task using the LabelMe [2] dataset. The ERE was tested on real-world video footage of vehicles and pedestrians in a street scene. Our system is able to recognize the vehicle and pedestrian events in this footage.
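As an illustration of the idea behind the ERE, the following minimal Python sketch shows how an ordered sequence can be held as a graded activation pattern and then labeled. It is a simplified stand-in written under stated assumptions, not the ARTSTORE or Default ARTMAP networks used in the paper: the decay rule, the nearest-prototype classifier, the function names (`encode_order`, `classify`), and the event tokens and labels are all hypothetical and chosen only for this example.

```python
from typing import Dict, List, Sequence


def encode_order(events: Sequence[str], vocabulary: Sequence[str],
                 decay: float = 0.5) -> List[float]:
    """Encode the order of first occurrences as graded activations.

    Assumption: this is a toy rule, NOT the STORE equations. The first
    distinct event gets activation 1.0 and each later distinct event gets
    `decay` times the previous level. Repeated events are ignored, so how
    long an event persists does not change the resulting code.
    """
    activation = {name: 0.0 for name in vocabulary}
    level = 1.0
    for event in events:
        if activation[event] == 0.0:  # first time this event is seen
            activation[event] = level
            level *= decay
    return [activation[name] for name in vocabulary]


def classify(code: Sequence[float],
             prototypes: Dict[str, Sequence[float]]) -> str:
    """Return the label of the closest stored code.

    Assumption: nearest-prototype matching stands in for the Default
    ARTMAP classifiers named in the abstract.
    """
    def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(prototypes,
               key=lambda label: squared_distance(code, prototypes[label]))


# Hypothetical event tokens and labels, for illustration only.
VOCAB = ["approach", "stop", "depart"]
PROTOTYPES = {
    "drop-off": encode_order(["approach", "stop", "depart"], VOCAB),
    "reverse": encode_order(["depart", "stop", "approach"], VOCAB),
}

# The same event observed with a slow and a fast motion profile.
slow = ["approach"] * 3 + ["stop"] * 4 + ["depart"]
fast = ["approach", "stop", "depart"]

print(encode_order(slow, VOCAB) == encode_order(fast, VOCAB))
print(classify(encode_order(slow, VOCAB), PROTOTYPES))
```

Because the code depends only on the order in which distinct events first occur, the slow and fast observations map to the same pattern. This is one simple way to picture robustness to an object's motion profile; the paper's networks may achieve it differently.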
© (2007) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Deepak Khosla, Christopher K. Moore, and Suhas Chelian "A bio-inspired system for spatio-temporal recognition in static and video imagery", Proc. SPIE 6560, Intelligent Computing: Theory and Applications V, 656002 (27 April 2007); https://doi.org/10.1117/12.719975
CITATIONS
Cited by 2 scholarly publications and 15 patents.
KEYWORDS
Video, Scene classification, Visualization, Neural networks, Biomimetics, Computer programming, Image segmentation