Paper
8 February 2015 Semantic video segmentation using both appearance and geometric information
Jihwan Woo, Kris Kitani, Sehoon Kim, Hantak Kwak, Woosung Shim
Author Affiliations +
Proceedings Volume 9406, Intelligent Robots and Computer Vision XXXII: Algorithms and Techniques; 94060B (2015) https://doi.org/10.1117/12.2083165
Event: SPIE/IS&T Electronic Imaging, 2015, San Francisco, California, United States
Abstract
The segmentation is the first step and core technology for semantic understanding of the video. Many tasks in the computer vision such as tracking, recognition and 3D reconstruction, etc. rely on the segmentation result as preprocessing. However, the video segmentation has been known to be a very complicated and hard problem. The objects in the video change their colors and shapes according to the surrounding illumination, the camera position, or the object motion. The color, motion, or depth has been utilized individually as a key clue for the segmentation in many researches. However, every object in the image is composed of several features such as color, texture, depth and motion. That is why single-feature based segmentation method often fails. Humans can segment the objects in video with ease because the human visual system enables to consider color, texture, depth and motion at the same time. In this paper, we propose the video segmentation algorithm which is motivated by the human visual system. The algorithm performs the video segmentation task by simultaneously utilizing the color histogram of the color, the optical flow of the motion, and the homography of the structure. Our results show that the proposed algorithm outperforms other appearance based segmentation method in terms of semantic quality of the segmentation [15]. The proposed segmentation method will serve as a basis for better high-level tasks such as recognition, tracking [3],[4] and video understanding [1].
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jihwan Woo, Kris Kitani, Sehoon Kim, Hantak Kwak, and Woosung Shim "Semantic video segmentation using both appearance and geometric information", Proc. SPIE 9406, Intelligent Robots and Computer Vision XXXII: Algorithms and Techniques, 94060B (8 February 2015); https://doi.org/10.1117/12.2083165
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Video

Optical flow

Image processing algorithms and systems

Semantic video

Detection and tracking algorithms

Visual system

Back to Top