Paper
31 December 2019 Optimization of the OSVOS model
Shizhan Hong, Tieyong Cao, Shengkai Xiang, Zheng Fang, Xiaotong Deng, Yifeng Peng, Lei Xiang
Author Affiliations +
Proceedings Volume 11384, Eleventh International Conference on Signal Processing Systems; 113840Z (2019) https://doi.org/10.1117/12.2559779
Event: Eleventh International Conference on Signal Processing Systems, 2019, Chengdu, China
Abstract
We solve the problem of video object segmentation by investigating how to expand the role of convolution in convolutional neural networks. Based on the One-Shot Video Object Segmentation (OSVOS) which can successfully tackle the task of semi-supervised video object segmentation, we introduce U-shape architecture. We first build a Global Guidance Module (GGM) on the bottom-up path to provide location information of potentially significant objects for layers of different feature levels. Then we design a Multi-scale Convolution Module (MCM) to fully get feature information and a Feature Fusion Module (FFM) to make the coarse-level semantic information well fused with the finelevel features from the top-down pathway. GGM and FFM allow the high-level semantic features to be progressively refined, yielding detail enriched segmentation maps. The experimental results on DAVIS 2016 data set shows that our proposed approach can more accurately locate the segmentation objects with sharpened details and our model has improved on all indicators than OSVOS.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Shizhan Hong, Tieyong Cao, Shengkai Xiang, Zheng Fang, Xiaotong Deng, Yifeng Peng, and Lei Xiang "Optimization of the OSVOS model", Proc. SPIE 11384, Eleventh International Conference on Signal Processing Systems, 113840Z (31 December 2019); https://doi.org/10.1117/12.2559779
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Image segmentation

Convolution

Image processing

Optimization (mathematics)

Network architectures

Data modeling

RELATED CONTENT

A HWMSE for clue detection the system design and...
Proceedings of SPIE (October 26 2013)
Deep learning-based multi-object association retrieval
Proceedings of SPIE (May 13 2022)
Knowledge-guided parsing in video databases
Proceedings of SPIE (April 14 1993)

Back to Top