Paper
15 March 2019 Exploring the two-stage approach in neural network compression for object detection
Xiao Meng, Lixin Yu D.D.S., Zhiyong Qin D.D.S.
Author Affiliations +
Proceedings Volume 11041, Eleventh International Conference on Machine Vision (ICMV 2018); 110411W (2019) https://doi.org/10.1117/12.2522911
Event: Eleventh International Conference on Machine Vision (ICMV 2018), 2018, Munich, Germany
Abstract
Recently, convolutional neural network (CNN) has been widely implemented in the compute vision, nature language processing and automatic driving. However, it makes much difficulties to employ the neural network in the embedded system because of the limit of memory storage and the computation bandwidth. To address those limitations, we explore a two-stage approach in neural network compression for the scene, object detection. In this paper, we first propose an effective pruning approach on a trained neural network, and achieve total 81.86%-91.54% sparse rate with the accuracy losing 1-3%. Then we explore the quantization method to apply on the pruned neural network, and propose an adaptive codebook to store the quantized weight parameters and the index of the weight parameters. We utilize the two-stage model compression approach, model pruning and weights quantization, to implement on tiny-YOLO, the state-of-art object detection model, achieving total 41.9-62.7X compression rate with the accuracy loss less than 3.3%.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Xiao Meng, Lixin Yu D.D.S., and Zhiyong Qin D.D.S. "Exploring the two-stage approach in neural network compression for object detection", Proc. SPIE 11041, Eleventh International Conference on Machine Vision (ICMV 2018), 110411W (15 March 2019); https://doi.org/10.1117/12.2522911
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication and 1 patent.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Neural networks

Quantization

Binary data

Convolutional neural networks

Performance modeling

Visual process modeling

Computer vision technology

RELATED CONTENT


Back to Top