Paper
30 April 2022 Deep learning-based feature compression for video coding for machine
Author Affiliations +
Proceedings Volume 12177, International Workshop on Advanced Imaging Technology (IWAIT) 2022; 121773B (2022) https://doi.org/10.1117/12.2626099
Event: International Workshop on Advanced Imaging Technology 2022 (IWAIT 2022), 2022, Hong Kong, China
Abstract
We previously trained the compression network via optimization of bit-rate and distortion (feature domain MSE) [1]. In this paper, we propose feature map compression method for video coding for machine (VCM) based on deep learning-based compression network that joint training for optimizing both compressed bit rate and machine vision task performance. We use bmshij2018-hyperporior model in the CompressAI [2] as the compression network, and compress the feature map which is the output of stem layer in the Faster R-CNN X101-FPN network of Detectron2 [3]. We evaluated the proposed method by evaluation framework for MPEG VCM. The proposed method shows the better results than VVC of MPEG VCM anchor.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jihoon Do, Jooyoung Lee, Younhee Kim, Se Yoon Jeong, and Jin Soo Choi "Deep learning-based feature compression for video coding for machine", Proc. SPIE 12177, International Workshop on Advanced Imaging Technology (IWAIT) 2022, 121773B (30 April 2022); https://doi.org/10.1117/12.2626099
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Video compression

Machine vision

Distortion

Image compression

Signal processing

Video coding

RELATED CONTENT


Back to Top