Paper
23 June 2003 Semantic transcoding of video based on regions of interest
Jeongyeon Lim, Munchurl Kim, Jong-Nam Kim, Kyeongsoo Kim
Author Affiliations +
Proceedings Volume 5150, Visual Communications and Image Processing 2003; (2003) https://doi.org/10.1117/12.503081
Event: Visual Communications and Image Processing 2003, 2003, Lugano, Switzerland
Abstract
Traditional transcoding on multimedia has been performed from the perspectives of user terminal capabilities such as display sizes and decoding processing power, and network resources such as available network bandwidth and quality of services (QoS) etc. The adaptation (or transcoding) of multimedia contents to given such constraints has been made by frame dropping and resizing of audiovisual, as well as reduction of SNR (Signal-to-Noise Ratio) values by saving the resulting bitrates. Not only such traditional transcoding is performed from the perspective of user’s environment, but also we incorporate a method of semantic transcoding of audiovisual based on region of interest (ROI) from user’s perspective. Users can designate their interested parts in images or video so that the corresponding video contents can be adapted focused on the user’s ROI. We incorporate the MPEG-21 DIA (Digital Item Adaptation) framework in which such semantic information of the user’s ROI is represented and delivered to the content provider side as XDI (context digital item). Representation schema of our semantic information of the user’s ROI has been adopted in MPEG-21 DIA Adaptation Model. In this paper, we present the usage of semantic information of user’s ROI for transcoding and show our system implementation with experimental results.
© (2003) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jeongyeon Lim, Munchurl Kim, Jong-Nam Kim, and Kyeongsoo Kim "Semantic transcoding of video based on regions of interest", Proc. SPIE 5150, Visual Communications and Image Processing 2003, (23 June 2003); https://doi.org/10.1117/12.503081
Lens.org Logo
CITATIONS
Cited by 3 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Multimedia

Semantic video

Signal to noise ratio

Visualization

Binary data

Spatial resolution

Back to Top