Paper
14 March 2005 Vision-based speaker location detection
Author Affiliations +
Proceedings Volume 5685, Image and Video Communications and Processing 2005; (2005) https://doi.org/10.1117/12.587326
Event: Electronic Imaging 2005, 2005, San Jose, California, United States
Abstract
Generally, speaker location detection in video conferencing is audio-based. However, physical room environment which is beyond the control of the speaker detection system can severely change room acoustics. Room acoustics introduce interference and can deteriorate the performance of audio-based speaker detection system. In this paper, we propose a video-based speaker detection method which can be used independently or along with audio-based detection systems. The information on speaker location is intended to create 3-dimensional audio reproduction in order to provide more reality to video conference. In the proposed ethod, we detect moving lips in video sequences. We first detect lips using color information and determine whether the lips are moving. Experiments with real videos provide promising results.
© (2005) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jaehyun Lim, Jonggeun Park, and Chulhee Lee "Vision-based speaker location detection", Proc. SPIE 5685, Image and Video Communications and Processing 2005, (14 March 2005); https://doi.org/10.1117/12.587326
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Laser induced plasma spectroscopy

Skin

Facial recognition systems

Image segmentation

Eye

Nose

Back to Top