Paper
15 November 2007 Audio-visual gender recognition
Ming Liu, Xun Xu, Thomas S. Huang
Author Affiliations +
Proceedings Volume 6788, MIPPR 2007: Pattern Recognition and Computer Vision; 678803 (2007) https://doi.org/10.1117/12.774687
Event: International Symposium on Multispectral Image Processing and Pattern Recognition, 2007, Wuhan, China
Abstract
Combining different modalities for pattern recognition task is a very promising field. Basically, human always fuse information from different modalities to recognize object and perform inference, etc. Audio-Visual gender recognition is one of the most common task in human social communication. Human can identify the gender by facial appearance, by speech and also by body gait. Indeed, human gender recognition is a multi-modal data acquisition and processing procedure. However, computational multimodal gender recognition has not been extensively investigated in the literature. In this paper, speech and facial image are fused to perform a mutli-modal gender recognition for exploring the improvement of combining different modalities.
© (2007) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Ming Liu, Xun Xu, and Thomas S. Huang "Audio-visual gender recognition", Proc. SPIE 6788, MIPPR 2007: Pattern Recognition and Computer Vision, 678803 (15 November 2007); https://doi.org/10.1117/12.774687
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visualization

Acoustics

Binary data

Databases

Pattern recognition

Neural networks

Computer vision technology

Back to Top