Paper
25 August 2004 Generalized dimensions applied to speaker identification
Limin Hou, Shuozhong Wang
Author Affiliations +
Abstract
This paper describes an application of fractal dimensions to speech processing and speaker identification. There are several dimensions that can be used to characterize speech signals such as box dimension, correlation dimension, etc. We are mainly concerned with the generalized dimensions of speech signals as they provide more information than individual dimensions. Generalized dimensions of arbitrary orders are used in speaker identification in this work. Based on the experimental data, the artificial phase space is generated and smooth behavior of correlation integral is obtained in a straightforward and accurate analysis. Using the dimension D(2) derived from the correlation integral, the generalized dimension D(q) of an arbitrary order q is calculated. Moreover, experiments applying the generalized dimension in speaker identification have been carried out. A speaker recognition dedicated Chinese language speech corpus with PKU-SRSC, recorded by Peking University, was used in the experiments. The results are compared to a baseline speaker identification that uses MFCC features. Experimental results have indicated the usefulness of fractal dimensions in characterizing speaker's identity.
© (2004) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Limin Hou and Shuozhong Wang "Generalized dimensions applied to speaker identification", Proc. SPIE 5404, Biometric Technology for Human Identification, (25 August 2004); https://doi.org/10.1117/12.542828
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Fractal analysis

Communication engineering

Speaker recognition

Turbulence

Acoustics

Chromium

Electronics

RELATED CONTENT

Tracing parallel vectors
Proceedings of SPIE (January 16 2006)
Add prior knowledge to speaker recognition
Proceedings of SPIE (March 28 2005)
Wavelet analysis of multifractal functions
Proceedings of SPIE (September 01 1995)

Back to Top