23 August 2022
Forensic synthetic speech inspection technique based on formant comparison
Qimeng Lu, Puyang Geng, Hong Guo, Xiaohong Chen, Shaopei Shi
Proceedings Volume 12330, International Conference on Cyber Security, Artificial Intelligence, and Digital Economy (CSAIDE 2022); 1233007 (2022) https://doi.org/10.1117/12.2646316
Event: International Conference on Cyber Security, Artificial Intelligence, and Digital Economy (CSAIDE 2022), 2022, Huzhou, China
Abstract
With the development of speech synthesis technology, the simulation of a specific individual’s speech has gradually matured. Synthetic speech is now easily perceived as real speech, and it may be used frequently in illegal activities. To identify such crimes, forensic techniques such as the comparison of formants, pitch, and rhythm are widely used. The present study investigates whether formant comparison can reveal the differences between perceptually similar synthetic speech (hereinafter “personal anchor” speech) and real speech. To this end, two young males and two young females from different dialect regions were recruited to read the same text. Their voices were recorded and used to generate four “personal anchors” using sound-spectrum and statistical analysis software. Several formant parameters, including numerical statistics, stability analysis, and transitional-segment features, were compared to analyze the differences between the real speech and the corresponding “personal anchors”. It was found that numerical or stability analysis of formants was not sufficient to fully determine whether the speech was synthesized, whereas comparing the transitional segments of certain specific syllables could efficiently distinguish synthetic speech from real speech.
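The three kinds of formant comparison named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: it assumes formant tracks have already been extracted as lists of frequencies in Hz at a fixed frame step, and the function names (`formant_summary`, `frame_deltas`), the 10 ms frame step, and the F2 values are all hypothetical, chosen only to show how two tracks with the same mean can still differ in how they move through a transition.

```python
from statistics import mean, pstdev


def formant_summary(track_hz):
    """Numerical and stability measures for one formant track (values in Hz).

    Returns the mean frequency and the coefficient of variation
    (population standard deviation divided by the mean).
    """
    m = mean(track_hz)
    return {"mean_hz": m, "cv": pstdev(track_hz) / m}


def frame_deltas(track_hz, frame_step_ms):
    """Frame-to-frame rate of change (Hz per ms) across a transitional segment."""
    return [
        (later - earlier) / frame_step_ms
        for earlier, later in zip(track_hz, track_hz[1:])
    ]


# Hypothetical F2 tracks over one consonant-vowel transition, 10 ms frames.
# These numbers are invented for illustration and are not data from the paper.
real_f2 = [1200.0, 1350.0, 1500.0, 1650.0, 1800.0]
synth_f2 = [1200.0, 1210.0, 1500.0, 1790.0, 1800.0]

real_stats = formant_summary(real_f2)
synth_stats = formant_summary(synth_f2)
real_slopes = frame_deltas(real_f2, frame_step_ms=10)
synth_slopes = frame_deltas(synth_f2, frame_step_ms=10)

print("mean (Hz):", real_stats["mean_hz"], synth_stats["mean_hz"])
print("cv:", round(real_stats["cv"], 3), round(synth_stats["cv"], 3))
print("real slopes (Hz/ms):", real_slopes)
print("synthetic slopes (Hz/ms):", synth_slopes)
```

In this toy input the two tracks share the same mean and the same endpoints, so a purely numerical summary cannot separate them, while the per-frame slopes differ. Any decision threshold would have to come from real measurements.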
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Qimeng Lu, Puyang Geng, Hong Guo, Xiaohong Chen, and Shaopei Shi "Forensic synthetic speech inspection technique based on formant comparison", Proc. SPIE 12330, International Conference on Cyber Security, Artificial Intelligence, and Digital Economy (CSAIDE 2022), 1233007 (23 August 2022); https://doi.org/10.1117/12.2646316
KEYWORDS
Forensic science
Inspection
Algorithm development
Statistical analysis
Neural networks
Software development
Speaker recognition
