Paper
6 May 2022 A new type of Chinese speech synthesis technology and system research
Xinguang Li, Chuhua Liang, Shanxian Ma, Congcong Liu, Shuai Chen, Ruisi Li, Haoxin He
Author Affiliations +
Proceedings Volume 12256, International Conference on Electronic Information Engineering, Big Data, and Computer Technology (EIBDCT 2022); 122562Q (2022) https://doi.org/10.1117/12.2635374
Event: 2022 International Conference on Electronic Information Engineering, Big Data and Computer Technology, 2022, Sanya, China
Abstract
In this paper, we propose a method of Chinese speech synthesis. In order to achieve the purpose of synthesizing fluent and natural speech, two processing methods, rule-based and statistics-based, are used in the developed system. Firstly, a module design method with the function of text language recognition is introduced in this paper. This module can classify and recognize the text of Chinese, English and special symbols, and deal with the recognition problem that the input text contains Chinese, English and special symbols. Secondly, the Chinese speech synthesis methods used in the system are explained. In the prosody control module, we use a prosody structure prediction model combining neural network and decision tree; when concatenating speech, we propose two smoothing method. One is smoothing algorithm based on spectrum analysis and the other is smoothing algorithm based on dual-threshold. Experiments prove that the synthesis effect of our system is great, and the synthesized speech is clear and natural. Due to the moderate amount of system data and high code efficiency, it is suitable for application systems in mobile Android.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Xinguang Li, Chuhua Liang, Shanxian Ma, Congcong Liu, Shuai Chen, Ruisi Li, and Haoxin He "A new type of Chinese speech synthesis technology and system research", Proc. SPIE 12256, International Conference on Electronic Information Engineering, Big Data, and Computer Technology (EIBDCT 2022), 122562Q (6 May 2022); https://doi.org/10.1117/12.2635374
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Neural networks

Control systems

Molybdenum

Smoothing

Associative arrays

Data processing

Evolutionary algorithms

Back to Top