Audio style transfer in non-native speech recognition

Kacper Radzikowski

doi:10.1117/12.2501495

1 October 2018 Audio style transfer in non-native speech recognition

Kacper Radzikowski

Proceedings Volume 10808, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2018; 1080839 (2018) https://doi.org/10.1117/12.2501495
Event: Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2018, 2018, Wilga, Poland

Abstract

Current automatic speech recognition (ASR) systems achieve the over 90-95% accuracy, depending on methodology applied and datasets. However, the accuracy drops significantly, while the ASR system is being used with a non-native speaker of the language to be recognized, mainly because of specific pronunciation features. At the same time, the volume of labeled datasets of non-native speech samples is extremely limited both in size as well as in the number of existing languages, which makes it difficult to train sufficiently accurate ASR systems targeted for non-native speakers. Therefore applying a different method is necessary. In this paper, we suggest an idea for an alternative approach to the problem, by employing so-called style transfer methodology. Style transfer, used mainly in graphical domain until now, could help solve the problem of non-native speech. Another advantage is that the style transferring algorithm could be compatible with already existing ASR systems, which means it would not be necessary to train new systems which can be difficult and time consuming.

Citation Download Citation

Kacper Radzikowski "Audio style transfer in non-native speech recognition", Proc. SPIE 10808, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2018, 1080839 (1 October 2018); https://doi.org/10.1117/12.2501495

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available