Software for automatic analysis of image and sound data simultaneously acquired from high-speed videoendocopy

Tao Jiang; Shouhua Luo; Yuling Yan

doi:10.1117/12.2014251

8 March 2013 Software for automatic analysis of image and sound data simultaneously acquired from high-speed videoendocopy

Tao Jiang, Shouhua Luo, Yuling Yan

Author Affiliations +

Proceedings Volume 8565, Photonic Therapeutics and Diagnostics IX; 856520 (2013) https://doi.org/10.1117/12.2014251
Event: SPIE BiOS, 2013, San Francisco, California, United States

Abstract

High-speed digital videoendoscopy system is emerging as a new clinical tool for voice assessment. The system can acquire images of the vibrating vocal folds with simultaneous recording of voice data from the patient. The laryngeal image-based analysis has been proven valuable for objective and quantitative assessment of voice kinematics in health and disease, and meanwhile, acoustic analysis of voice data could assist in the study of phonatory characteristics and reveal useful information related to laryngeal pathophysiology. Contrast to the hardware acquisition systems, the development of effective software for handling such massive visual/sound data has lagged behind. In this paper, a software system is designed to process the laryngeal image sequences and perform image-based analyses as well as acoustic analyses. Our software contains following modules: (1) Import and view Module - to read AVI video data and sound data (wave file), edit/compile and save selected data, make image montages using DirectShow technology and display the acoustic waveform using DirectSound technology; (2) Image Process Module – to perform frame-by-frame image segmentation to delineate the glottis, to extract the GAW and bilateral vocal fold displacements; (3) Image Analysis Module – to adopt Nyquist plot displays that involves the Hilbert transform based analysis of GAW, and to provide instantaneous frequency and amplitude distributions; (4) Acoustic Analysis Module – to perform Fast Fourier Transform (FFT) and Spectrogram analyses of the imported sound data, to display the plot of the sound data and provide instantaneous frequency and amplitude distributions and Nyqiust plot and (5) Dual GAW and sound wave display module. Upon rigorous testing of this software using clinical data samples we demonstrate the applications of the software to the study of dynamic characteristics of the glottis, which may correlate with voice quality and health condition.

Citation Download Citation

Tao Jiang, Shouhua Luo, and Yuling Yan "Software for automatic analysis of image and sound data simultaneously acquired from high-speed videoendocopy", Proc. SPIE 8565, Photonic Therapeutics and Diagnostics IX, 856520 (8 March 2013); https://doi.org/10.1117/12.2014251

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
8 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Image segmentation

Image analysis

Acoustics

Image processing

Data acquisition

Image acquisition

Software development

Show All Keywords

Keywords/Phrases

Search In:

Publication Years