Paper
10 April 2018 Text extraction from images in the wild using the Viola-Jones algorithm
Raid M. Saabna, Eran Zingboim
Author Affiliations +
Proceedings Volume 10615, Ninth International Conference on Graphic and Image Processing (ICGIP 2017); 106151H (2018) https://doi.org/10.1117/12.2303559
Event: Ninth International Conference on Graphic and Image Processing, 2017, Qingdao, China
Abstract
Text Localization and extraction is an important issue in modern applications of computer vision. Applications such as reading and translating texts in the wild or from videos are among the many applications that can benefit results of this field. In this work, we adopt the well-known Viola-Jones algorithm to enable text extraction and localization from images in the wild. The Viola-Jones is an efficient, and a fast image-processing algorithm originally used for face detection. Based on some resemblance between text and face detection tasks in the wild, we have modified the viola-jones to detect regions of interest where text may be localized. In the proposed approach, some modification to the HAAR like features and a semi-automatic process of data set generating and manipulation were presented to train the algorithm. A process of sliding windows with different sizes have been used to scan the image for individual letters and letter clusters existence. A post processing step is used in order to combine the detected letters into words and to remove false positives. The novelty of the presented approach is using the strengths of a modified Viola-Jones algorithm to identify many different objects representing different letters and clusters of similar letters and later combine them into words of varying lengths. Impressive results were obtained on the ICDAR contest data sets.
© (2018) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Raid M. Saabna and Eran Zingboim "Text extraction from images in the wild using the Viola-Jones algorithm", Proc. SPIE 10615, Ninth International Conference on Graphic and Image Processing (ICGIP 2017), 106151H (10 April 2018); https://doi.org/10.1117/12.2303559
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image processing

Computer vision technology

Machine vision

Algorithm development

RELATED CONTENT

Using constraints to incorporate domain knowledge
Proceedings of SPIE (February 01 1992)
Robust regularized image restoration
Proceedings of SPIE (October 01 1991)
Level-set approach for stereo
Proceedings of SPIE (February 19 1997)
Software environment for parallel computer vision
Proceedings of SPIE (April 30 1992)

Back to Top