1 March 2023
An efficient and effective text spotter for characters in natural scene images based on an improved YOLOv5 model
Quanxing Xu, Guanyi Zheng, Wanglong Ren, Xin Li, Zhuo Yang, Zhicheng Huang
Proceedings Volume 12588, International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2022); 1258809 (2023) https://doi.org/10.1117/12.2667388
Event: International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2022), 2022, Chongqing, China
Abstract
Traditional scene text spotters aim to detect and recognize entire words or sentences in natural scene images. However, detecting and recognizing each individual character is as important as spotting whole words or sentences in an image. Few specialized methods exist for spotting single characters in scene text, and some word-based methods cannot recognize a series of characters in an image if the characters do not spell a correct word. In addition, some early models can only detect or recognize text that is horizontal and distinctive. Because existing models need improvement before they can spot characters, we propose a novel method based on an improved YOLOv5 model to accomplish character-level spotting. Notably, this method can spot characters not only in regular text but also in irregular text (curved and oriented text).
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Quanxing Xu, Guanyi Zheng, Wanglong Ren, Xin Li, Zhuo Yang, and Zhicheng Huang "An efficient and effective text spotter for characters in natural scene images based on an improved YOLOv5 model", Proc. SPIE 12588, International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2022), 1258809 (1 March 2023); https://doi.org/10.1117/12.2667388
KEYWORDS
Image processing
Visualization
Optics
Physical sciences
Sensors
Technology
Computer vision technology