25 August 2022 Wavelet descriptor network for arbitrary-shaped text detection
Zixu Zhang, Minglei Tong
Author Affiliations +
Abstract

Arbitrary-shaped scene text detection is still a challenging task in computer vision, due to its high versatility and complexity. Most methods directly describe text instances by rectangles, quadrangles, or curves, probably leading to insufficient representations. We introduce wavelet descriptor (WD) for arbitrary-shaped text representation that decomposes text contours to wavelet coefficients and extracts feature sequences by discrete wavelet transform (DWT). Based on WD, we propose WD network, which predicts wavelet coefficients then reconstructs text instances accurately and smoothly via inverse DWT and nonmaximum suppression. We further study the restoration capability of various wavelet functions for text contour, including sym5, bior3.1, bior4.4, coif3, db5, dmey, and rbio2.8. To evaluate the proposed method, we conduct experiments on available public datasets. The F-measure of our method achieves 85.8% on challenging curved dataset CTW1500 and higher than 85% on Total-Text and ICDAR2015, demonstrating the excellent performance of our method.

© 2022 SPIE and IS&T
Zixu Zhang and Minglei Tong "Wavelet descriptor network for arbitrary-shaped text detection," Journal of Electronic Imaging 31(4), 043051 (25 August 2022). https://doi.org/10.1117/1.JEI.31.4.043051
Received: 22 March 2022; Accepted: 4 August 2022; Published: 25 August 2022
Lens.org Logo
CITATIONS
Cited by 3 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Wavelets

Discrete wavelet transforms

Image segmentation

Signal processing

Network architectures

Data modeling

Sensors

Back to Top