Paper
21 February 2012 Document image orientation based on both text and image
Author Affiliations +
Proceedings Volume 8302, Imaging and Printing in a Web 2.0 World III; 83020U (2012) https://doi.org/10.1117/12.912984
Event: IS&T/SPIE Electronic Imaging, 2012, Burlingame, California, United States
Abstract
This paper investigated the problem of orientation detection for document images with Chinese characters. These images may be in four orientations: right side up, up-side down, 90° and 270° rotated counterclockwise. First, we presented the structure of text-recognition-based orientation detection algorithm. Text line verification and orientation judgment methods were mainly discussed, afterwards multiple experiments were carried. Distance-difference based text line verification and confidence based text line verification were proposed and compared with methods without text line verification. Then, a picture-based orientation detection framework was adopted for the situation where no text line was detected. This high-level classification problem was solved by relatively low-level vision features including Color Moments (CM) and Edge Direction Histogram (EDH), with distant-based classification scheme. Finally, confidencebased classifier combination strategy was employed in order to make full use of the complementarity between different features and classifiers. Experiments showed that both text line verification methods were able to improve the accuracy of orientation detection, and picture-based orientation detection had a good performance for no-text image set.
© (2012) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yuejia Sun, Changsong Liu, Xiaoqing Ding, Zhigang Fan, and Francis Tse "Document image orientation based on both text and image", Proc. SPIE 8302, Imaging and Printing in a Web 2.0 World III, 83020U (21 February 2012); https://doi.org/10.1117/12.912984
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical character recognition

Image segmentation

Detection and tracking algorithms

Image analysis

Curium

Feature extraction

Statistical analysis

RELATED CONTENT

Discrimination of handwritten from machine-printed text
Proceedings of SPIE (June 01 1994)
Text segmentation of machine-printed Gurmukhi script
Proceedings of SPIE (December 21 2000)
SemiBoost-based Arabic character recognition method
Proceedings of SPIE (January 24 2011)
Document image binarization based on texture analysis
Proceedings of SPIE (March 23 1994)
Selection of objectlike areas on images
Proceedings of SPIE (April 01 1998)

Back to Top