1 July 1996 Page segmentation for document image analysis using a neural network
Devesh Patel
Author Affiliations +
In this paper we present a method for segmenting document page images into text and nontext regions. The underlying assumption made by this approach is that the two regions can be viewed as different textures. We do not use any a priori knowledge of the document format. A convolution-based method is used to generate the texture feature images. The coefficients of the convolution masks are obtained using a single-layer artificial neural network that generates eigenvectors of the correlation matrix of the input data. The coefficients of these masks have been ‘‘learned’’ from examples of the document images and have a potential of being considerably more powerful than masks with preset coefficients. A thresholding scheme based on a measure of entropy is used to segment the feature images into the homogeneous regions.
Devesh Patel "Page segmentation for document image analysis using a neural network," Optical Engineering 35(7), (1 July 1996). https://doi.org/10.1117/1.600618
Published: 1 July 1996
Lens.org Logo
CITATIONS
Cited by 7 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Image processing

Convolution

Document image analysis

Visualization

Image filtering

Binary data

RELATED CONTENT

High-speed extraction of line segment features
Proceedings of SPIE (August 06 1993)

Back to Top