Paper
19 January 2009 Text-image alignment for historical handwritten documents
Author Affiliations +
Proceedings Volume 7247, Document Recognition and Retrieval XVI; 724703 (2009) https://doi.org/10.1117/12.805511
Event: IS&T/SPIE Electronic Imaging, 2009, San Jose, California, United States
Abstract
We describe our work on text-image alignment in context of building a historical document retrieval system. We aim at aligning images of words in handwritten lines with their text transcriptions. The images of handwritten lines are automatically segmented from the scanned pages of historical documents and then manually transcribed. To train automatic routines to detect words in an image of handwritten text, we need a training set - images of words with their transcriptions. We present our results on aligning words from the images of handwritten lines and their corresponding text transcriptions. Alignment based on the longest spaces between portions of handwriting is a baseline. We then show that relative lengths, i.e. proportions of words in their lines, can be used to improve the alignment results considerably. To take into account the relative word length, we define the expressions for the cost function that has to be minimized for aligning text words with their images. We apply right to left alignment as well as alignment based on exhaustive search. The quality assessment of these alignments shows correct results for 69% of words from 100 lines, or 90% of partially correct and correct alignments combined.
© (2009) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
S. Zinger, J. Nerbonne, and L. Schomaker "Text-image alignment for historical handwritten documents", Proc. SPIE 7247, Document Recognition and Retrieval XVI, 724703 (19 January 2009); https://doi.org/10.1117/12.805511
Lens.org Logo
CITATIONS
Cited by 16 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Statistical analysis

Pattern recognition

Chemical elements

Video

Visualization

Cognition

RELATED CONTENT

Saliency identified by absence of background structure
Proceedings of SPIE (March 14 2013)
Image thresholding using standard deviation
Proceedings of SPIE (March 07 2014)
Document flow segmentation for business applications
Proceedings of SPIE (March 24 2014)
Media segmentation using self-similarity decomposition
Proceedings of SPIE (January 10 2003)
Learning to see, seeing to learn visual aspects of...
Proceedings of SPIE (June 17 2003)

Back to Top