Translator Disclaimer
Paper
31 January 2020 MIDV-2019: challenges of the modern mobile-based document OCR
Author Affiliations +
Proceedings Volume 11433, Twelfth International Conference on Machine Vision (ICMV 2019); 114332N (2020) https://doi.org/10.1117/12.2558438
Event: Twelfth International Conference on Machine Vision, 2019, Amsterdam, Netherlands
Abstract
Recognition of identity documents using mobile devices has become a topic of a wide range of computer vision research. The portfolio of methods and algorithms for solving such tasks as face detection, document detection and rectification, text field recognition, and other, is growing, and the scarcity of datasets has become an important issue. One of the openly accessible datasets for evaluating such methods is MIDV-500, containing video clips of 50 identity document types in various conditions. However, the variability of capturing conditions in MIDV-500 did not address some of the key issues, mainly significant projective distortions and different lighting conditions. In this paper we present a MIDV-2019 dataset, containing video clips shot with modern high-resolution mobile cameras, with strong projective distortions and with low lighting conditions. The description of the added data is presented, and experimental baselines for text field recognition in different conditions.
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Konstantin Bulatov, Daniil Matalov, and Vladimir V. Arlazarov "MIDV-2019: challenges of the modern mobile-based document OCR", Proc. SPIE 11433, Twelfth International Conference on Machine Vision (ICMV 2019), 114332N (31 January 2020); https://doi.org/10.1117/12.2558438
PROCEEDINGS
6 PAGES


SHARE
Advertisement
Advertisement
Back to Top