Journal of Pattern Recognition and Intelligent Systems 
Journal of Pattern Recognition and Intelligent Systems(PRIS)
ISSN:2309-0669(Print)       ISSN:2309-0650(Online)
Fingerprints for Imposed Layers in Document Images Based on Huffman Code and Logical Layout Analysis
Full Paper(PDF, 669KB)
A document is characterized by its layout and component structure. Document layout is due to the placement of the content components and document structure is due to the geometrical shape of the content components. Content components in a filled-in document image consist of general information foreground layer and vital information imposed layer. The foreground layer consists of printed text, logos, tables and lines that are identical for documents of the same class; the imposed layer of the document image consists of handwritten text, signatures and seals imposed on the document image that are unique to every document image. Processing filled-in document images for indexing, considering general information along with vital information is complex with the possibility of generating identical indexes due to large amount of general information suppressing fewer imposed layer vital information. In this paper, a novel technique was proposed to generate a unique code by formulating a logical layout of the imposed layer which was extracted from the filled-in document image using registration. The extracted imposed layer components were represented by centroids based on their spatial occupancy and the imposed layer was hierarchically decomposed into 16 equal quadrants. The Huffman tree generation algorithm was applied based on the number of centroids in a quadrant and with quadrant indices were assimilated to generate a unique code for the logical layout of the document image. In order to verify the applicability of this method, extensive experimentation were conducted on extracted imposed layers from application forms, student records, bank cheques and declaration forms.
Keywords:Imposed Layer; Centroids; Huffman Codes; Quad Decomposition; Logical Layout
Author: Surabhi Narayan1, Sahana D Gowda1
1.Department of Computer Science & Engineering, B.N.M Institute of Technology, Bengaluru, India
  1. Anoop M. Namboodiri and Anil K. Jain, “Document Structure and Layout Analysis,” Digital document Processing: Major Directions and Recent Advances, Springer-Verilag, Advances in Pattern Recognition, pp. 29-48, 2007.
  2. R. Cattoni and T. Coianiz, “Geometric Layout Analysis Techniques for Document Image Understanding: A Review,” Technical Report, IRST, pp. 1-68, Trento, Italy, 1998.
  3. Joost van Beusekom, Daniel Keysers, Faisal Shafait, and Thomas M Breuel, “Distance Measures for Layout-Based Document Image Retrieval,” DIAL, IEEE, vol. 30(11), pp. 232-242, 2006.
  4. S. Tsujimoto and H. Asada, “Understanding multi-articled documents,” in Proceedings of International Conference on Pattern Recognition, pp. 551-556 (Atlantic City, NJ), June 1990.
  5. J. L. Fisher, “Logical structure descriptions of segmented document images,” in Proceedings of International Conference on Document Analysis and Recognition, pp. 302-310 (Saint-Malo, France), September 1991.
  6. A. Conway, “Page grammars and page parsing: A syntatic approach to document layout recognition,” in Proceedings of International Conference on Document Analysis and Recognition, pp. 761-764 (Tsukuba Science City, Japan), October 1993.
  7. G. Pirlo, M. Chimienti, M. Dassisti, D. Impedovo, and A. Galiano, “A Layout-Analysis Based System for Document Image Retrieval,” Mondo Digitale, vol. 13(49), pp. 1-16, 2014.
  8. R. Safari, N. Narasimhamurthi, M. Shridhar, and M. Ahmadi, “Document Registration Using Projective Geometry,” IEEE Trans. on Image Processing, vol. 6(9), pp. 1337-1341, 1997.
  9. Pinar Duygulu and Volkan Atalay, “A Hierarchical Representation of Form Documents for Identification and Retrieval,” IJDAR, vol. 5, iss. 1, pp. 17-27, November 2002.
  10. Yoshitake Tsuji, Hiroyuki Kami, Masaaki Mizumo, Toshiyuki Tanaka, Haruhiko Tanaka, Masao Iwashita, and Tsutomu Temma, “Document Recognition System With Layout Structure Generator,” IAPR, pp. 479-482, 1990, Tokyo.
  11. Francesca Cesarini, Simone Marinai, and Giovanni Soda, “Retrieval by Layout Similarity of Documents Represented with MXY Trees,” LNCS, DAS, vol. 2423, pp. 353-364, 2002.
  12. Christian Shin and David Doermann, “Document Image Retrieval Based on Layout Structural Similarity,” DAS, vol. 2, pp. 606-612, 2006.
  13. Hongxing Gao, Mar_cal Rusinol, Dimosthenis Karatzas, and Jand osep Llados, “Fast Structural Matching for Document Image Retrieval through Spatial Databases,” ICPR, vol. 9021, pp. 939-943, 2013.
  14. Jianying Hu, Ramanujan Kashi, and Gordon Wilfong, “Document Classification using Layout Analysis,” Database and Expert System Applications, vol. 6, pp. 556-560, 1999.
  15. P. Punitha, Naveen, and D.S. Guru, “Indexing and Retrieval of Document Images by Spatial Reasoning,” ICDCIT, LNCS, vol. 4317, pp. 457-464, 2006.
  16. Chao Sun and Ronghai Cai, “Document Image Registration Using Geometric Invariance and Hausdorff Distance,” First International Workshop on Education Technology and Computer Science, vol. 2, pp. 725-728, 2009.
  17. Mariusz Jankowski, “Erosion, dilation and related operators,” International Mathematics Symposium, 8th International Mathematica Symposium, June 2006.
  18. Francisco P. M. Oliveira, Faculdade de Engenharia, and João Manuel R. S. Tavares, “Matching Contours in Images through the use of Curvature, Distance to Centroid and Global Optimization with Order-Preserving Constraint,” CMES, vol. 43(1), pp. 91-110, 2009.
  19. Akondi Vyas, M B Roopashree, and B Raghavendra Prasad, “Centroid Detection by Gaussian Pattern Matching in Adaptive Optics,” IJCA, vol. 1(26), pp. 30-35, 2009.
  20. Tassos Markas, “Quad Tree Structures for Image Compression Applications,” Information Processing & Management, vol. 28, no. 6, pp. 707-721, 1992.
  21. Algorithmic Graph theory David Joyner, Minh van Nguyen and Nathan Cohen 2011 ebook,