Conference paper, Year: 1995

Form item extraction based on line searching

Eric Turolla
Yolande Belaïd
Abdel Belaïd

Abstract

This paper presents an item-searching method that has been applied to various kinds of forms. The approach is based on line detection with the Hough transform. Once the straight lines are obtained, the Hough directions are used to detect the actual segments in the image. A segment can correspond either to a continuous line or to the black parts of a dashed or dotted line. The segments are therefore grouped together and classified between each pair of adjacent line crossing points. Items are located by searching for the minimum cycles of the graph constructed from the line intersection points. The last step verifies the line classes using the hypothesis that the sides of an item are homogeneous. The method was applied to French tax forms and to tables from scientific publications. The experimental results demonstrate the robustness and reliability of the approach on various forms with different types of item delimiters.
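
The line-detection step named in the abstract can be illustrated with a minimal sketch of Hough voting. This is not the authors' implementation: the function name `hough_lines`, the one-degree angle step, the integer rounding of rho, and the `min_votes` threshold are all assumptions made for illustration. Each black pixel (x, y) votes for every line rho = x cos(theta) + y sin(theta) that passes through it, and accumulator cells with many votes are taken as candidate straight lines.

```python
import math
from collections import Counter


def hough_lines(points, min_votes):
    """Return (theta_degrees, rho, votes) for well-supported lines.

    `points` is an iterable of (x, y) black-pixel coordinates.
    Hypothetical helper for illustration only: theta is sampled at
    integer degrees in [0, 180) and rho is rounded to the nearest integer.
    """
    accumulator = Counter()
    for x, y in set(points):
        for theta_deg in range(180):
            theta = math.radians(theta_deg)
            rho = round(x * math.cos(theta) + y * math.sin(theta))
            accumulator[(theta_deg, rho)] += 1
    # Keep only cells with enough votes, strongest first.
    strong = [(t, r, v) for (t, r), v in accumulator.items() if v >= min_votes]
    return sorted(strong, key=lambda cell: -cell[2])


# Toy image: one horizontal rule (y = 3) and one vertical rule (x = 6).
horizontal = [(x, 3) for x in range(10)]
vertical = [(6, y) for y in range(10)]
lines = hough_lines(horizontal + vertical, min_votes=10)
```

On this toy input the cells (theta = 90, rho = 3) and (theta = 0, rho = 6) are among the results, together with a few neighbouring angles that collect the same pixels because of the coarse rho rounding. A practical detector would merge such neighbours, and the later steps described in the abstract (segment detection along the Hough directions, grouping of dashed or dotted segments, and the minimum-cycle search) are not covered by this sketch.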

Dates and versions

inria-00537324 , version 1 (18-11-2010)

Cite

Eric Turolla, Yolande Belaïd, Abdel Belaïd. Form item extraction based on line searching. International Workshop on Graphics Recognition - GRCE, Aug 1995, University Park, PA, United States. pp.69-79, ⟨10.1007/3-540-61226-2_7⟩. ⟨inria-00537324⟩