Automatic segmentation for Arabic characters in handwriting documents

Ahmed Lawgali, Ahmed Bouridane, Maia Angelova, Zabih Ghassemlooy

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

8 Citations (Scopus)

Abstract

The cursive and ligature nature of the Arabic script make the segmentation of words into individual characters a difficult task. Despite attempts to apply methods for cursive Latin and other scripts to Arabic script, it is generally insufficient to segment the Arabic text. This paper proposes a new segmentation algorithm for the handwritten Arabic text and the main idea consists of segmenting the word into sub-words and then computing the baseline of each sub-word. Using the descenders of sub-words and the baseline, candidate points are then calculated using a vertical projection. The algorithm has been tested using 800 handwritten Arabic words taken from the IFN/ENIT database and a comparison made against some existing methods and promising results have been obtained.
Original languageEnglish
Title of host publicationProceedings of the 18th IEEE International Conference on Image Processing
Place of PublicationPiscataway, NJ
PublisherIEEE
Pages3529-3532
ISBN (Print)978-1457713040
DOIs
Publication statusPublished - 2011

Cite this