Extracting lines of text from a manuscript is an important preprocessing step in many digital paleography applications. The extracted lines play a fundamental part in identifying the author and/or age of the manuscript. In this paper we present an unsupervised approach to text line extraction in historical manuscripts that can be applied directly to a color manuscript image. Each of the red, green and blue channels is processed separately by applying the discrete cosine transform (DCT) to it. A key advantage of this approach is that it requires no preprocessing, training or tuning steps. Extensive testing on complex Arabic handwritten manuscripts shows the effectiveness of the proposed approach.
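As an illustration of the per-channel step only, the following is a minimal sketch of applying a 2-D DCT to each color channel independently. The abstract does not state how the DCT output is used to locate text lines, so that part is deliberately left out. The function names (`dct_1d`, `dct_2d`, `per_channel_dct`), the unnormalized DCT-II variant, and the nested-list image layout are assumptions made for this example, not details taken from the paper. Pure Python is used to keep the sketch self-contained; in practice a library routine such as `scipy.fft.dctn` would be the usual choice.

```python
import math


def dct_1d(signal):
    """Unnormalized DCT-II of a 1-D sequence (assumed variant, for illustration)."""
    n = len(signal)
    return [
        sum(x * math.cos(math.pi / n * (i + 0.5) * k) for i, x in enumerate(signal))
        for k in range(n)
    ]


def dct_2d(channel):
    """Separable 2-D DCT-II: transform every row, then every column."""
    rows = [dct_1d(row) for row in channel]
    cols = [dct_1d(list(col)) for col in zip(*rows)]
    # Transpose back so the result has the same H x W layout as the input.
    return [list(row) for row in zip(*cols)]


def per_channel_dct(image):
    """Split an H x W image of (R, G, B) pixels into planes and transform each one.

    Returns a list of three coefficient grids, in R, G, B order.
    """
    return [
        dct_2d([[pixel[c] for pixel in row] for row in image])
        for c in range(3)
    ]
```

For a uniformly colored image, each channel's energy ends up entirely in the top-left (DC) coefficient, which is a quick sanity check that the three channels are being handled independently.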
|Title of host publication
|Image Analysis and Recognition: Proceedings of the 13th International Conference, ICIAR 2016, in Memory of Mohamed Kamel
|Aurélio Campilho, Fakhri Karray
|Place of Publication
|Published - 2016
|Lecture Notes in Computer Science