Separating content from noise in historical manuscripts is a fundamental task in digital palaeography. This study presents a fully automated segmentation approach based on the response of Harris corner detectors. The strength and clustering efficiency of the detected corners in the manuscripts are evaluated and used to segment the content from the background and noise. In addition, a manuscript reconstruction technique is proposed from the gradient field using the Poisson method to guide the interpolation. This reconstruction is able to remove noise significantly and hence enhances the contrast of the content thus making it easier for users to read and process these documents. The proposed approaches are evaluated using various standard databases to highlight their effectiveness and robustness to a multitude of noise and writing styles. Subjective and objective evaluations of the experimental results show that these techniques are able to successfully segment and reconstruct a very diverse set of scanned documents. An analysis of the results has also shown that the proposed technique compares favourably against similar counterparts.
|Number of pages||11|
|Journal||IET Image Processing|
|Early online date||10 Jan 2018|
|Publication status||Published - 1 Apr 2018|