Abstract
Layout analysis, which aims to detect and categorize areas of interest in document images, is an increasingly important part of document image processing. Existing research has addressed layout analysis for various document types, but no method has been proposed for documents produced in teaching, i.e. exam papers and workbooks, which are worth studying. In this paper, we propose a novel layout analysis system that performs two tasks, one for workbook pages and one for exam papers. For workbook pages, we segment text and non-text areas. For exam papers, we extract regions of interest. Our system is based on connected component (CC) analysis: it extracts geometric features and spatial information of CCs to recognize page elements. We carried out experiments on images collected from real-world scenarios, and the promising results confirm the applicability and effectiveness of our system.
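The abstract does not specify which geometric features or decision rules the system uses, so the following is only a minimal sketch of the general idea of CC analysis, not the authors' method. It labels 8-connected foreground pixels in a tiny binarized page and derives a few common geometric features (bounding box, area, aspect ratio, density). The function names, the feature set, and the thresholds in `is_text_like` are all hypothetical and chosen purely for illustration.

```python
from collections import deque


def connected_components(binary):
    """Find 8-connected components of foreground pixels (value 1).

    Returns one dict of simple geometric features per component,
    in raster-scan order of each component's first pixel.
    """
    rows, cols = len(binary), len(binary[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if binary[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, bottom, left, right, area = r, r, c, c, 0
            while queue:
                y, x = queue.popleft()
                area += 1
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and binary[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            height, width = bottom - top + 1, right - left + 1
            components.append({
                "bbox": (top, left, bottom, right),  # inclusive coordinates
                "area": area,                        # foreground pixel count
                "aspect_ratio": width / height,
                "density": area / (width * height),  # fill ratio of the bbox
            })
    return components


def is_text_like(component, max_height=3, min_density=0.3):
    """Hypothetical rule: small, reasonably dense components count as text.

    The thresholds are illustrative assumptions, not values from the paper.
    """
    top, _, bottom, _ = component["bbox"]
    return (bottom - top + 1) <= max_height and component["density"] >= min_density


# Toy binarized page: a small blob (top left) and a larger block (right).
page = [
    [1, 1, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 1, 1, 1],
    [0, 0, 0, 1, 1, 1, 1],
    [0, 0, 0, 1, 1, 1, 1],
    [0, 0, 0, 1, 1, 1, 1],
]
components = connected_components(page)
```

On a real scan, the input would come from binarizing the page image first, and a practical classifier would likely combine more features and the spatial relations between neighboring CCs than this single height-and-density rule.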
Original language | English |
---|---|
Title of host publication | ICCSE 2021 |
Subtitle of host publication | IEEE: The 16th International Conference on Computer Science and Education |
Place of Publication | Piscataway |
Publisher | IEEE |
Pages | 875-880 |
Number of pages | 6 |
ISBN (Electronic) | 9781665414685, 9781665414678 |
ISBN (Print) | 9781665447546 |
Publication status | Published - 17 Aug 2021 |
Event | ICCSE 2021: The 16th International Conference on Computer Science and Education, Lancaster University, Lancaster, United Kingdom; Duration: 17 Aug 2021 → 21 Aug 2021; http://www.ieee-iccse.org/?_v=1625228873586 |
Publication series
Name | Proceedings of the International Conference on Computer Science & Education (ICCSE) |
---|---|
Publisher | IEEE |
ISSN (Print) | 2471-6146 |
ISSN (Electronic) | 2473-9464 |
Conference
Conference | ICCSE 2021 |
---|---|
Country/Territory | United Kingdom |
City | Lancaster |
Period | 17/08/21 → 21/08/21 |
Keywords
- Layout Analysis
- Connected Component Analysis
- Digital Image Processing