An intelligent use of stemmer and morphology analysis for Arabic information retrieval

Ali Alnaied*, Mosa Elbendak, Abdullah Bulbul

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

19 Citations (Scopus)
23 Downloads (Pure)

Abstract

Arabic Information Retrieval has gained significant attention due to the increasing use of Arabic text on the web and social media networks. This paper discusses a new approach to Arabic stemming, called Arabic Morphology Information Retrieval (AMIR), which generates/extracts stems by applying a set of rules on the relationships among Arabic letters to find the root/stem of each word; these roots/stems serve as indexing terms for text search in Arabic retrieval systems. To demonstrate the usefulness of the proposed algorithm, we highlight the benefits of the proposed rules for different Arabic information retrieval systems. Finally, we evaluate the AMIR system by comparing its performance with LUCENE, FARASA, and a no-stemmer baseline in terms of mean average precision. The results show that AMIR achieves a mean average precision of 0.34, while LUCENE, FARASA, and the no-stemmer baseline achieve 0.27, 0.28, and 0.21, respectively. This demonstrates that AMIR improves Arabic stemming and retrieval effectiveness and is robust to any type of stem.
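
The comparison in the abstract is reported as mean average precision (MAP). As a minimal sketch of how that metric is commonly computed, assuming binary relevance judgments and one ranked result list per query, the Python below may help. The abstract does not say which MAP variant or evaluation tool the authors used. The function names, document IDs, and toy data are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Set


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Average precision for one query, with binary relevance.

    Precision is sampled at each rank where a relevant document appears,
    then divided by the total number of relevant documents, so relevant
    documents that were never retrieved count as zero.
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Dict[str, List[str]], qrels: Dict[str, Set[str]]
) -> float:
    """Mean of per-query average precision over all judged queries."""
    if not qrels:
        return 0.0
    total = sum(
        average_precision(runs.get(query_id, []), relevant)
        for query_id, relevant in qrels.items()
    )
    return total / len(qrels)


# Toy data (hypothetical): two queries with made-up document IDs.
toy_runs = {
    "q1": ["d1", "d2", "d3", "d4"],
    "q2": ["d5", "d6"],
}
toy_qrels = {
    "q1": {"d1", "d3"},
    "q2": {"d6"},
}

if __name__ == "__main__":
    print(round(mean_average_precision(toy_runs, toy_qrels), 4))
```

Under this definition, a stemmer that conflates more morphological variants of a query term can raise MAP by moving relevant documents up the ranking. That is the kind of effect the reported scores are meant to capture.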

Original language: English
Pages (from-to): 209-217
Number of pages: 9
Journal: Egyptian Informatics Journal
Volume: 21
Issue number: 4
Early online date: 30 Dec 2020
Publication status: Published - Dec 2020
Externally published: Yes

Keywords

  • Arabic morphological analysis
  • Arabic stemmer
  • Information retrieval systems
  • Natural language processing
