Abstract
Due to its importance in studying people’s thoughts on various Web 2.0 services, emotion classification is a critical undertaking. Most existing research is focused on the English language, with little work on low-resource languages. Though sentiment analysis, particularly emotion classification in English, has received increasing attention in recent years, little study has been done in the context of Bangla, one of the world’s most widely spoken languages. In this research, we propose a complete set of approaches for identifying and extracting emotions from Bangla texts. We provide a Bangla emotion classifier for six classes, i.e., anger, disgust, fear, joy, sadness, and surprise, from Bangla words using transformer-based models, which exhibit phenomenal results in recent days, especially for high-resource languages. The Unified Bangla Multi-class Emotion Corpus (UBMEC) is used to assess the performance of our models. UBMEC is created by combining two previously released manually labelled datasets of Bangla comments on six emotion classes with fresh manually labelled Bangla comments created by us. The corpus dataset and code we used in this work are publicly available.
Original language | English |
---|---|
Title of host publication | 2024 25th International Arab Conference on Information Technology (ACIT) |
Place of Publication | Piscataway, US |
Publisher | IEEE |
Pages | 1-7 |
Number of pages | 7 |
ISBN (Electronic) | 9798331540012 |
ISBN (Print) | 9798331540029 |
DOIs | |
Publication status | Published - 10 Dec 2024 |
Externally published | Yes |
Event | 2024 25th International Arab Conference on Information Technology (ACIT) - Zarqa University, Zarqa, Jordan Duration: 10 Dec 2024 → 12 Dec 2024 https://acit2k.org/ACIT/ |
Publication series
Name | International Arab Conference on Information Technology (ACIT) |
---|---|
Publisher | IEEE |
ISSN (Print) | 2831-493X |
ISSN (Electronic) | 2831-4948 |
Conference
Conference | 2024 25th International Arab Conference on Information Technology (ACIT) |
---|---|
Abbreviated title | ACIT'2024 |
Country/Territory | Jordan |
City | Zarqa |
Period | 10/12/24 → 12/12/24 |
Internet address |
Keywords
- Bangla corpus
- Bangla emotion analysis
- Text classification
- Multi-class emotion classification
- Natural language processing