Multi-label classification of Arabic text using deep learning models

Khouloud, Arioua; Supervisor: Rahima, Bentercia

Multi-label classification of Arabic text using deep learning models

Files

Khouloud Arioua.pdf (1.32 MB)

Date

2025-06-15

Authors

Khouloud, Arioua

Supervisor: Rahima, Bentercia

Publisher

Mohamed Boudiaf University of M'sila

Abstract

In multi-label text classification, multiple related labels are assigned to relevant documents for more refined categorization. The research attempts to build a system that can effective ly categorize Arabic texts into multiple themes using the multi-label dataset NADiA1, which contains 35,404 files across 24 categories. Deep learning approaches, encompassing QARiB, MARBERT, and AraBERT, were used, with data preprocessing conducted using the pyAra bic package to maintain text quality. The models were assessed by accuracy, precision, recall, and Hamming loss. Among these, transformer-based AraBERT outclassed its peers by giving 95.76% accuracy and a micro F1-score of 0.81, followed by QARiB (95.48% accuracy and 0.80 micro F1-score) and MARBERT (94.99% accuracy and 0.77 micro F1-score). This study lays emphasis that deep transformer-based learning techniques are highly effective in multi-label Arabic text classification, with AraBERT showing the ability to better handle linguistic com plexities

Keywords

Multi-label Text Classification, Deep Learning, QARiB, MARBERT, AraBERT, Arabic NLP, Arabic text, MARBERT, transformers

URI

https://repository.univ-msila.dz/handle/123456789/46768

Collections

Master Thesis

Full item page

Multi-label classification of Arabic text using deep learning models

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections