Classification of Quranic topics based on imbalanced classification

Bassam Sulaiman Arkok, Akram Mohammed Zeki

Abstract


Imbalanced classification techniques have been applied widely in the field of data mining. It is used to classify the imbalanced classes that are not equal in the number of samples. The problem of imbalanced classes is that the classification performance tends to the class with more samples while the class with few samples will obtain poor performance. This problem can be occurred in the Qur’anic classification due to the different number of verses. Many studies classified Qur’anic verses, which depended on the traditional classification. However, no study classified Qur’anic topics based on the techniques of imbalanced classification. Therefore, this paper aims to apply the methods of imbalanced classification as synthetic minority over-sampling technique (SMOTE), random over sample (ROS), and random under sample (RUS) methods to classify the Qur’anic topics that are imbalanced. Many metrics were used in this research to evaluate the experimental results. These metrics are sensitivity/recall, specificity, overall accuracy, F-Measure, G-mean, and matthews correlation coefficient (MCC). The results showed that the Quranic classification performance improved when imbalanced classification techniques were applied

Keywords


Imbalanced Classification; Text Classification; Quranic Topics; Resampling methods.

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v22.i2.pp678-687

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

The Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

shopify stats IJEECS visitor statistics