Classifying toxicity in the Arabic Moroccan dialect on Instagram: a machine and deep learning approach
Rabia Rachidi, Mohamed Amine Ouassil, Mouaad Errami, Bouchaib Cherradi, Soufiane Hamida, Hassan Silkan
Abstract
People crave interaction and connection with others, and social media has therefore become central to social life. Instagram is among the most prominent platforms today, with a massive number of daily users drawn by its distinctive features. Excessive disclosure of personal life, however, exposes users to bullying, harassment, and toxic reviews from other users. Numerous studies have targeted social media to counter these harmful side effects. Nevertheless, most of the available datasets are in English, and datasets in the Moroccan dialect of Arabic were lacking. In this work, a Moroccan Arabic dialect dataset was extracted from the Instagram platform. Feature extraction techniques were then applied to the collected dataset to increase classification accuracy. Afterward, we developed models using machine learning and deep learning algorithms to detect and classify toxicity. To evaluate the models, we used the most common metrics: accuracy, precision, recall, and F1-score. The experimental results gave modest scores of around 70% to 83%. These results imply that the models need improvement, and that performance is held back by the lack of available datasets and of preprocessing libraries for the Moroccan dialect of Arabic.
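As an illustration of the four evaluation metrics named in the abstract, the following minimal sketch computes accuracy, precision, recall, and F1-score for a binary toxic/non-toxic labelling. It is not the authors' code: the function name `binary_metrics`, the label encoding (1 = toxic, 0 = non-toxic), and the sample labels are all assumptions made up for this example.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall and F1 for binary labels.

    Assumption for illustration: `positive` marks the toxic class.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical labels, invented for this sketch (not data from the paper).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

With these invented labels there are 3 true positives, 1 false positive, 1 false negative, and 3 true negatives, so all four metrics come out to 0.75.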
Keywords
Cyberbullying; Deep learning; Machine learning; Moroccan dialect; Social media; Toxicity
DOI: http://doi.org/10.11591/ijeecs.v31.i1.pp588-598
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).