Video spam comment features selection using machine learning techniques

Nabilah Alias; Cik Feresa Mohd Foozy; Sofia Najwa Ramli; Naqliyah Zainuddin

doi:10.11591/ijeecs.v15.i2.pp1046-1053

Video spam comment features selection using machine learning techniques

Nabilah Alias, Cik Feresa Mohd Foozy, Sofia Najwa Ramli, Naqliyah Zainuddin

Abstract

Nowadays, social media (e.g., YouTube and Facebook) provides connection and interaction between people by posting comments or videos. In fact, comments are a part of contents in a website that can attract spammer to spreading phishing, malware or advertising. Due to existing malicious users that can spread malware or phishing in the comments, this work proposes a technique used for video sharing spam comments feature detection. The first phase of the methodology used in this work is dataset collection. For this experiment, a dataset from UCI Machine Learning repository is used. In the next phase, the development of framework and experimentation. The dataset will be pre-processed using tokenization and lemmatization process. After that, the features to detect spam is selected and the experiments for classification were performed by using six classifiers which are Random Tree, Random Forest, Naïve Bayes, KStar, Decision Table, and Decision Stump. The result shows the highest accuracy is 90.57% and the lowest was 58.86%.

Keywords

Machine learning, Naïve bayes, KStar, Random forest

Full Text:

PDF

DOI: http://doi.org/10.11591/ijeecs.v15.i2.pp1046-1053

Refbacks

There are currently no refbacks.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES).

IJEECS visitor statistics

Username
Password
Remember me