Optimization of the algorithms use ensemble and synthetic minority oversampling technique for air quality classification

Aziz Jihadian Barid, Hadiyanto Hadiyanto, Adi Wibowo

Abstract


Rapid economic development, industrialization, and urbanization in Indonesia have caused a large increase in air pollution with negative impacts on the environment and public health. The aim of this research is to use machine learning techniques to categorize air quality and generate an air quality index (AQI) using a dataset that includes six prevalent air pollutants. Next steps are preprocessing and data extraction, K-nearest neighbors (KNN) classification, support vector machine (SVM), and random forest (RF) models are implemented. Furthermore, synthetic minority oversampling technique (SMOTE) is incorporated into the ensemble learning process to improve the results. This research uses K-fold cross validation for improve classification accuracy and reduce overfitting. Research findings show that the application of SMOTE brings a significant increase in model accuracy, effectively solving the problem of imbalanced data sets. These insights provide direction for effective air quality monitoring systems and informed decision making in air pollution management.


Keywords


K-fold cross validation; K-nearest neighbors; Random forest; SMOTE; Support vector machine

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v33.i3.pp1632-1640

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

shopify stats IJEECS visitor statistics