Comparison of machine learning algorithms with regression analysis to predict the COVID-19 outbreak in Thailand

Supanee Sengsri, Kheamparit Khunratchasana

Abstract


Coronavirus disease (COVID-19) is a public health problem in Thailand. Currently, there are more than 5 million infected people and the rate has been increasing at some point. It is therefore important to forecast the number of new cases over a short period of time to assist in strategic planning for the response to COVID-19. The purpose of this research paper was to compare the efficiency and prediction of the number of COVID-19 cases in Thailand using machine learning of 8 models using a regression analysis method. Using the 475-day dataset of COVID-19 cases in Thailand, the results showed that the predictive accuracy model (R2 score) from the testing dataset was the random forest (RF) model, which was 99.06%, followed by K-nearest neighbor (KNN), XGBoost. And the decision tree (DT) had the precision of 98.97, 98.67, and 98.64, respectively. And the results of the comparison of the number of infected people obtained from the prediction The models that predicted the number of real infections were the decision tree, random forest, and XGBoost, which were effective at predicting the number of infections correctly in the 2-4 day period.

Keywords


COVID-19; Decision tree; K-nearest neighbor; LASSO; Machine learning; Random forest; XGBoost

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v31.i1.pp299-304

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

The Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

shopify stats IJEECS visitor statistics