Two-level boosting classifiers ensemble based on feature selection for heart disease prediction

Kaushalya Dissanyake, Md Gapar Md Johar

Abstract


Heart disease is a prevalent global health concern, and early detection saves lives. Machine learning is now widely used in medical research, which motivates this investigation of boosting algorithms for heart disease prediction. This study employs three heart disease datasets from the University of California Irvine (UCI) repository: Cleveland, Statlog, and Long Beach, with 14 features each. Recursive feature elimination with a support vector machine (SVM) is used to identify the significant features. Five boosting algorithms, namely gradient boosting (GB), adaptive boosting (AdaBoost), extreme gradient boosting (XGBoost), categorical boosting (CatBoost), and light gradient boosting machine (LightGBM), are integrated into an ensemble model to achieve the best classification performance. The proposed model outperforms the individual boosting models in accuracy, precision, recall, F-measure, and area under the curve (AUC), reaching accuracies of 93.44%, 83.33%, and 79.75% on the Cleveland, Statlog, and Long Beach datasets, respectively. This approach offers an accurate and efficient method for heart disease prediction, which is crucial for clinical decision-making and disease management.
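The pipeline described in the abstract (SVM-based recursive feature elimination followed by an ensemble of boosting classifiers) can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: the data are synthetic stand-ins for the UCI sets (13 predictors plus a binary target), the number of retained features (8) is arbitrary, the second level is assumed to be a stacked logistic-regression meta-learner since the abstract does not specify how the base models are combined, and only the two boosting models available in scikit-learn (GB and AdaBoost) are included. XGBoost, CatBoost, and LightGBM classifiers could be appended to the estimator list if those packages are installed.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    AdaBoostClassifier,
    GradientBoostingClassifier,
    StackingClassifier,
)
from sklearn.feature_selection import RFE
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic placeholder data (assumption): 13 predictors and a binary label.
X, y = make_classification(
    n_samples=300, n_features=13, n_informative=6, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Feature selection: RFE ranks features by the weights of a linear SVM.
# Keeping 8 features is an illustrative choice, not a value from the paper.
selector = RFE(estimator=SVC(kernel="linear"), n_features_to_select=8)

# Level 1: boosting classifiers. Level 2 (assumed): a logistic-regression
# meta-learner trained on their cross-validated predictions.
ensemble = StackingClassifier(
    estimators=[
        ("gb", GradientBoostingClassifier(random_state=0)),
        ("ada", AdaBoostClassifier(random_state=0)),
    ],
    final_estimator=LogisticRegression(),
    cv=5,
)

# Scale, select features, then fit the stacked ensemble.
model = make_pipeline(StandardScaler(), selector, ensemble)
model.fit(X_train, y_train)

accuracy = model.score(X_test, y_test)
n_selected = int(model.named_steps["rfe"].support_.sum())
print(f"selected features: {n_selected}, test accuracy: {accuracy:.3f}")
```

Placing the selector inside the pipeline means feature elimination is refit on the training split only, which avoids leaking test data into the selection step. The accuracy printed here reflects the synthetic data and is not comparable to the figures reported above.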

Keywords


Boosting classifiers; Ensemble model; Feature selection; Heart disease; Machine learning


DOI: http://doi.org/10.11591/ijeecs.v32.i1.pp381-391


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

The Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).
