Prediction of Parkinson's disease using feature selection and ensemble learning techniques

Sharan T. D., Sujata Joshi

Abstract


Parkinson's disease (PD) is a progressive neurodegenerative disorder that significantly impacts quality of life and healthcare systems. Early detection is crucial for timely interventions that can mitigate disease progression and improve patient outcomes. This study leverages advanced machine learning (ML) techniques to detect PD using speech features as non-invasive biomarkers. A dataset containing 754 features derived from sustained vowel phonations of 252 individuals (188 PD patients, 64 healthy controls) was analyzed. The dataset, originally collected by Istanbul University and publicly hosted via the UCI ML repository, was accessed through Kaggle for preprocessing and analysis. To identify the most predictive features, we employed recursive feature elimination (RFE), random forest importance, lasso regression, and the boruta algorithm—ensuring robust feature selection while reducing dimensionality. The XGBoost model, optimised using synthetic minority oversampling technique (SMOTE) for class balancing, achieved an accuracy of 96.69%, a recall of 96%, and an F1-score of 98%. Model robustness was validated through 5-fold cross-validation, yielding an average accuracy of 89.54%. These findings establish a scalable, costeffective, and non-invasive framework for early PD detection, demonstrating the potential of speech analysis and ML in neurodegenerative disease management.

Keywords


Feature selection; Parkinson’s disease; SMOTE; Speech biomakers; XGBoost

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v39.i3.pp1736-1744

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES).

shopify stats IJEECS visitor statistics