Hybrid dynamic chunk ensemble model for multi-class data streams

Varsha Sachin Khandekar, Pravin Shrinath

Abstract


In the analysis more specifically in the classification of continuous data stream using machine learning algorithms joint occurrence of concept drift and imbalanced issue becomes more provocative. Also, imbalance issue is again more challenging when the data stream is multi-class with minority class and that is too with data-difficulty factors. Incremental learning with ensemble models found more promising in handling theses issues. But most of the approaches are for two-class data streams which can’t be utilized for multiclass data streams. In this paper we have designed hybrid dynamic chunk ensemble model (HDCEM) for the classification of multi-class insect-data stream for handling imbalance and concept drift issue. To deal with imbalance issue we have proposed effective split bagging algorithm which has achieved better performance on minority class recall and F-measure on arriving dynamic chunks of data from multi-class data stream. HDCEM model can adapt to abrupt and gradual drift because it has combined features of both online and chunk-based learning together. It has achieved average 78% minority class recall in abrupt insect data stream and 71% in gradual drift insect stream.

Keywords


Concept drift; Data stream; Dynamic chunk; Ensemble learning; Imbalance data;

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v25.i2.pp1115-1122

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

shopify stats IJEECS visitor statistics