Big data clustering based on spark chaotic improved particle swarm optimization

Saida Ishak Boushaki; Brahim Hadj Mahammed; Omar Bendjeghaba; Messaoud Mosbah

doi:10.11591/ijeecs.v34.i1.pp419-429

Big data clustering based on spark chaotic improved particle swarm optimization

Saida Ishak Boushaki, Brahim Hadj Mahammed, Omar Bendjeghaba, Messaoud Mosbah

Abstract

In recent years, the surge in continuously accelerating data generation has given rise to the prominence of big data technology. The MapReduce architecture, situated at the core of this technology, provides a robust parallel environment. Spark, a leading framework in the big data landscape, extends the capabilities of the traditional MapReduce model. Coping with big data, especially in the realm of clustering, requires more efficient techniques. Meta-heuristic-based clustering, known for offering global solutions within reasonable time frames, emerges as a promising approach. This paper introduces a parallel-distributed clustering algorithm for big data within the Spark Framework, named Spark, chaotic improved PSO (S-CIPSO). Centered on particle swarm optimization (PSO), the proposed algorithm is enhanced with a chaotic map and an efficient procedure. Test results, conducted on both real and artificial datasets, establish the superior performance and quality of clustering results achieved by the proposed approach. Additionally, the scalability and robustness of S-CIPSO are validated, demonstrating its effectiveness in handling large-scale datasets.

Keywords

Big data clustering; Chaotic map; MapReduce; Particle swarm optimization; Spark

Full Text:

PDF

DOI: http://doi.org/10.11591/ijeecs.v34.i1.pp419-429

Refbacks

There are currently no refbacks.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES).

IJEECS visitor statistics

Username
Password
Remember me