Performance analysis of frequent pattern mining algorithm on different real-life dataset
Rakshit khajuria, Anuj Sharma, Sunny Sharma, Ashok Sharma, Jyoti Narayan Baliya, Parveen Singh
Abstract
The efficient finding of common patterns: a group of items that appear frequently in a dataset is a critical task in data mining, especially in transaction datasets. The goal of this paper is to look into the efficiency of various algorithms for frequent pattern mining in terms of computing time and memory consumption, as well as the problem of how to apply the algorithms to different datasets. In this paper, the algorithms investigated for mining the frequent patterns are; Pre-post, Pre-post+, FIN, H-mine, R-Elim, and estDec+ algorithms. These algorithms have been implemented and tested on four real-life datasets that are: The retail dataset, the Accidents dataset, the Chess dataset, and the Mushrooms dataset. From the results, it has been observed that, for the Retail dataset, estDec+ algorithm is the fastest among all algorithms in terms of run time as well as consumes less memory for its execution. Pre-post+ algorithm performs better than all other algorithms in terms of run time and maximum memory for the Mushrooms dataset. Pre-Post outperforms other algorithms in terms of performance. And for Accident datasets, in terms of execution time and memory consumption, the FIN method outperforms other algorithms.
Keywords
Data mining; estDec+ algorithm; Frequent pattern mining; Pre-post+ algorithm
DOI:
http://doi.org/10.11591/ijeecs.v29.i3.pp1355-1363
Refbacks
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).
IJEECS visitor statistics