Detail View
BIGMiner: a fast and scalable distributed frequent pattern miner for big data
WEB OF SCIENCE
SCOPUS
- Title
- BIGMiner: a fast and scalable distributed frequent pattern miner for big data
- DGIST Authors
- Chon, Kang-Wook ; Kim, Min-Soo
- Issued Date
- 2018-09
- Citation
- Chon, Kang-Wook. (2018-09). BIGMiner: a fast and scalable distributed frequent pattern miner for big data. doi: 10.1007/s10586-018-1812-0
- Type
- Article
- Article Type
- Article
- Author Keywords
- Big data ; Distributed algorithm ; Frequent pattern mining ; MapReduce ; Scalable algorithm
- ISSN
- 1386-7857
- Abstract
-
Frequent itemset mining is widely used as a fundamental data mining technique. Recently, there have been proposed a number of MapReduce-based frequent itemset mining methods in order to overcome the limits on data size and speed of mining that sequential mining methods have. However, the existing MapReduce-based methods still do not have a good scalability due to high workload skewness, large intermediate data, and large network communication overhead. In this paper, we propose BIGMiner, a fast and scalable MapReduce-based frequent itemset mining method. BIGMiner generates equal-sized sub-databases called transaction chunks and performs support counting only based on transaction chunks and bitwise operations without generating and shuffling intermediate data. As a result, BIGMiner achieves very high scalability due to no workload skewness, no intermediate data, and small network communication overhead. Through extensive experiments using large-scale datasets of up to 6.5 billion transactions, we have shown that BIGMiner consistently and significantly outperforms the state-of-the-art methods without any memory problems. © 2018 Springer Science+Business Media, LLC, part of Springer Nature
더보기
- Publisher
- Springer New York LLC
File Downloads
- There are no files associated with this item.
공유
Total Views & Downloads
???jsp.display-item.statistics.view???: , ???jsp.display-item.statistics.download???:
