Search Advanced Search
Total result 1
    Select All
    Display Type:|
    • Fast Mining Algorithm of Frequent Itemset Based on Spark

      2023, 34(5):2446-2464.DOI: 10.13328/j.cnki.jos.006404

      Keywords:frequent itemsetpattern growthBitStringbit-wise operationvertical groupingSpark
      Abstract (936)HTML (1783)PDF 17.79 M (2830)Favorites

      Abstract:Improving the efficiency of frequent itemset mining in big data is a hot research topic at present. With the continuous growth of data volume, the computing costs of traditional frequent itemset generation algorithms remain high. Therefore, this study proposes a fast mining algorithm of frequent itemset based on Spark (Fmafibs in short). Taking advantage of bit-wise operation, a novel pattern growth strategy is designed. Firstly, the algorithm converts itemset into BitString and exploits bit-wise operation to generate candidate itemset. Secondly, to improve the processing efficiency of long BitString, a vertical grouping strategy is designed and the candidate itemset are obtained by joining the frequent itemset between different groups of same transaction, and then aggregating and filtering them to get the final frequent itemset. Fmafibs is implemented in Spark environment. The experimental results on benchmark datasets show that the proposed method is correct and it can significantly improve the mining efficiency.

    Prev1Next
    Page 1 Result 1 Jump toPageGO
Year of publication

You are the first2038842Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063