Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

Journal of Beijing University of Posts and Telecommunications ›› 2011, Vol. 34 ›› Issue (6): 64-68.doi: 10.13190/jbupt.201106.64.yanh

• Papers • Previous Articles     Next Articles

Closed Frequent Itemsets Hierarchical Clustering based on Items’ Quantities

  

  • Received:2011-01-20 Revised:2011-05-18 Online:2011-12-28 Published:2011-10-18
  • Contact: Hao Yan E-mail:yanhao71@163.com

Abstract:

A Web Usage Mining algorithm named Closed Frequent Itemsets Hierarchical Clustering based on Quantities (CFIHCQ) is proposed. The algorithm first obtains Closed Frequent Itemsets with network user Web access data. Then it initial clusters with Closed Frequent Itemsets and points users in to the only cluster using scoring method. After that, it construct cluster tree using cluster labels. User access vectors are used to divide sub-clusters in cluster tree. Finally the cluster tree is pruned. Experimental results indicate CFIHCQ has many advantages such as accurate predicating network user Web access behavior, real-time mining in huge data set, and easy-browse result with tree structure.

CLC Number: