Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2015, Vol. 38 ›› Issue (s1): 63-66,71.doi: 10.13190/j.jbupt.2015.s1.015

• Papers • Previous Articles     Next Articles

The Application of Double Layer Clustering Model on Log Data Analysis

GU Heng1,2, CHEN Zhao3, WANG Cong2,4, ZHANG Si-yue2,4, FU Qun-chao2,4   

  1. 1. School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China;
    2. Key Laboratory of Trustworthy Distributed Computing and Service (BUPT), Ministry of Education, Beijing University of Posts and Telecommunications, Beijing 100876, China;
    3. Beijing Government Computer Emergency Response Center, Beijing 100101, China;
    4. School of Software, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2014-08-26 Online:2015-06-28 Published:2015-06-28

Abstract:

A double clustering model to make web log data sets clustering was proposed based on the self-organizing map (SOM) neural networks and the fuzzy c-means (FCM) method. The first tier uses unsupervised clustering method—SOM neural network, so the number of classes it generates significantly reduces compared with the original data set, it also reduces the FCM method's rely on class initial centers. Using the FCM clustering algorithm to cluster the center points of classes generated by the first layer, the time complexity of clustering is greatly reduced. Meanwhile, the parallel coordinates visualization technology to demonstrate the log dataset was used, it is suitable to analyze the log data.

Key words: parallel coordinates, log data, cluster, self-organizing map, fuzzy c-means

CLC Number: