Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2007, Vol. 30 ›› Issue (3): 1-5.doi: 10.13190/jbupt.200703.1.niuk

• Papers •     Next Articles

Subspace Clustering through Attribute Clustering

NIU Kun1, ZHANG Shu-bo2, CHEN Jun-liang1   

  1. 1. State Key Laboratory of Networking and Switching Technology, Beijing 100876, China;
    2. Dept. of Strategy Research, China Telecom Beijing Research Institute, Beijing 100035, China
  • Received:2006-08-07 Revised:1900-01-01 Online:2007-06-30 Published:2007-06-30
  • Contact: NIU Kun

Abstract:

Many recently proposed subspace clustering methods suffer from two severe problems: First, the algorithms typically scale exponentially with the data dimensionality or the subspace dimensionality of clusters. Second, the clustering results are often sensitive to input parameters. A fast algorithm of subspace clustering using attribute clustering is proposed to overcome these limitations. This algorithm first filters out redundant attributes by computing the gini coefficient. To evaluate the correlation of each two non-redundant attributes, the relation matrix of non-redundant attributes is constructed based on the relation function of two dimensional united gini coefficients. After applying overlapping clustering algorithm on relation matrix, the candidate of all interesting subspaces is achieved. Finally, all subspace clusters can be gotten by clustering on interesting subspaces. Experiments on both synthesis and real datasets show that the new algorithm not only achieves a significant gain of runtime and quality to find subspace clusters but also is insensitive to input parameters.

Key words: subspace clustering, high dimensional data, attribute clustering

CLC Number: