Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2017, Vol. 40 ›› Issue (4): 54-59.doi: 10.13190/j.jbupt.2017.04.009

• Papers • Previous Articles     Next Articles

Robust Outlier Detection Algorithm Based on k-Nearest Neighbor Region Center Migration

ZHAO Jian-long1, QU Hua1,2, ZHAO Ji-hong2,3   

  1. 1. School of Software Engineering, Xi'an Jiaotong University, Xi'an 710049, China;
    2. School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China;
    3. School of Communication and Information Engineering, Xi'an University of Posts and Telecommunications, Xi'an 710061, China
  • Received:2016-09-12 Online:2017-08-28 Published:2017-07-10

Abstract: Considering the distance- and density-based outlier detection algorithms are often sensitive to a nearest neighbor parameter k, termed k-center offset outlier factor (COOF), a robust outlier detection criterion for the characterization of abnormal degree of each data object was proposed. Each data object is included in a region within its k nearest neighbors, and the center of region will migrate with the change of nearest neighbor parameter k. In general, the variation of center offset of k nearest neighbor region is greater for an outlier than a normal object. According to this observation, for each data object, COOF is defined as the accumulation of this kind of offset when increasing the nearest neighbor parameter from one to k. Finally, the outlier detection algorithm based on COOF was also presented. Through artificial data and real data experimental simulations show that COOF is insensitive to parameter k, and has more stable and accurate outlier detection performance compared to k nearest neighbor, local distance-based outlier factor and local outlier factor, which are the distance-based method and density-based method respectively.

Key words: outlier detection, k nearest neighbor, local outlier factor, center offset outlier factor

CLC Number: