北京邮电大学学报

  • EI核心期刊

北京邮电大学学报 ›› 2009, Vol. 32 ›› Issue (6): 24-27.doi: 10.13190/jbupt.200906.24.xiongw

• 论文 • 上一篇    下一篇

改进粒子群与支持向量机混合的特征变换

熊文;王枞   

  1. (北京邮电大学 计算机学院, 北京 100876)
  • 收稿日期:2009-03-24 修回日期:2009-08-31 出版日期:2009-12-28 发布日期:2009-12-28
  • 通讯作者: 熊文

A Hybrid Feature Transformation Method Based on Modified Particle Swarm Optimization and Support Vector Machine

Xiong Wen;Wang Cong   

  1. (School of Computer Science and Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China)
  • Received:2009-03-24 Revised:2009-08-31 Online:2009-12-28 Published:2009-12-28
  • Contact: Xiong Wen

摘要:

研究了数据挖掘中通过特征变换的数据预处理来提高支持向量机(SVM)分类精度的方法,提出了改进粒子群优化(PSO)和SVM混合的方法. 用推广t统计、Fisher判别式和随机森林的线性加权度量来排序特征,得到预选特征子集,再用启发式信息加速改进PSO搜索特征的线性变换因子,并用二进制PSO对特征变换子集进行特征选择,在后处理中通过格子搜索获取了高精度SVM分类器. 在NIPS 2003的madelon及10个UCI数据集上的实验表明,与有C-SVM分类精度相比,新方法在4个数据集上的精度更高.

关键词: 粒子群, 特征变换, 支持向量机, 特征选择, 分类

Abstract:

Linear feature transformation was investigated to improve the classification accuracy of support vector machine (SVM) by preprocessing, and a hybrid method combining the modified particle swarm optimization (PSO) with SVM was presented. In the method, features top-ranked were preselected by linear weighted combination of t-statistic extended, Fisher's discriminant ratio and random forests feature importance scores, and a modified PSO and novel heuristic info were used to attract swarm to find optimal linear feature transformation factors. Features on dataset transformed were further refined by binary PSO, and a grid method was utilized to obtain SVM with high accuracy. Experiments on madelon of neural information processing system (NIPS) 2003 and ten data sets of university of California Irvine (UCI) verify this method has higher accuracy on 4 data sets than original C-SVM.

Key words: particle swarm optimization, feature transformation, support vector machine, feature selection, classification