Journal of Beijing University of Posts and Telecommunications

  • EI-indexed core journal

Journal of Beijing University of Posts and Telecommunications ›› 2018, Vol. 41 ›› Issue (4): 69-75. doi: 10.13190/j.jbupt.2018-010

• Papers

Improving Semantic Autoencoder Zero-Shot Classification Algorithm by Metric Learning

CHEN Xiang-feng1,2, CHEN Wen-bai1,2

  1. College of Automation, Beijing Information Science and Technology University, Beijing 100192, China;
  2. MOE Key Laboratory of Machine Perception, Peking University, Beijing 100871, China
  • Received: 2018-01-11 Online: 2018-08-28 Published: 2018-10-09
  • About the authors: CHEN Xiang-feng (陈祥凤, b. 1993), female, master's student, E-mail: 1057133456@qq.com; CHEN Wen-bai (陈雯柏, b. 1975), male, associate professor, master's supervisor.
  • Funding:
    2018 Open Project of the MOE Key Laboratory of Machine Perception (K-2018-08)

Abstract: To improve the robustness of similarity measurement in zero-shot image classification, a metric learning method for zero-shot classification is introduced. The method is built from autoencoders and learns an optimal metric function in the feature-aligned semantic embedding space. This function measures the similarity between the features of a test sample and the semantic features of the class labels, and the class label is then predicted by a nearest-neighbor rule, which avoids classification errors caused by an unsuitable distance function. Experiments show that, compared with algorithms that use traditional distance metrics, the proposed method reduces the recognition error rate, reaching classification accuracies of 94.7%, 63.7% and 28.59% on the public AWA, CUB and ImNet-2 datasets, respectively. The experiments also show that mapping in the semantic-to-visual direction yields recognition accuracy 2.5%~10.1% higher than mapping in the opposite direction.

Key words: zero-shot classification, metric learning, semantic autoencoder, semantic embedding space, distance function
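
The prediction step described in the abstract (measure the distance between an embedded test sample and each class's semantic features, then pick the nearest class) can be illustrated with a minimal sketch. It assumes the learned metric takes a Mahalanobis form d(x, s) = (x - s)^T M (x - s), which the abstract does not state; the function names, the class names, and all numbers are hypothetical and are not taken from the paper.

```python
from typing import Dict, Sequence

Vector = Sequence[float]
Matrix = Sequence[Sequence[float]]


def metric_distance(x: Vector, s: Vector, M: Matrix) -> float:
    """Squared distance (x - s)^T M (x - s) under a metric matrix M (assumed form)."""
    diff = [xi - si for xi, si in zip(x, s)]
    # Compute M @ diff, then take the inner product with diff.
    m_diff = [sum(m_ij * d_j for m_ij, d_j in zip(row, diff)) for row in M]
    return sum(d_i * md_i for d_i, md_i in zip(diff, m_diff))


def predict_label(x: Vector, prototypes: Dict[str, Vector], M: Matrix) -> str:
    """Return the class whose semantic prototype is nearest to x under M."""
    return min(prototypes, key=lambda label: metric_distance(x, prototypes[label], M))


# Hypothetical toy data: 2-D semantic embeddings for two unseen classes.
prototypes = {"zebra": [1.0, 0.0], "whale": [0.0, 3.0]}
x = [0.6, 1.6]  # a test sample already mapped into the same space

euclidean = [[1.0, 0.0], [0.0, 1.0]]  # identity matrix: plain squared Euclidean distance
learned = [[1.0, 0.0], [0.0, 0.05]]   # stand-in for a learned metric; values are invented

print(predict_label(x, prototypes, euclidean))
print(predict_label(x, prototypes, learned))
```

With these toy values the two metrics disagree on the nearest class, which is the situation the abstract has in mind when it attributes classification errors to an unsuitable distance function.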
