Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2017, Vol. 40 ›› Issue (2): 16-20.doi: 10.13190/j.jbupt.2017.02.003

• Papers • Previous Articles     Next Articles

A Part-of-Speech Tagging Algorithm for Essay Written by Chinese English Learner

TAN Yong-mei, YANG Lin, HU Dan   

  1. Intelligence Science and Technology Center, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2016-03-29 Online:2017-04-28 Published:2017-04-26

Abstract: A tagging algorithm about two layers part-of-speech base on word embedding was proposed. Only a few artificial features are needed in this algorithm, most features are replaced by word embedding and tagging vector that is got in the first layer. In addition, the tag set is divided into two categories, which are the tag sets of different layers. The ones which are easily to be tagged are tagged firstly in the first layer.Those tags which are hardly to be tagged as noun and verb are tagged in the second layer. Using this algorithm, the accuracy of part-of-speech tagging of essays written by Chinese English learner is improved from 95.23% to 95.63%, which outperforms the state-of-art word results of part-of-speech tagging of essays written by Chinese English learner based on vector based on word embedding.

Key words: part-of-speech tagging, Chinese English learner, essays, word vector

CLC Number: