Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2019, Vol. 42 ›› Issue (2): 83-89.doi: 10.13190/j.jbupt.2018-228

• Reports • Previous Articles     Next Articles

Voice Activity Detection Method Based on MFPH

WU Xin-zhong, XIA Ling-xiang, ZHANG Xu, ZHOU Cheng   

  1. School of Information and Control Engineering, China University of Mining and Technology, Jiangsu Xuzhou 221116, China
  • Received:2018-09-11 Online:2019-04-28 Published:2019-04-09

Abstract: In order to solve the problem that the accuracy of traditional voice activity detection algorithms is low in the low signal-to-noise ratio (SNR) environment,a voice activity detection algorithm based on product of spectral entropy and Mel (MFPH) was proposed. Firstly, the first dimensional parameter MFCC0 of Mel frequency spectrum coefficient of the speech signal with noisy was extracted, and the product of MFCC0 and spectral entropy was taken as fusion characteristic parameter of finally distinguishing speech segment from background noise. Then, the threshold value of MFPH characteristic parameters was estimated adaptively based on combination of fuzzy C-means clustering algorithm (FCM) and Bayesian information criterion (BIC). Finally, the double-threshold method was adopted for the voice activity detection. Experiments show that the accuracy of the proposed method is greatly improved in the -5~15 dB low SNR environment compared with traditional methods.

Key words: voice activity detection, Mel frequency spectrum coefficient, spectral entropy, spectral entropy Mel product, double-threshold method, low signal-to-noise ratio

CLC Number: