Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2016, Vol. 39 ›› Issue (4): 77-82.doi: 10.13190/j.jbupt.2016.04.015

• Reports • Previous Articles     Next Articles

Perceptual Hashing Algorithm for Multi-Format Audio

ZHANG Qiu-yu, XING Peng-fei, HUANG Yi-bo, DONG Rui-hong, YANG Zhong-ping   

  1. School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China
  • Received:2015-06-17 Online:2016-08-28 Published:2016-08-28

Abstract: A novel multi-format audio perceptual hashing algorithm based on dual tree complex wavelet transform(DT-CWT) was proposed. It solves the problems of the existing audio authentication algorithms, including that audio files are kept in a single format, and algorithms are not generic and low efficiency. The proposed algorithm first applies the global DT-CWT to the audio signal after pre-processing conducts to obtain the real and complex wavelet coefficients. Next, the coefficients are partitioned in some frames respectively, and the frame numbers are same. For the real wavelet coefficients, the module values of teager energy operator in every frame are computed to serve as its inter-frame feature. And then short-time energy of the new signal, which is generated to frame the frame signal, is computed to serve as its intra-frame feature. For the complex wavelet coefficients, entropy values are obtained in every frame to serve as its inter-frame feature. Finally, the above features are to conduct a hashing structure process to produce the perceptual hashing sequence. Experiments show that the proposed algorithm has good robustness and discrimination for audio signals with five different formats, with high efficiency and ability to implement tamper detection as well.

Key words: audio authentication, multi-format audio, perceptual Hashing, dual tree complex wavelet transform, tamper detection

CLC Number: