北京邮电大学学报

  • EI核心期刊

北京邮电大学学报 ›› 2016, Vol. 39 ›› Issue (4): 77-82.doi: 10.13190/j.jbupt.2016.04.015

• 研究报告 • 上一篇    下一篇

多格式音频感知哈希算法

张秋余, 省鹏飞, 黄羿博, 董瑞洪, 杨仲平   

  1. 兰州理工大学 计算机与通信学院, 兰州 730050
  • 收稿日期:2015-06-17 出版日期:2016-08-28 发布日期:2016-08-28
  • 作者简介:张秋余(1966-),男,研究员,博士生导师,E-mail:zhangqylz@163.com.
  • 基金资助:
    国家自然科学基金项目(61363078);甘肃省自然科学基金项目(1310RJYA004)

Perceptual Hashing Algorithm for Multi-Format Audio

ZHANG Qiu-yu, XING Peng-fei, HUANG Yi-bo, DONG Rui-hong, YANG Zhong-ping   

  1. School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China
  • Received:2015-06-17 Online:2016-08-28 Published:2016-08-28

摘要: 提出一种基于双树复小波变换的多格式音频感知哈希算法,解决了现有音频认证算法音频格式单一、算法不通用、效率低的问题. 首先对预处理后的音频信号进行全局双树复小波变换,获得信号的实小波和复小波系数,对它们分别分帧,帧数相同;对实小波系数计算每帧信号Teager能量算子的模值,作为实小波系数的帧间特征,接着对每帧信号再分帧,提取再分帧帧信号的短时能量作为实小波系数的帧内特征;对复小波系数求取每帧信号的熵值作为复小波系数的帧间特征;最后对上述特征分别进行哈希构造,生成感知哈希序列. 实验结果表明,该算法对5种不同格式的音频都具有强鲁棒性,且区分性好,效率高,并能实现小范围篡改检测.

关键词: 音频认证, 多格式音频, 感知哈希, 双树复小波变换, 篡改检测

Abstract: A novel multi-format audio perceptual hashing algorithm based on dual tree complex wavelet transform(DT-CWT) was proposed. It solves the problems of the existing audio authentication algorithms, including that audio files are kept in a single format, and algorithms are not generic and low efficiency. The proposed algorithm first applies the global DT-CWT to the audio signal after pre-processing conducts to obtain the real and complex wavelet coefficients. Next, the coefficients are partitioned in some frames respectively, and the frame numbers are same. For the real wavelet coefficients, the module values of teager energy operator in every frame are computed to serve as its inter-frame feature. And then short-time energy of the new signal, which is generated to frame the frame signal, is computed to serve as its intra-frame feature. For the complex wavelet coefficients, entropy values are obtained in every frame to serve as its inter-frame feature. Finally, the above features are to conduct a hashing structure process to produce the perceptual hashing sequence. Experiments show that the proposed algorithm has good robustness and discrimination for audio signals with five different formats, with high efficiency and ability to implement tamper detection as well.

Key words: audio authentication, multi-format audio, perceptual Hashing, dual tree complex wavelet transform, tamper detection

中图分类号: