一种基于ResNet网络特征的视觉目标跟踪算法

doi:10.13190/j.jbupt.2019-071

北京邮电大学学报 ›› 2020, Vol. 43 ›› Issue (2): 129-134.doi: 10.13190/j.jbupt.2019-071

• 研究报告 • 上一篇

一种基于ResNet网络特征的视觉目标跟踪算法

马素刚^1,2, 赵祥模¹, 侯志强², 王忠民^2,3, 孙韩林²

1. 长安大学信息工程学院, 西安 710064;
2. 西安邮电大学计算机学院, 西安 710121;
3. 西安邮电大学陕西省网络数据分析与智能处理重点实验室, 西安 710121

收稿日期:2019-04-30 发布日期:2020-04-28
作者简介:马素刚(1982-),男,高级工程师,E-mail:msg@xupt.edu.cn.
基金资助:
国家自然科学基金项目（61571458，61473309）；陕西省重点研发计划项目（2018ZDCXL-GY-04-02）；陕西省教育厅专项科研计划项目（17JK0696）；西安市科技计划项目（GXYD17.17）

A Visual Object Tracking Algorithm Based on Features Extracted by Deep Residual Network

MA Su-gang^1,2, ZHAO Xiang-mo¹, HOU Zhi-qiang², WANG Zhong-min^2,3, SUN Han-lin²

1. School of Information Engineering, Chang'an University, Xi'an 710064, China;
2. School of Computer Science and Technology, Xi'an University of Posts and Telecommunications, Xi'an 710121, China;
3. Shanxi Key Laboratory of Network Data Analysis and Intelligent Processing, Xi'an University of Posts and Telecommunications, Xi'an 710121, China

Received:2019-04-30 Published:2020-04-28

摘要/Abstract

摘要： 针对复杂场景下目标容易丢失的问题，提出了一种基于深度残差网络（ResNet）特征的尺度自适应视觉目标跟踪算法.首先，通过ResNet提取图像感兴趣区域的多层深度特征，考虑到修正线性单元（ReLU）激活函数对目标特征的抑制作用，在ReLU函数之前选取用于提取目标特征的卷积层；然后，在提取的多层特征上分别构建基于核相关滤波的位置滤波器，并对得到的多个响应图进行加权融合，选取响应值最大的点即为目标中心位置.目标位置确定后，对目标进行多个尺度采样，分别提取不同尺度图像的方向梯度直方图（fHOG）特征，在此基础上构建尺度相关滤波器，从而实现对目标尺度的准确估计.在视频集OTB100中与其他6种相关算法进行了比较，实验结果表明，所提算法取得了较高的跟踪成功率和精确度，能够较好地适应目标的尺度变化、背景干扰等复杂场景.

关键词: 视觉目标跟踪, 深度残差网络, 核相关滤波, 深度学习, 尺度估计

Abstract: Because the objects are easy to be lost in complex scenes, a scale adaptive visual object tracking algorithm based on deep residual network (ResNet) features is proposed. Firstly, the ResNet is used to extract the multi-layer deep features of the image region of interest. Considering the restraining effect of rectified linear units (ReLU) activation function on target features, only the convolutional layers before ReLU function are selected. Secondly, the translation filters based on kernelized correlation filter are constructed in the extracted multi-layer features, and then the weighted fusion of the multiple response maps is carried out to obtain the target position with the largest response value. After the target location is determined, the target is sampled at multiple scales, and the felzenszwalb histogram of oriented gradients (fHOG) features of different scale images are extracted separately. On this basis, a scale correlation filter is constructed to estimate the target scale accurately. Comparing with six related algorithms in OTB100, an experiment is carried. It is shown that the proposed algorithm achieves high tracking success rate and accuracy, and can adapt to scale variation, background clutter and other complex scenes.

Key words: visual object tracking, deep residual network, kernelized correlation filter, deep learning, scale estimation

中图分类号:

TP391.4

马素刚, 赵祥模, 侯志强, 王忠民, 孙韩林. 一种基于ResNet网络特征的视觉目标跟踪算法[J]. 北京邮电大学学报, 2020, 43(2): 129-134.

MA Su-gang, ZHAO Xiang-mo, HOU Zhi-qiang, WANG Zhong-min, SUN Han-lin. A Visual Object Tracking Algorithm Based on Features Extracted by Deep Residual Network[J]. Journal of Beijing University of Posts and Telecommunications, 2020, 43(2): 129-134.

[1]	刘阳, 滕颖蕾, 牛涛, 郅佳琳. 基于深度强化学习的滤波器剪枝方案[J]. 北京邮电大学学报, 2023, 46(3): 31-36.
[2]	靳梦凡, 黄智濒, 储志强. 复杂环境下多模态指导的点云补全方法[J]. 北京邮电大学学报, 2023, 46(3): 103-108.
[3]	荣震宇刘建毅. 基于Transformer和MLP的眼底血管分割算法[J]. 北京邮电大学学报, 2023, 46(1): 26-31.
[4]	刘金桐, 杨国兴, 刘晓鸿, 王光宇. 针对心电图分类模型的平滑攻击算法[J]. 北京邮电大学学报, 2022, 45(4): 44-50.
[5]	黄柳婷, 刘可欣, 牛凯, 常春, 贺志强. 基于深度学习的哮喘患者CT影像黏液栓自动识别[J]. 北京邮电大学学报, 2022, 45(4): 58-63.
[6]	柳长源, 虎浩媛, 毕晓君. 双线性融合网络的驾驶员分心行为识别[J]. 北京邮电大学学报, 2022, 45(2): 79-84.
[7]	赵海英, 朱会, 侯小刚. 基于改进EMA单元的传统服饰图像语义分割[J]. 北京邮电大学学报, 2022, 45(1): 69-74.
[8]	张滨宇, 赵衍运, 杜昀昊, 万俊峰, 佟知航. 一种基于深度学习的PCB图像字符检测方法[J]. 北京邮电大学学报, 2022, 45(1): 108-114.
[9]	贾珺, 冯春燕, 夏海轮, 张天魁, 李成钢. 基于样本均衡与特征交互的通信网络故障预测方法[J]. 北京邮电大学学报, 2021, 44(6): 59-66.
[10]	普运伟, 郭江, 刘涛涛, 吴海潇. 基于多学习单元卷积神经网络的雷达辐射源信号识别[J]. 北京邮电大学学报, 2021, 44(6): 74-82.
[11]	王艺霏, 莫爽, 吴文睿, 范少华, 肖丁. 基于内外卷积网络的网络入侵检测[J]. 北京邮电大学学报, 2021, 44(5): 94-100.
[12]	高慧, 张继威, 来扬, 王文东. 深度学习的人体图像半自动标注系统[J]. 北京邮电大学学报, 2021, 44(1): 104-109.
[13]	蒲悦逸, 王文涵, 朱强, 陈朋朋. 基于CNN-ResNet-LSTM模型的城市短时交通流量预测算法[J]. 北京邮电大学学报, 2020, 43(5): 9-14.
[14]	马向亮, 李冰, 杨丹, 黄克振, 段晓毅. 基于深度学习的类SM4算法S盒逆向分析[J]. 北京邮电大学学报, 2020, 43(5): 118-124.
[15]	张天麒, 康波, 孟祥飞, 刘奕琳, 周颖. 基于U-Net的颅内出血识别算法[J]. 北京邮电大学学报, 2020, 43(3): 92-98.

一种基于ResNet网络特征的视觉目标跟踪算法

A Visual Object Tracking Algorithm Based on Features Extracted by Deep Residual Network

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价