北京邮电大学学报

  • EI核心期刊

北京邮电大学学报 ›› 2025, Vol. 48 ›› Issue (2): 73-79.

• 论文 • 上一篇    下一篇

轻量级多尺度大核卷积的图像超分辨率重建网络

张锡英, 王世宇, 李金凤, 边继龙
  

  1. 1. 东北林业大学 计算机与控制工程学院 2. 牡丹江师范学院 计算机与信息技术学院
  • 收稿日期:2024-01-31 修回日期:2024-03-25 出版日期:2025-04-30 发布日期:2025-04-30
  • 通讯作者: 边继龙 E-mail:bianjilong@nefu.edu.cn
  • 基金资助:
    牡丹江师范学院青年骨干项目; 黑龙江省科研业务

Lightweight Multi-Order Gated Aggregation Network for Image Super-Resolution

  • Received:2024-01-31 Revised:2024-03-25 Online:2025-04-30 Published:2025-04-30

摘要: 基于深度学习的技术显著提高了图像超分辨率网络的性能,其中轻量级卷积神经网络 (CNNs) 具有参数量较小、模型复杂度低等特点,但性能受到极大限制,这影响了超分辨率方法的实际应用。为解决此类问题,提出了一种轻量级多尺度大核卷积的超分辨率重建网络,该方法充分利用门控机制和大核卷积,提出了一种多阶门控聚合块,使用串联的门控单元以获得全局信息和局部信息;提出了一种多阶大核注意力块,通过大核卷积获得更大感受野,并提取不同尺度的图像特征;提出了一种聚焦门控聚合块,使用跳跃连接加强模型重建图像的能力。为验证本模型的效果,在模型参数量、响应速度等评价指标,以及主观视觉效果上进行了实验,并对模型各模块进行了消融分析实验。实验表明,与轻量级网络模型相比,所提出的方法在参数量、响应速度相近甚至减少的同时,在4个基准数据集上取得的峰值信噪比和结构相似度均有明显提升,更好地平衡了模型复杂度和性能,并且取得了更有效的主观视觉效果,达到更优秀的图像重建能力。

关键词: 超分辨率重建, 卷积神经网络, 注意力机制, 多尺度特征

Abstract: Deep learning-based technologies have significantly improved the performance of image super-resolution networks, where lightweight convolutional neural networks (CNNs) are characterized by a small number of parameters and low model complexity. However, their performance is greatly limited, affecting the practical application of super-resolution methods. To address such issues, a lightweight multi-scale large kernel convolution super-resolution reconstruction network is proposed, which fully utilizes gating mechanisms and large kernel convolution. A multi-stage gated aggregation block is introduced, using a series of gated units to obtain both global and local information; a multi-stage large kernel attention block is proposed, which uses large kernel convolution to achieve a larger receptive field and extract features from images at different scales; a focused gated aggregation block is introduced, using skip connections to enhance the model's ability to reconstruct images. To validate the effectiveness of this model, experiments were conducted on evaluation metrics such as the number of model parameters and response speed, as well as subjective visual effects, and ablation analysis experiments were performed on each module of the model. The experiments show that compared to lightweight network models in recent years, the proposed method achieves a significant improvement in peak signal-to-noise ratio and structural similarity on four benchmark datasets while maintaining a similar or even reduced number of parameters and response speed. This better balances model complexity and performance, and achieves more effective subjective visual effects, leading to superior image reconstruction capabilities.

Key words: super-resolution reconstruction, convolutional neural network, attention mechanism, multi-scale features

中图分类号: