Retinal Blood Vessel Segmentation Based on Transformer and MLP

Journal of Beijing University of Posts and Telecommunications, 2023, Vol. 46, Issue 1: 26-31.

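RONG Zhenyu, LIU Jianyi

Beijing University of Posts and Telecommunications

Received: 2021-12-01 Revised: 2022-03-23 Online: 2023-02-28 Published: 2023-02-22
Corresponding author: LIU Jianyi E-mail: liujy@bupt.edu.cn
Supported by:
National Natural Science Foundation of China; Major Science and Technology Innovation Project of Shandong Province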


Abstract

Abstract: To address poor segmentation quality, over-fitting, and the imbalance between positive and negative samples in fundus blood vessel segmentation, a retinal blood vessel segmentation algorithm combining a Transformer and a multilayer perceptron (MLP) is proposed. First, to prevent over-fitting, several data augmentation operations are applied to the training images before they are fed to the model. Second, a multi-scale encoder built from Transformers fused with convolution modules extracts features from the image, yielding robust multi-level feature information. Finally, an MLP-based decoder performs pixel-level classification on the feature maps. To address the sample imbalance, a combined loss function of Tversky loss and binary cross-entropy loss is introduced. Experimental results on multiple datasets show that the proposed algorithm performs well and outperforms other existing network models.

Key words: deep learning, multi-head self-attention, multilayer perceptron, image segmentation, retinal blood vessel
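
The abstract names a combined Tversky and binary cross-entropy loss but does not give its exact form or hyperparameters. The following is a minimal sketch of one common formulation, not the authors' implementation. The function names, the Tversky weights `alpha` and `beta`, the mixing factor `weight`, and the use of flat Python lists of per-pixel probabilities are all assumptions made for illustration.

```python
import math


def tversky_loss(probs, targets, alpha=0.3, beta=0.7, eps=1e-7):
    """Soft Tversky loss: 1 - TP / (TP + alpha * FP + beta * FN).

    alpha and beta are assumed values, not taken from the paper.
    beta > alpha penalizes missed vessel pixels (false negatives)
    more heavily than false alarms (false positives).
    """
    tp = sum(p * t for p, t in zip(probs, targets))        # soft true positives
    fp = sum(p * (1 - t) for p, t in zip(probs, targets))  # soft false positives
    fn = sum((1 - p) * t for p, t in zip(probs, targets))  # soft false negatives
    return 1.0 - (tp + eps) / (tp + alpha * fp + beta * fn + eps)


def bce_loss(probs, targets, eps=1e-7):
    """Mean binary cross-entropy over all pixels."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total -= t * math.log(p) + (1 - t) * math.log(1.0 - p)
    return total / len(probs)


def combined_loss(probs, targets, weight=0.5, alpha=0.3, beta=0.7):
    """Weighted sum of Tversky and BCE losses.

    weight is a hypothetical mixing factor; the abstract does not
    say how the two terms are balanced.
    """
    return (weight * tversky_loss(probs, targets, alpha, beta)
            + (1.0 - weight) * bce_loss(probs, targets))
```

Here `probs` would hold the per-pixel vessel probabilities produced by the decoder and `targets` the binary ground-truth mask, both flattened to the same length. Because vessel pixels are a small fraction of a fundus image, the Tversky term keeps the loss sensitive to the foreground class, while the cross-entropy term supplies a per-pixel gradient.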

CLC Number: TP391.41

RONG Zhenyu, LIU Jianyi. Retinal Blood Vessel Segmentation Based on Transformer and MLP[J]. Journal of Beijing University of Posts and Telecommunications, 2023, 46(1): 26-31.
