J Shanghai Jiaotong Univ Sci ›› 2026, Vol. 31 ›› Issue (2): 348-358.doi: 10.1007/s12204-024-2721-4
收稿日期:2023-06-29
接受日期:2023-08-27
出版日期:2026-04-01
发布日期:2024-04-22
陶洪洁1,2,李兆飞1,2 ,祁飞3 ,陈景珏3 ,周豪1,2
Received:2023-06-29
Accepted:2023-08-27
Online:2026-04-01
Published:2024-04-22
摘要: 针对经典语义分割网络在高分辨率遥感图像语义分割效果不佳、复杂场景下分割性能受限、网络参数量多,训练网络代价高等问题,从网络参数量、计算量和性能三个方面综合考虑,提出一种基于改进DeepLabv3+的高分辨率遥感图像高效分割方法。该方法首先使用更轻量级的MobileNetV2网络替换DeepLabv3+原始主干网络Xception进行特征提取;其次在特征提取模块获得的深层有效特征之后加入轻量级的通用卷积注意力模块(CBAM),在减少网络参数量的同时增强网络特征提取能力;最后在特征提取模块获得的浅层特征后引入坐标注意力机制,使其更关注图像中有效的特征信息,忽略无关的背景信息。实验结果表明,该方法在高分图像数据集分割任务中mIoU达到75.33%,分别比SegNet、PspNet和U-Net等主流语义分割网络高12.49%、3.16%和1.62%,同时该模型的网络参数量为6.02×106,浮点运算量为26.45 GFLOPs,在计算效率和分割精度之间达到了较好的平衡,对边缘计算具有较高的应用价值。
中图分类号:
. 改进DeepLabv3+的高分辨率遥感图像分割方法[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(2): 348-358.
Tao Hongjie, Li Zhaofei, Qi Fei, Chen Jingjue, Zhou Hao. High Resolution Remote Sensing Image Segmentation Method with Improved DeepLabv3+[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(2): 348-358.
| [1] LI D R, WANG M, JIANG J. China’s high-resolution optical remote sensing satellites and their mapping applications [J]. Geo-spatial Information Science, 2021, 24(1): 85-94. [2] ZHANG J, JING H T, FAN S H. Sea-land segmentation for remote sensing imagery based on coastline database [J]. Electronic Measurement Technology, 2020, 43(23): 115-120 (in Chinese). [3] MATIKAINEN L, KARILA K. Segment-based land cover mapping of a suburban area—Comparison of high-resolution remotely sensed datasets using classification trees and test field points [J]. Remote Sensing, 2011, 3(8): 1777-1804. [4] TIAN X, WANG L, DING Q. Review of image semantic segmentation based on deep learning [J]. Journal of Software, 2019, 30(2): 440-468 (in Chinese). [5] ERUS G, LOMÉNIE N. How to involve structural modeling for cartographic object recognition tasks in high-resolution satellite images? [J]. Pattern Recognition Letters, 2010, 31(10): 1109-1119. [6] OTSU N. A threshold selection method from gray-level histograms [J]. IEEE Transactions on Systems, Man, and Cybernetics, 1979, 9(1): 62-66. [7] BEZDEK J C, EHRLICH R, FULL W. FCM: The fuzzy c-means clustering algorithm [J]. Computers & Geosciences, 1984, 10(2/3): 191-203. [8] PENG B, ZHANG L, ZHANG D. A survey of graph theoretical approaches to image segmentation [J]. Pattern Recognition, 2013, 46(3): 1020-1038. [9] MITRA P, SHANKAR B U, PAL S K. Segmentation of multispectral remote sensing images using active support vector machines [J]. Pattern Recognition Letters, 2004, 25(9): 1067-1074. [10] POGGI G, SCARPA G, ZERUBIA J B. Supervised segmentation of remote sensing images based on a tree-structured MRF model [J]. IEEE Transactions on Geoscience and Remote Sensing, 2005, 43(8): 1901-1911. [11] SHELHAMER E, LONG J, DARRELL T. Fully convolutional networks for semantic segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4): 640-651. [12] RONNEBERGER O, FISCHER P, BROX T. U-net: Convolutional networks for biomedical image segmentation[M]//Medical image computing and computer-assisted intervention – MICCAI 2015. Cham: Springer, 2015: 234-241. [13] BADRINARAYANAN V, KENDALL A, CIPOLLA R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481-2495. [14] ZHAO H S, SHI J P, QI X J, et al. Pyramid scene parsing network [C]//2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 6230-6239. [15] CHEN L C, PAPANDREOU G, KOKKINOS I, et al. Semantic image segmentation with deep convolutional nets and fully connected CRFs [DB/OL]. (2014-12-22). https://arxiv.org/abs/1412.7062 [16] CHEN L C, PAPANDREOU G, KOKKINOS I, et al. DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4): 834-848. [17] HE K M, ZHANG X Y, REN S Q, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1904-1916. [18] CHEN L C, PAPANDREOU G, SCHROFF F, et al. Rethinking atrous convolution for semantic image segmentation [DB/OL]. (2017-06-17). https://arxiv.org/abs/1706.05587 [19] DU S J, DU S H, LIU B, et al. Incorporating DeepLabv3+ and object-based image analysis for semantic segmentation of very high resolution remote sensing images [J]. International Journal of Digital Earth, 2021, 14(3): 357-378. [20] ZENG H B, PENG S Q, LI D X. Deeplabv3+ semantic segmentation model based on feature cross attention mechanism [J]. Journal of Physics: Conference Series, 2020, 1678(1): 012106. [21] HUANG C, YANG J, LIU Y, et al. Remote sensing image segmentation algorithm based on improved DeeplabV3+[J]. Electronic Measurement Technology, 2022, 45(21): 148-155 (in Chinese). [22] HOU Q B, ZHOU D Q, FENG J S. Coordinate attention for efficient mobile network design [C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville: IEEE, 2021: 13708-13717. [23] WANG Z M, WANG J S, YANG K, et al. Semantic segmentation of high-resolution remote sensing images based on a class feature attention mechanism fused with Deeplabv3+ [J]. Computers & Geosciences, 2022, 158: 104969. [24] GUO M H, LU C G, HOU Q B, et al. SegNeXt: Rethinking convolutional attention design for semantic segmentation [C]// 36th Conference on Neural Information Processing Systems. New Orleans: NIPS, 2022: 1-17. [25] WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[M]// Computer vision – ECCV 2018. Cham: Springer, 2018: 3-19. [26] SANDLER M, HOWARD A G, ZHU M L, et al. Inverted residuals and linear bottlenecks: Mobile networks for classification, detection and segmentation [DB/OL]. (2018-01-13). https://arxiv.org/abs/1801.04381 [27] TONG X Y, XIA G S, LU Q, et al. Land-cover classification with high-resolution remote sensing images using transferable deep models[J]. Remote Sensing of Environment, 2020, 237: 111322. |
| [1] | . 用于水下图像增强的全局密集双分支级联网络[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(2): 458-474. |
| [2] | 杨明慧, 韦亚利, 卢俊言, 李昕海. 基于窗口注意力机制的可见光-SAR跨模态目标匹配方法[J]. 空天防御, 2026, 9(1): 73-79. |
| [3] | 夏筱彦, 张宇, 胡锡坤, 钟平. 基于扩散模型的无人机遥感目标检测物理对抗攻击方法研究[J]. 空天防御, 2026, 9(1): 52-62. |
| [4] | 刘轩, 兰晓宸, 徐大鹏, 何良, 刘茂珅. 基于深度学习时序特征增强的海杂波抑制方法[J]. 空天防御, 2026, 9(1): 36-45. |
| [5] | . 基于改进CLRNet的铁轨线检测算法[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(5): 923-934. |
| [6] | . 基于多注意力机制的轻量化人体姿态估计[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(5): 899-910. |
| [7] | . 基于多特征提取方法的多场景烟雾检测[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(5): 866-879. |
| [8] | . 基于ALBERT的中国诗酒文化命名实体识别[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(5): 1065-1072. |
| [9] | . CenterLineFormer:基于单车载相机的车道中心线生成方法[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(5): 1009-1017. |
| [10] | . 迁移学习和注意机制融合用于CT图像COVID-19病灶分割的计算机辅助诊断[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(3): 566-581. |
| [11] | 王可, 刘奕阳, 杨杰, 鲁爱国, 李哲, 徐明亮. 基于自适应特征增强和融合的舰载机着舰拉制状态识别[J]. 上海交通大学学报, 2025, 59(2): 274-282. |
| [12] | 汪维, 阮雅端, 顾鹏, 陈启美. 基于融合注意力机制DAUNet的最大裂缝宽度计算[J]. 上海交通大学学报, 2025, 59(12): 1866-1877. |
| [13] | 詹泽辉, 钟铭恩, 袁彬淦, 谭佳威, 范康. 随机平视摄像条件下的路边车辆违停检测[J]. 上海交通大学学报, 2025, 59(10): 1568-1580. |
| [14] | 丁黎辉1, 2, 付立军1, 3, 杨光4, 5, 6, 万林4, 5, 常志军7. 基于视频的婴儿癫痫性痉挛综合征检测:建模、检测与评估[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(1): 1-9. |
| [15] | 赵寅杰1,侯润萍1,曾琬琴2,秦玉磊1,沈天乐2,徐志勇2,傅小龙2,沈红斌1. 在三维医学图像分割中位置信息可以作为强监督信息[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(1): 121-129. |
| 阅读次数 | ||||||
|
全文 |
|
|||||
|
摘要 |
|
|||||