融合双注意力机制模型的遥感影像建筑物提取The building extraction of remote sensing image combined with dual attention mechanism model
张越,程春泉,杨书成,高瑞,常亚茹
摘要(Abstract):
针对深度学习模型提取高分辨率遥感影像建筑物效果不理想,存在漏提、误提和提取不完整等问题,该文基于U-Net提出一种融合双注意力机制和残差结构的网络模型。在U-Net的跳跃连接阶段融合了通道与空间双注意力机制,实现精细化特征融合,编码阶段使用残差模块代替普通卷积来提升模型对建筑物特征的学习能力。利用该文的模型在WHU高分辨率遥感影像数据集上进行建筑物提取实验,与SegNet、U-Net和ResUnet的结果进行对比,结果表明该方法能够有效提升建筑物提取的准确性和精度。
关键词(KeyWords): 遥感影像;建筑物提取;注意力机制;残差结构;U-Net
基金项目(Foundation): 高分航空数据处理系统项目(30-H40B02-9002-19/21);; 高分辨率对地观测重大专项(30-Y20A15-9003-17/18)
作者(Author): 张越,程春泉,杨书成,高瑞,常亚茹
DOI: 10.16251/j.cnki.1009-2307.2022.04.017
参考文献(References):
- [1]崔世勇.高分辨率遥感影像建筑物半自动提取方法研究[D].北京:中国测绘科学研究院,2009.(CUIShiyong.Semi-automatic building extraction from high resolution remote sensing images[D].Beijing:Chinese Academy of Surveying and Mapping,2009.)
- [2]姚高伟.一种高分辨率遥感图像建筑物特征分级提取算法[D].信阳:信阳师范学院,2013.(YAO Gaowei.Afractional extraction algorithm of building features from high resolution remote sensing images[D].Xinyang:Xinyang Normal University,2013.)
- [3]冯甜甜.基于高分辨率遥感数据的城市精细尺度人口估算研究[D].武汉:武汉大学,2010.(FENG Tiantian.Urban small area population estimation based on highresolution remote sensing data[D].Wuhan:Wuhan University,2010.)
- [4]孙金彦,黄祚继,周绍光,等.高分辨率遥感影像中建筑物轮廓信息矢量化[J].遥感学报,2017,21(3):396-405.(SUN Jinyan,HUANG Zuoji,ZHOU Shaoguang,et al.Building outline vectorization from high spatial resolution imagery[J].Journal of Remote Sensing,2017,21(3):396-405.)
- [5]陶超,谭毅华,蔡华杰,等.面向对象的高分辨率遥感影像城区建筑物分级提取方法[J].测绘学报,2010,39(1):39-45.(TAO Chao,TAN Yihua,CAI Huajie,et al.Object-oriented method of hierarchical urban building extraction from high-resolution remotesensing imagery[J].Acta Geodaetica et Cartographica Sinica,2010,39(1):39-45.)
- [6]林祥国,张继贤.面向对象的形态学建筑物指数及其高分辨率遥感影像建筑物提取应用[J].测绘学报,2017,46(6):724-733.(LIN Xiangguo,ZHANG Jixian.Object-based morphological building index for building extraction from high resolution remote sensing imagery[J].Acta Geodaetica et Cartographica Sinica,2017,46(6):724-733.)
- [7]崔卫红,熊宝玉,张丽瑶.多尺度全卷积神经网络建筑物提取[J].测绘学报,2019,48(5):597-608.(CUIWeihong,XIONG Baoyu,ZHANG Liyao.Multi-scale fully convolutional neural network for building extraction[J].Acta Geodaetica et Cartographica Sinica,2019,48(5):597-608.)
- [8]SHELHAMER E,LONG J,DARRELL T.Fully convolutional networks for semantic segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(4):640-651.
- [9]RONNEBERGER O,FISCHER P,BROX T.U-Net:convolutional networks for biomedical image segmentation[EB/OL].(2015-05-18)[2021-07-01].https:∥arxiv.org/pdf/1505.04597.pdf.
- [10]张玉鑫,颜青松,邓非.高分辨率遥感影像建筑物提取多路径RSU网络法[J].测绘学报,2022,55(1):135-144.(ZHANG Yuxin,YAN Qingsong,DENG Fei.Multi-path RSU network method for high-resolution remote sensing image building extraction[J].Acta Geodaetica et Cartographica Sinica,2022,55 (1):135-144.)
- [11]徐佳伟,刘伟,单浩宇,等.基于PRCUnet的高分遥感影像建筑物提取[J].地球信息科学学报,2021,23(10):1838-1849.(XU Jiawei,LIU Wei,SHANHaoyu,et al.High-resolution remote sensing image building extraction based on PRCUnet[J].Journal of Geo-Information Science,2021,23(10):1838-1849.)
- [12]肖朝霞,陈胜.图像语义分割问题研究综述[J].软件导刊,2018,17(8):6-8.(XIAO Zhaoxia,CHEN Sheng.Review of image semantic segmentation[J].Software Guide,2018,17(8):6-8.)
- [13]杨诺尔.遥感影像分类方法的研究[J].科技创新导报,2014,11(18):29-30.(YANG Nuoer.Research on methods of remote sensing image classification[J].Science and Technology Innovation Herald,2014,11(18):29-30.)
- [14]WOO S,PARK J,LEE J Y,et al.CBAM:convolutional block attention module[C]∥Proceedings of the European conference on computer vision (ECCV).[S.l.]:[s.n.],2018:3-19.
- [15]周幸,陈立福.基于双注意力机制的遥感图像目标检测[J].计算机与现代化,2020(8):1-7.(ZHOU Xing,CHEN Lifu.Object detection of remote sensing image based on dual attention mechanism[J].Computer and Modernization,2020(8):1-7.)
- [16]HE K M,ZHANG X Y,REN S Q,et al.Deep residual learning for image recognition[EB/OL].(2015-12-10)[2021-07-01].https:∥arxiv.org/pdf/1512.03385.pdf.
- [17]何超琦,魏静波,汤文超.基于金字塔瓶颈残差网络优化算法的多光谱影像分类[J].测绘地理信息,2021,46(S1):221-226.(HE Chaoqi,WEI Jingbo,TANGWenchao.Multi-spectral image classification based on pyramid bottleneck residual network optimization algorithm[J].Journal of Geomatics,2021,46(S1):221-226.)
- [18]JI S P,WEI S Q,LU M.Fully convolutional networks for multisource building extraction from an open aerial and satellite imagery data set[J].IEEE Transactions on Geoscience and Remote Sensing,2019,57(1):574-586.
- [19]张津,魏峰远,冯凡,等.基于注意力机制和编码解码网络的遥感影像分类[J].测绘科学技术学报,2020,37(6):610-615.(ZHANG Jin,WEI Fengyuan,FENGFan,et al.Remote sensing imagery classification based on attention mechanism and encoder-decoder network[J].Journal of Geomatics Science and Technology,2020,37(6):610-615.)