基于自监督对比学习的寒旱区遥感图像河流识别方法

doi:10.6041/j.issn.1000-1298.2023.06.013

首页 > 过刊浏览>2023年第54卷第6期 >125-135. DOI:10.6041/j.issn.1000-1298.2023.06.013

基于自监督对比学习的寒旱区遥感图像河流识别方法
DOI:
                        10.6041/j.issn.1000-1298.2023.06.013
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金项目（61861025、62241106）、2021年陇原青年创新创业人才（团队）项目、甘肃省高等学校青年博士基金项目（2021QB-49）、〖JP〗甘肃省高校大学生就业创业能力提升工程项目（2021-35）、智能化隧道监理机器人研究项目（中铁科研院（科研）字2020-KJ016-Z016-A2）和四电BIM工程与智能应用铁路行业重点实验室开放项目（BIMKF-2021-04）

River Extraction Method from Remote Sensing Images of Cold and Arid Regions Based on Self-supervised Comparative Learning

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

针对遥感图像河流数据样本人工标注成本高且难以大量获取，以及网络在河流图像边缘细节提取的效果不佳问题，提出一种通过自监督对比学习方式利用大量无标签遥感河流图像数据进行编码器预训练，并使用少量标签数据对预训练后的编码器进行微调，同时在编解码结构中使用一种新的非均匀采样方式的语义分割网络。自监督对比学习可以利用大量无标签数据进行前置任务模型训练，仅需少量标签数据对下游河流提取任务模型微调即可；非均匀采样方式能够通过对高频区域密集采样、对低频区域稀疏采样的方式获得图像中不同类别之间清晰的边界信息和同类别区域中的细节信息，减少模型的冗余度。在河流数据集上的实验表明，利用360幅有标签数据对预训练后的网络进行微调，其像素准确率、交并比、召回率分别达到90.4%、68.6%和83.2%，与使用1200幅有标签数据训练的有监督AFR-LinkNet网络性能相当；在使用全部数据标签进行微调后，网络的像素准确率、交并比、召回率分别达到93.7%、73.2%和88.5%，相比AFR-LinkNet、DeepLabv3+、LinkNet、ResNet50和UNet网络，像素准确率分别提高3.1、7.6、12.3、14.9、19.8个百分点，交并比分别提高3.5、8.7、10.5、16.9、24.0个百分点，召回率分别提高2.1、4.8、6.7、9.4、12.9个百分点，验证了模型在河流图像上精准提取河流的有效性。该算法模型对于解决缺少大量有标签数据和分析我国寒旱区河流分布、水灾害预警、水资源合理利用以及农业灌溉发展等具有重要意义。

Abstract:

Aiming at the problems of high cost of manual labeling of river data samples in remote sensing images and difficulty in obtaining a large number of them, as well as poor effect of network extraction of river image edge details, a self-supervised comparative learning method was proposed to use a large number of unlabeled remote sensing river image data for encoder pre-training, and a small amount of label data was used to fine-tune the encoder after pre-training. Meanwhile, a semantic segmentation network based on non-uniform sampling was used in the codec structure. Self-supervised comparative learning can use a large number of unlabeled data for pre-task model training, and only a small amount of label data was needed to fine-tune the downstream river extraction task model. The non-uniform sampling method can obtain clear boundary information between different categories in the image and details in the same category by intensive sampling in the high frequency region and sparse sampling in the low frequency region, thus reducing the redundancy of the model. Experiments on river data sets showed that the pixel accuracy, intersection over union and recall rate of the pre-trained network can reach 90.4%, 68.6% and 83.2%, respectively, when 360 labeled datas was used to fine-tune the network, which was comparable to the performance of the supervised AFR-LinkNet network trained with 1200 labeled datas. After fine-tuning with all data labels, the pixel accuracy, intersection ratio and recall rate of the network reached 93.7%, 73.2% and 88.5%, respectively. Compared with AFR-LinkNet, DeepLabv3+, LinkNet, ResNet50 and UNet networks, the pixel accuracy was increased by 3.1, 7.6, 12.3, 14.9, 19.8 percentage points, the intersection over union was increased by 3.5, 8.7, 10.5, 16.9, 24.0 percentage points, and the recall rate was increased by 2.1, 4.8, 6.7, 9.4, 12.9 percentage points, respectively. The effectiveness of the model to accurately extract rivers from river images was verified. This algorithm model was of great significance for solving the lack of a large number of labeled data and analyzing the distribution of rivers in cold and arid regions, water disaster warning, rational utilization of water resources and agricultural irrigation development.

参考文献

相似文献

引证文献

引用本文

沈瑜,王海龙,梁栋,牛东兴,严源,李阳阳.基于自监督对比学习的寒旱区遥感图像河流识别方法[J].农业机械学报,2023,54(6):125-135. SHEN Yu, WANG Hailong, LIANG Dong, NIU Dongxing, YAN Yuan, LI Yangyang. River Extraction Method from Remote Sensing Images of Cold and Arid Regions Based on Self-supervised Comparative Learning[J]. Transactions of the Chinese Society for Agricultural Machinery,2023,54(6):125-135.

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2022-10-05
最后修改日期:
录用日期:
在线发布日期: 2022-11-20
出版日期:

期刊浏览

EI收录结果

引用本文

分享

文章指标

历史