基于改进DeepLabV3+的人采机运协作机器人冠下导航路径识别方法
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

中国烟草总公司重点研发计划项目(110202301017)和国家自然科学基金项目(32372592)


Canopy-under Navigation Path Recognition Method for Human-machine Collaborative Harvesting Robots Based on Improved DeepLabV3+
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    冠下导航是人采机运协作机器人自主转运的关键技术。针对冠下导航路径识别算法在作物叶片遮挡、杂草干扰和植株空间异质性等复杂环境下适应性差、准确性与实时性难以兼顾的问题,本研究提出一种基于垄间图像语义分割模型EME-Net的冠下导航路径识别方法。在DeepLabV3+结构基础上进行轻量化设计,采用融合高效通道注意力(Efficient channel attention,ECA)的MobileNetV2(命名EMNet)代替主干网络Xception,在显著提高检测速度和实时性的同时提升模型在干扰下捕捉垄间路径关键特征的能力。同时,在模型中引入金字塔分割注意力机制(Pyramid split attention,PSA)增强多尺度特征融合能力,提升被遮挡路径边界表征精度。在EMNet输出端和末端嵌入ECA机制,过滤垄作农田图像中不相关的特征,提升特征利用率。针对前景与背景比例不均衡导致精度下降问题,设计了基于BCELoss和DiceLoss的鲁棒损失函数BCE_DiceLoss,有效改善路径分割结果稳定性。基于EME-Net模型输出的可行驶区域掩码,利用最小二乘法重塑分割区域边缘点并提取垄间导航线。模型评估结果显示,EME-Net平均像素准确率和平均交并比分别为91.3%和88.9%,较基线模型DeepLabV3+分别提升7.9、6.1个百分点,检测速度为29.5 f/s,整体优于PSPNet、U-Net、HRNet、Segformer等主流分割模型。烟叶采收作业场景试验结果表明,模型在冠下不同复杂环境下均能有效实现导航线检测,航偏角均值为1.45°~3.80°、平均像素横向距离为1.46~3.68像素。提出的冠下导航路径识别方法能满足垄间导航任务的实际需求,可为人采机运协作机器人自主转运作业提供技术支撑。

    Abstract:

    Canopy-under navigation path recognition in tobacco fields is often hindered by leaf and weed occlusion, as well as significant variations in plant morphology, presenting challenges for the autonomous operation of human-machine collaborative harvesting robots. To address these issues, a novel canopy-under navigation path recognition method was proposed based on the EME-Net, an inter-row image semantic segmentation model. Built upon the DeepLabV3+ architecture, EME-Net featured an encoder that employed the ECA-MobileNetV2 (dubbed EMNet) to replace the original Xception backbone for efficient feature extraction, enabling the model to effectively capture key inter-row path features. The pyramid split attention (PSA) multi-scale feature fusion mechanism was introduced to enhance the representation of inter-row boundary features, particularly under occlusion. Additionally, the ECA mechanism, embedded at both the output and terminal of EMNet, filtered out irrelevant features from tobacco field images, improving feature utilization without reducing the number of channels. To mitigate accuracy degradation caused by foreground-background imbalance, a robust BCE_DiceLoss function was proposed, combining binary cross entropy (BCE) Loss and DiceLoss. Based on the autonomous traversal region masks outputed by EME-Net, the least squares method was used to reshape edge points and extract inter-row navigation lines. Experimental results showed that EME-Net achieved a mean pixel accuracy (mPA) of 91.3% and a mean intersection over union (mIoU) of 88.9%, surpassing the baseline model DeepLabV3+ by 7.9 and 6.1 percentage points, respectively. The average detection frame rate reached 29.5 frames per second, outperforming mainstream segmentation models such as PSPNet, U-Net, HRNet, and Segformer. In practical tobacco field navigation path recognition experiments, the proposed method effectively extracted navigation lines in canopy-under areas with varying levels of occlusion. The mean heading deviation ranged from 1.45° to 3.80°, and the average lateral pixel distance varied from 1.46 pixels to 3.68 pixels. This method met the practical requirements of inter-row navigation tasks and provides a reliable technical solution for the autonomous transportation operation of human-machine collaborative harvesting robots.

    参考文献
    相似文献
    引证文献
引用本文

苏锐,高磊,朱振涛,刘骋,陈度.基于改进DeepLabV3+的人采机运协作机器人冠下导航路径识别方法[J].农业机械学报,2026,57(6):1-12. SU Rui, GAO Lei, ZHU Zhentao, LIU Cheng, CHEN Du. Canopy-under Navigation Path Recognition Method for Human-machine Collaborative Harvesting Robots Based on Improved DeepLabV3+[J]. Transactions of the Chinese Society for Agricultural Machinery,2026,57(6):1-12.

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-09-27
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-04-15
  • 出版日期:
文章二维码