基于多头自注意力机制的茶叶采摘点语义分割算法
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家重点研发计划项目(2021YFD1601102)


Semantic Segmentation Algorithm Based Multi-headed Self-attention for Tea Picking Points
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    茶叶采摘点定位是茶叶选择性采摘的关键技术之一,在茶树采摘场景中,存在采摘点尺度小、背景干扰大、光照情况复杂等问题,导致准确分割茶叶采摘点成为难题。本研究针对茶园场景下采摘点精确分割问题,构建了一种基于多头自注意力机制结合多尺度特征融合的语义分割算法——RMHSA-NeXt。首先使用ConvNeXt卷积神经网络提取图像特征;其次构造基于残差和多头自注意力机制的注意力模块,将模型注意力集中于分割目标,增强重要特征的表达;再次通过多尺度结构(Atrous spatial pyramid pooling, ASPP)将不同尺度的特征进行融合,在其中针对采摘点特性,在融合过程中使用条状池化(Strip pooling),减少无用特征的获取;最后通过卷积以及上采样等操作完成信息的解码,得出分割结果。试验表明,茶园环境下该模型可以对采摘点进行有效分割,模型的像素准确率达75.20%,平均区域重合度为70.78%,运行速度达到8.97f/s。基于相同测试集将本文模型与HRNet V2、EfficientUNet++、DeeplabV3+、BiSeNet V2模型进行对比,结果表明相比于其他模型同时具有准确性高、推理速度快、参数量小等优点,能够较好地平衡精度与速度指标。本文的研究成果可以为精准定位茶叶采摘点提供有效可靠的参考。

    Abstract:

    Tea picking point localization is one of the key technologies for selective tea picking. In the tea tree picking scenario, there are problems such as small scale of picking points, large background interference and complex lighting conditions, which lead to the problem of accurate segmentation of tea picking points. A semantic segmentation model based on multi-headed self-attentive mechanism combined with multi-scale feature fusion, RMHSA-NeXt, was constructed for the accurate segmentation of picking points in tea garden scenes. The attention module based on residuals and multi-headed self-attention mechanism was constructed to focus the model’s attention on the segmentation target and enhance the representation of important features. The features at different scales were fused by multi-scale structure (atrous spatial pyramid pooling, ASPP), in which strip pooling was used in the fusion process for the characteristics of picking points to reduce the useless. Finally, the information was decoded by convolution and upsampling, and the segmentation results were obtained. The experiment results showed that the model can segment the picking points effectively in the tea garden environment, and the pixel accuracy of the model reached 75.20%, the average region overlap was 70.78%, and the running speed reached 8.97f/s. The results showed that the model had the advantages of high accuracy, fast inference speed and small number of parameters, which can balance the accuracy and speed indexes well compared with other models. The research results can provide an effective and reliable reference for pinpointing tea picking points.

    参考文献
    相似文献
    引证文献
引用本文

宋彦,杨帅,郑子秋,宁井铭.基于多头自注意力机制的茶叶采摘点语义分割算法[J].农业机械学报,2023,54(9):297-305. SONG Yan, YANG Shuai, ZHENG Ziqiu, NING Jingming. Semantic Segmentation Algorithm Based Multi-headed Self-attention for Tea Picking Points[J]. Transactions of the Chinese Society for Agricultural Machinery,2023,54(9):297-305.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2023-02-24
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2023-09-10
  • 出版日期: