Abstract: Sugarcane is a globally important crop for both sugar production and bioenergy, and it is widely cultivated in tropical and subtropical regions. Effective disease diagnosis is essential for ensuring agricultural productivity and economic returns. To address the challenges posed by complex field environments (such as uneven lighting), as well as low recognition accuracy and limited detection efficiency, a novel sugarcane disease classification algorithm using a global-local attention mechanism (SDCA-GLAM) was proposed. To enhance model capacity, the linear projection layers in a modified Vision Transformer (ViT) were replaced with deformable convolution modules, enabling adaptive extraction of lesion textures and leaf-edge information. Re-parameterized convolution was incorporated to strengthen spatial positional encoding, and deep convolutional modules were embedded in the multilayer perceptron to extract high-dimensional semantic features. To improve both accuracy and efficiency, a parallel global-local self-attention architecture was designed: the local branch leveraged window attention to refine fine-grained textures, while the global branch reduced the spatial dimensions of the key/value vectors via pooling and aggregated critical region information using a hyperparameter α. Finally, LayerNorm was replaced with BatchNorm to reduce the memory and time overhead caused by frequent reshaping. Experimental results on an 11-class sugarcane leaf dataset demonstrated that SDCA-GLAM achieved an accuracy of 88.26%, a throughput of 1,620 images per second, and a model size of 2.76×10^7 parameters. The proposed method outperformed mainstream models in both accuracy and efficiency, making it suitable for real-time mobile deployment in field diagnosis of sugarcane conditions.
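The parallel global-local attention described above can be illustrated with a minimal single-head sketch. This is not the paper's implementation: it uses NumPy, omits the learned Q/K/V projections (Q = K = V = the input feature map), and assumes that α linearly blends the two branch outputs, which is only one plausible reading of the hyperparameter's role. The window and pool sizes are placeholder values.

```python
import numpy as np

def softmax(z, axis=-1):
    # Numerically stable softmax
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v):
    # Scaled dot-product attention: q (n, d), k/v (m, d) -> (n, d)
    scores = q @ k.T / np.sqrt(q.shape[-1])
    return softmax(scores) @ v

def global_local_attention(x, window=4, pool=4, alpha=0.5):
    """Hypothetical sketch: x is an (H, W, d) feature map.

    Local branch: attention restricted to non-overlapping window x window
    patches (fine-grained textures). Global branch: keys/values are
    average-pooled, shrinking the attention map (coarse context).
    """
    H, W, d = x.shape

    # --- local branch: windowed self-attention ---
    local = np.empty_like(x)
    for i in range(0, H, window):
        for j in range(0, W, window):
            patch = x[i:i + window, j:j + window].reshape(-1, d)
            out = attention(patch, patch, patch)
            local[i:i + window, j:j + window] = out.reshape(window, window, d)

    # --- global branch: pool K/V spatially before attending ---
    kv = x.reshape(H // pool, pool, W // pool, pool, d).mean(axis=(1, 3))
    kv = kv.reshape(-1, d)                       # (H*W / pool^2, d)
    glob = attention(x.reshape(-1, d), kv, kv).reshape(H, W, d)

    # alpha blends the branches (assumed role of the paper's hyperparameter)
    return alpha * glob + (1 - alpha) * local
```

Pooling the keys/values reduces the attention matrix from (H·W)×(H·W) to (H·W)×(H·W/pool²), which is the source of the efficiency gain the abstract attributes to the global branch.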