基于改进YOLO 11模型的棉田地表残膜识别方法研究

doi:10.6041/j.issn.1000-1298.2025.05.002

首页 > 过刊浏览>2025年第56卷第5期 >17-25，48. DOI:10.6041/j.issn.1000-1298.2025.05.002

基于改进YOLO 11模型的棉田地表残膜识别方法研究
DOI:
                        10.6041/j.issn.1000-1298.2025.05.002
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家重点研发计划项目(2022YFD2002400)、兵团科技攻关计划项目(2023AB014、2022DB003)、新疆棉花产业技术体系项目(XJARS-03)和自治区科技支疆计划项目（2024E02016）

Recognition Method of Cotton Field Surface Residual Film Based on Improved YOLO 11

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

为实现残膜回收机在自然环境中快速、准确地识别棉田地表残膜目标，本文提出了一种基于DCA-YOLO 11轻量化模型的棉田地表残膜识别方法。以4JMLE-210型残膜回收机工作后棉田地表残膜为研究对象，在不同时间段采集地表残膜图像900幅，通过透视变换、图像裁剪、数据清洗、数据增强等预处理，最终得到5215幅残膜样本图像，按照4∶1划分为训练集和测试集，实现了对棉田地表残膜的数据集构建。通过在YOLO 11模型主干网络中增加深度可分离卷积（DWConv）模块代替通用卷积（Conv）模块，用于减少计算复杂度和参数量；通过在输出检测端末尾加入CBAM卷积块注意力机制模块来提高模型的感知能力，减少边缘与背景干扰；通过用ADown模块替换主干网络中的Conv模块，实现残膜特征图不同层之间的下采样，减少特征图空间维度，保留关键信息来提高残膜目标检测准确性。试验结果表明，在复杂自然环境下，DCA-YOLO 11模型精确率P为81.9%，召回率R为80.9%，平均精度均值mAP（重叠率0.5）为86.7%，参数量为2.20×106，处理速度为80f/s。通过对不同模型进行对比试验，DCA-YOLO 11模型检测精确率比YOLO v10、YOLO v9、YOLO v8分别高2.9、2.3、3.8个百分点，召回率比YOLO v10、YOLO v9、YOLO v8分别高2.0、1.0、1.8个百分点，处理速度比YOLO v9、YOLO v8分别提升12.7%、14.2%，略低于YOLO v10。DCA-YOLO 11模型在保证精度的同时，模型最小，参数量最少，证明其轻量化与优越性。模型通过泛化性试验，其在验证数据集上的检测结果，R2为0.72，平均绝对误差和均方根误差分别为4.92个和2.72个，提出的DCA-YOLO 11轻量化模型泛化性较好。该研究可为残膜回收机械在复杂环境下精准、高效捡拾残膜以及残膜回收机回收率车载视觉估测提供理论依据与数据基础。

Abstract:

In response to the issue of estimating the recovery rate of residual film in cotton fields by current residual film recovery machines, a lightweight residual film recognition method named DCA-YOLO 11 was proposed, which enabled rapid and accurate identification of residual film on cotton field surfaces in natural environments. Taking the residual film on cotton field surfaces after the operation of the 4JMLE-210 residual film recovery machine as the research object, totally 900 images of residual film were collected at different time periods. Through preprocessing steps such as perspective transformation, image cropping, data cleaning, and data augmentation, a dataset of 5215 residual film sample images was constructed, which was divided into training and test sets at a 4∶1 ratio. To enhance the model’s performance, a depthwise convolution (DWConv) module was added to the backbone network of YOLO 11 to replace a standard convolution (Conv) module, thereby reducing computational complexity and the number of parameters. Additionally, a CBAM attention mechanism module was incorporated at the end of the detection output to improve the model’s perception capability and reduce interference from edges and backgrounds. Furthermore, the ADown module was used to replace the Conv module in the backbone network, enabling downsampling between different layers of the residual film feature maps, reducing the spatial dimensions of the feature maps while retaining key information to improve the accuracy of residual film target detection. Experimental results demonstrated that the DCA-YOLO 11 model achieved a precision (P) of 81.9%, a recall (R) of 80.9%, and a mean average precision (mAP) of 86.7% (at an IoU threshold of 0.5) in complex natural environments. The model has about 2.20 million parameters, and an FPS of 80f/s. Comparative experiments with other models showed that DCA-YOLO 11 outperformed YOLO v10, YOLO v9 and YOLO v8 in precision by 2.9 percentage points, 2.3 percentage points, 3.8 percentage points. In terms of recall, it was improved by 2.0 percentage points, 1.0 percentage points, and 1.8 percentage points compared with that of YOLO v10, YOLO v9, and YOLO v8, respectively. While its processing speed was slightly lower than than that of YOLO v10, and it surpassed YOLO v9 and YOLO v8 by 12.7% and 14.2%. DCA-YOLO 11 achieved the smallest model size and the fewest parameters while maintaining high accuracy, demonstrating its lightweight design and superiority. Through generalization test, the model’s detection results on the validation dataset showed an R2 of 0.72, a mean absolute error (MAE) of 4.92 pcs and a root mean square error (RMSE) of 2.72 pcs, indicating good generalization. The research result can provide a theoretical foundation and data support for the precise and efficient collection of residual film by recovery machinery in complex environments, as well as for the visual estimation of the recovery rate of residual film recovery machines.

参考文献

相似文献

引证文献

引用本文

孟庆建,翟志强,张连朴,吕继东,王虎挺,张若宇.基于改进YOLO 11模型的棉田地表残膜识别方法研究[J].农业机械学报,2025,56(5):17-25，48. MENG Qingjian, ZHAI Zhiqiang, ZHANG Lianpu, Lü Jidong, WANG Huting, ZHANG Ruoyu. Recognition Method of Cotton Field Surface Residual Film Based on Improved YOLO 11[J]. Transactions of the Chinese Society for Agricultural Machinery,2025,56(5):17-25，48.

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2025-03-10
最后修改日期:
录用日期:
在线发布日期: 2025-05-10
出版日期:

期刊浏览

EI收录结果

引用本文

分享

文章指标

历史

文章二维码