基于局部图像特征聚合的温室场景识别方法

doi:10.6041/j.issn.1000-1298.2025.02.045

首页 > 过刊浏览>2025年第56卷第2期 >485-494. DOI:10.6041/j.issn.1000-1298.2025.02.045

基于局部图像特征聚合的温室场景识别方法
DOI:
                        10.6041/j.issn.1000-1298.2025.02.045
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家重点研发计划项目(2021YFD1500204、2023YFD1501303)

Greenhouse Scene Recognition Method Based on Local Image Feature Aggregation

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

场景识别可作为温室环境空间定位的替代方案,也是智能农机装备视觉系统的重要功能之一。针对以特征聚类为基础的场景识别范式无法适应高动态变化且高度相似的温室场景识别的问题,提出一种基于深度特征聚合的温室场景识别方法,以预训练的视觉Transformer网络为基础,提取场景图像局部特征,应用多层感知机全局感受野特性,考虑局部特征空间关系,融合图像局部特征,生成场景图像全局描述子,以多重相似性损失最小化为优化目标,构建温室场景识别模型。试验结果表明,模型场景识别R@1(top-1召回率)、R@5和R@10分别达到78.43%、89.21%和92.47%,具有较高的场景识别精度。所提出的基于多层感知机的特征混合方法是有效的,与采用池化操作进行特征聚合相比,R@1提高8.01个百分点。模型对光照条件变化具有一定的鲁棒性,与正常的中等光照条件相比,强光及弱光条件下,R@1下降未超过4.00个百分点。相机视角及采样距离的变化也会影响模型识别性能,20°以内的视角变化,R@1下降6.61个百分点,2倍以内的距离变化,R@1下降17.87个百分点。与现有场景识别基准方法NetVLAD、GeM、Patch-NetVLAD、MultiRes-NetVLAD和MixVPR相比,R@1分别提高7.82、6.59、3.56、4.14、1.88个百分点,在温室场景识别任务上模型性能有较大提升。该研究构建的基于多层感知机的图像全局特征聚合方法,能够生成可靠的全局描述子,用于温室场景识别,且具有一定的光照、视角、距离及时间变化的鲁棒性,研究结果可为智能农机视觉系统设计提供技术参考。

Abstract:

Scene recognition could be used as an alternative for spatial positioning in greenhouse environments, and it was also one of the important functions of the visual system of intelligent agricultural machinery equipment. Addressing the issue that scene recognition paradigms based on feature clustering could not adapt to the recognition of greenhouse scenes with high dynamic changes and high similarity, a greenhouse scene recognition method based on deep feature aggregation was proposed. This method, grounded on a pre-trained visual transformer network, extracted local features from scene images. It applied the global receptive field characteristics of multi-layer perceptron, took into account the spatial relationships of local features, fused the local features of the images, and generated global descriptors for the scene images. With the goal of minimizing multi-similarity loss as the optimization objective, a greenhouse scene recognition model was constructed. The test results indicated that the R@ 1 ( top 1 recall rate), R @ 5, and R @ 10 of the model’s scene recognition reached 78.43% , 89.21% , and 92.47% , respectively, and it possessed high scene recognition accuracy. The proposed feature mixing method based on multi-layer perceptron was proven effective, with an improvement of 8.01 percentages in R@ 1 compared with that of feature aggregation using pooling operations. The model demonstrated a certain robustness to changes in lighting conditions, with the R@ 1 metric decreased by no more than 4.00 percentages under strong and weak lighting conditions compared with that under normal medium lighting conditions. Changes in camera angle and sampling distance also impacted the model’s recognition performance, with a decline of 6.61 percentages for angle changes within 20 degrees, and a drop of 17.87 percentages for distance changes within twice the original distance. Compared with the existing scene recognition benchmark methods, including NetVLAD, GeM, Patch-NetVLAD, MultiRes-NetVLAD, and MixVPR, the R@ 1 of proposed model was improved by 7.82, 6.59, 3.56, 4.14, and 1.88 percentages, respectively, demonstrating a significant performance enhancement on the greenhouse scene recognition task. The image global feature aggregation method based on multi-layer perceptron constructed was able to generate reliable global descriptors for greenhouse scene recognition, and exhibited robustness to changes in lighting, viewpoint, distance, and time. The research findings would provide technical references for the design of visual systems for intelligent agricultural machinery.

参考文献

相似文献

引证文献

引用本文

于美玲,周云成,侯玉涵,刘峻渟.基于局部图像特征聚合的温室场景识别方法[J].农业机械学报,2025,56(2):485-494. YU Meiling, ZHOU Yuncheng, HOU Yuhan, LIU Junting. Greenhouse Scene Recognition Method Based on Local Image Feature Aggregation[J]. Transactions of the Chinese Society for Agricultural Machinery,2025,56(2):485-494.

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-09-18
最后修改日期:
录用日期:
在线发布日期: 2025-02-10
出版日期:

期刊浏览

EI收录结果

引用本文

相关视频

分享

文章指标

历史

文章二维码