基于改进SSD卷积神经网络的苹果定位与分级方法
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

河北省重点研发计划项目(21321902D)


Apple Location and Classification Based on Improved SSD Convolutional Neural Network
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为实现苹果果径与果形快速准确自动化分级,提出了基于改进型SSD卷积神经网络的苹果定位与分级算法。深度图像与两通道图像融合提高苹果分级效率,即对从顶部获取的苹果RGB图像进行通道分离,并提取分离通道中影响苹果识别精度最大的两个通道与基于ZED双目立体相机从苹果顶部获取的苹果部分深度图像进行融合,在融合图像中计算苹果的纵径相关信息,实现了基于顶部融合图像的多个苹果果形分级和信息输出;使用深度可分离卷积模块替换原SSD网络主干特征提取网络中部分标准卷积,实现了网络的轻量化。经过训练的算法在验证集下的识别召回率、精确率、mAP和F1值分别为93.68%、94.89%、98.37%和94.25%。通过对比分析了4种输入层识别精确率的差异,实验结果表明输入层的图像通道组合为DGB时对苹果的识别与分级mAP最高。在使用相同输入层的情况下,比较原SSD、Faster R-CNN与YOLO v5算法在不同果实数目下对苹果的实际识别定位与分级效果,并以mAP为评估值,实验结果表明改进型SSD在密集苹果的mAP与原SSD相当,比Faster R-CNN高1.33个百分点,比YOLO v5高14.23个百分点。并且在不同硬件条件下验证了该算法定位分级效率的优势,单幅图像在GPU下的检测时间为5.71ms,在CPU下的检测时间为15.96ms,检测视频的帧率达到175.17f/s和62.64f/s。该研究可为自动化分级设备在高速环境下精准定位并分级苹果提供理论基础。

    Abstract:

    An apple localization and grading algorithm was proposed based on an improved SSD convolutional neural network to achieve fast and accurate automatic grading of apple fruit diameter and shape. The efficiency of apple grading was improved by improving the input layer of the original SSD network. Channel separation was performed on the color apple image obtained from the top, and the two channels in the separation channel that had the most significant impact on the apple recognition accuracy were extracted. A fused image was composed of the two channels and the apple depth image from the top based on the binocular camera. The longitudinal diameter-related information of the apple was calculated in the fused image. Moreover, multiple apple shape grading and information output based on the fused image were realized through this method. The depthwise-separable convolution module was used to replace part of the standard convolution in the original SSD network backbone feature extraction network, which achieved the light weighting of the network. The recognition recall, accuracy, mAP and F1 values of the trained model under the verification set were 93.68%, 94.89%, 98.37% and 94.25%, respectively. By comparing and analyzing the differences in recognition accuracy among the four input layers, the experimental results showed that the highest recognition and grading mAP for apples was achieved when the image channel combination of the input layer was DGB. The actual recognition localization and grading effects of the original SSD, Faster R-CNN and YOLO v5 algorithms for apples with different numbers of fruits were compared by using the same input layer and evaluated in terms of mAP. The experimental results showed that the improved SSD had a comparable mAP to the original SSD for dense apples, which was higher than that of Faster R-CNN by 1.33 percentage points and higher than YOLO v5 by 14.23 percentage points. The advantages of the algorithm localization and grading efficiency were verified under different hardware conditions. The detection time of an image was 5.71ms under GPU and 15.96ms under CPU, and the actual frame rate of the detected video reached 175.17f/s and 62.64f/s. The research result can provide a theoretical basis for automated grading equipment to accurately locate and grade apples in a high-speed environment.

    参考文献
    相似文献
    引证文献
引用本文

张立杰,周舒骅,李娜,张延强,陈广毅,高笑.基于改进SSD卷积神经网络的苹果定位与分级方法[J].农业机械学报,2023,54(6):223-232. ZHANG Lijie, ZHOU Shuhua, LI Na, ZHANG Yanqiang, CHEN Guangyi, GAO Xiao. Apple Location and Classification Based on Improved SSD Convolutional Neural Network[J]. Transactions of the Chinese Society for Agricultural Machinery,2023,54(6):223-232.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2022-10-28
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2023-02-14
  • 出版日期: