基于FE-P2Pnet的无人机小麦图像麦穗计数方法
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

安徽省自然科学基金项目(2208085MC60)、安徽省科学技术厅高校科研计划项目(2023AH050084)和国家自然科学基金项目(62273001、32372632)


Method for Counting Wheat Ears in UAV Images Based on FE-P2Pnet
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对无人机图像背景复杂、小麦密集、麦穗目标较小以及麦穗尺寸不一等问题,提出了一种基于FE-P2Pnet(Feature enhance-point to point)的无人机小麦图像麦穗自动计数方法。对无人机图像进行亮度和对比度增强,增大麦穗目标与背景之间的差异度,减少叶、秆等复杂背景因素的影响。引入了基于点标注的网络P2Pnet作为基线网络,以解决麦穗密集的问题。同时,针对麦穗目标小引起的特征信息较少的问题,在P2Pnet的主干网络VGG16中添加了Triplet模块,将C(通道)、H(高度)和W(宽度)3个维度的信息交互,使得主干网络可以提取更多与目标相关的特征信息;针对麦穗尺寸不一的问题,在FPN(Feature pyramid networks)上增加了FEM(Feature enhancement module)和SE(Squeeze excitation)模块,使得该模块能够更好地处理特征信息和融合多尺度信息;为了更好地对目标进行分类,使用Focal Loss损失函数代替交叉熵损失函数,该损失函数可以对背景和目标的特征信息进行不同的权重加权,进一步突出特征。实验结果表明,在本文所构建的无人机小麦图像数据集(Wheat-ZWF)上,麦穗计数的平均绝对误差(MAE)、均方误差(MSE)和平均精确度(ACC)分别达到3.77、5.13和90.87%,相较于其他目标计数回归方法如MCNN(Multi-column convolutional neural network)、CSRnet(Congested scene recognition network)和WHCNETs (Wheat head counting networks)等,表现最佳。与基线网络P2Pnet相比,MAE和MSE分别降低23.2%和16.6%,ACC提高2.67个百分点。为了进一步验证本文算法的有效性,对采集的其它4种不同品种的小麦(AK1009、AK1401、AK1706和YKM222)进行了实验,实验结果显示,麦穗计数MAE和MSE平均为5.10和6.17,ACC也达到89.69%,表明本文提出的模型具有较好的泛化性能。

    Abstract:

    Ear count is the committed step of wheat yield estimation. With the rapid development of unmanned aerial vehicle (UAV) and computer vision technology, the problem of automatic counting of wheat ears can be solved more quickly and efficiently. An automatic counting method for UAV wheat ear images was proposed based on feature enhance-point to point (FE-P2Pnet) to address issues such as complex background, dense wheat, small wheat ear targets, and varying wheat ear sizes. Firstly, the brightness and contrast of the UAV image were enhanced to increase the difference between the wheat ear target and the background, and the influence of complex background factors such as leaves and stems were reduced. Secondly, a point annotated network P2Pnet was introduced as the baseline network to address the problem of dense wheat ears. At the same time, in response to the problem of limited feature information caused by small wheat ear targets, a Triplet module was added to the backbone network VGG16 of P2Pnet, which interacted with the information of C (channel), H (height), and W (width) dimensions, allowing the backbone network to extract more feature information related to the target. In response to the issue of varying wheat ear sizes, feature enhancement module (FEM) and squeeze excitation (SE) modules were added to feature pyramid networks (FPN), enabling this module to better process feature information and fuse multi-scale information. In order to better classify targets, Focal Loss function instead of cross entropy loss function was used. This loss function can carry out different weights on the background and target feature information to further highlight features. The experimental results showed that the mean absolute error (MAE), mean square error (MSE), and accuracy (ACC) indicators of wheat ear counting on the constructed unmanned aerial vehicle wheat image dataset (Wheat-ZWF) achieved 3.77, 5.13, and 90.87%, respectively. Compared with other target counting regression methods such as MCNN, CSRnet, and WHCNETs, the performance was the best. Compared with the baseline network P2Pnet, the MAE and MSE values were decreased by 23.2% and 16.6% respectively, and the ACC value was increased by 2.67 percentage points. In order to further validate the effectiveness of the algorithm proposed, experiments were conducted on four other different wheat varieties (AK1009, AK1401, AK1706, and YKM222) collected. The experimental results showed that the average MAE and MSE values of wheat ear counting were 5.10 and 6.17, with ACC of 89.69%. This indicated that the proposed model had good generalization performance. The research can provide certain support and assistance for related studies on wheat ear counting.

    参考文献
    相似文献
    引证文献
引用本文

鲍文霞,苏彪彪,胡根生,黄承沛,梁栋.基于FE-P2Pnet的无人机小麦图像麦穗计数方法[J].农业机械学报,2024,55(4):155-164,289. BAO Wenxia, SU Biaobiao, HU Gensheng, HUANG Chengpei, LIANG Dong. Method for Counting Wheat Ears in UAV Images Based on FE-P2Pnet[J]. Transactions of the Chinese Society for Agricultural Machinery,2024,55(4):155-164,289.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2023-08-16
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2024-04-10
  • 出版日期: