基于双金字塔网络的RGB-D群猪图像分割方法
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家重点研发计划项目(2016YFD0500506)、中央高校自主创新基金项目(2662018JC003、2662018JC010、2662017JC028)和现代农业产业技术体系项目(CARS-35)


RGB-D Segmentation Method for Group Piglets Images Based on Double-pyramid Network
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为实现群养猪的视觉追踪和行为监测,针对猪舍中仔猪因拥挤堆叠等习性而导致的目标个体粘连、图像分割困难问题,提出基于双金字塔网络的RGB-D群猪图像分割方法。该方法基于实例分割Mask R-CNN框架,在特征提取网络(ResNet101)基础上改进成双金字塔特征提取网络。RGB图像和Depth图像分别提取特征后进行融合,输入区域生成网络得到预选锚(ROI)和共享特征输入Head网络,通过类别、回归和掩模3个分支,输出检测目标的位置和分类结果,实现猪舍场景下群养仔猪粘连区域的有效个体分割。网络模型训练采用2000组图像样本,按照4∶1比例随机划分训练集和验证集。试验结果表明,双金字塔网络(Feature pyramid networks,FPN)能有效解决颜色相近、个体相似的群猪粘连问题,实现单个仔猪区域的完整分割,分割准确率达89.25%,训练GPU占有率为77.57%,与Mask R-CNN和PigNet网络分割结果相比,分割准确率和分割速度均有较大提高。双金字塔网络模型对于多种行为状态、不同粘连程度的群猪图像中个体分割都取得了良好效果,模型泛化性和鲁棒性较好,为群养猪的个体自动追踪提供了新的途径。

    Abstract:

    Aiming to achieve automatic individual pig’s tracking and monitoring in pig group, an RGB-D image segmentation method based on the double-pyramid network was proposed to solve the segmentation difficulties caused by overlaps and adhesion body areas which were frequently exiting in images because of habits of huddle and crowd in piglets. The method was based on an instance segmentation network Mask R-CNN, modifying its feature extraction network, ResNet101, to a double-pyramid structure. Features were extracted from RGB and Depth images and combined to be inputted into a regional generation network. The network outputted regions of interest (ROI). The combined features and ROIs were then inputted into a head network, which included the classifications and regression and mask branches and outputted the locations of pigs and results of classification. Eventually, the individual pigs were segmented from images according to the outputs. The double-pyramid network was trained using 2000 groups of images, splitting to a training set and a validation set in a ratio of 4∶1 randomly. Experimental results showed that the double-pyramid network (Feature pyramid networks, FPN) can effectively address the segmentation for group pig images of adhesive pigs, and acquire the complete individual pig areas, the segmentation accuracy rate was up to 8925%. During the training process, the GPU used rate was lower to 7757%, the FPN outperformed the Mask R-CNN and PigNet networks both in the segmentation accuracy rate and running speed. The double-pyramid network represented its generalization and robustness on the segmentation for multi-behaviors and diversified adhesions in pig group images, which provided a new approach to automatically track individual pig in group pigs.

    参考文献
    相似文献
    引证文献
引用本文

高云,廖慧敏,黎煊,雷明刚,余梅,李小平.基于双金字塔网络的RGB-D群猪图像分割方法[J].农业机械学报,2020,51(7):36-43. GAO Yun, LIAO Huimin, LI Xuan, LEI Minggang, YU Mei, LI Xiaoping. RGB-D Segmentation Method for Group Piglets Images Based on Double-pyramid Network[J]. Transactions of the Chinese Society for Agricultural Machinery,2020,51(7):36-43.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2019-10-28
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2020-07-10
  • 出版日期: