基于深度学习的群猪图像实例分割方法
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家重点研发计划项目(2016YFD0500506)和中央高校自主创新基金项目(2662018JC003、2662018JC010、2662017JC028)


Instance-level Segmentation Method for Group Pig Images Based on Deep Learning
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    群养饲喂模式下猪群有聚集在一起的习性,特别是躺卧时,当使用机器视觉跟踪监测猪只时,图像中存在猪体粘连,导致分割困难,成为实现群猪视觉追踪和监测的瓶颈。根据实例分割原理,把猪群中的猪只看作一个实例,在深度卷积神经网络基础上建立PigNet网络,对群猪图像尤其是对粘连猪体进行实例分割,实现独立猪体的分辨和定位。PigNet网络采用44层卷积层作为主干网络,经区域候选网络(Region proposal networks,RPN)提取感兴趣区域(ROI),并和主干网络前向传播的特征图共享给感兴趣区域对齐层(Region of interest align,ROIAlign),分支通过双线性插值计算目标空间,三分支并行输出ROI目标的类别、回归框和掩模。Mask分支采用平均二值交叉熵损失函数计算独立猪体的目标掩模损失。连续28d采集6头9.6kg左右大白仔猪图像,抽取前7d内各不同时段、不同行为模式群养猪图像2500幅作为训练集和验证集,训练集和验证集的比例为4∶1。结果表明,PigNet网络模型在训练集上总分割准确率达86.15%,在验证集上准确率达85.40%。本文算法对不同形态、粘连严重的群猪图像能够准确分割出独立的猪个体目标。将本文算法与Mask R-CNN模型及其改进模型进行对比,准确率比Mask R-CNN模型高11.40个百分点,单幅图像处理时间为2.12s,比Mask R-CNN模型短30ms。

    Abstract:

    With the development of intelligence and automation technology, people pay more attention to use it to monitor pig welfare and health in modern pig industry. Since the behaviors of group pigs present their healthy status, it is necessary to detect and monitor behaviors of group pigs. At present, machine vision technology with advantages of low price, easy installation, noninvasion and mature algorithm has been preferentially utilized to monitor pigs’ behaviors, such as drinking, eating, farrowing behavior of sow, and detect some of pigs’ physiological indices, such as lean yield rate. Feeding pigs at group level was used the most in intensive pig farms. Owing to normally happened huddled pigs showing in grouppig images, it was challenging to utilize traditional machine vision technique to monitor the behaviors of group pigs through separating adhesive pig areas. Thus a new segmentation method was introduced based on deep convolution neural network to separate adhesive pig areas in grouppig images. A PigNet network was built to solute the problem of separating adhesive pig areas in grouppig images. Main part of the PigNet network was established on the structure of the Mask R-CNN network. The Mask R-CNN network was a deep convolution neural network, which had a backbone network with a branch of FCN from classification layer and regression layer to mask the region of interest. The PigNet network used 44 convolutional layers of backbone network of Mask R-CNN network as its main network. After the main network, the output feature image was fed to the next four convolutional layers with different convolution kernels, which was the resting part of the main network and produced binary mask for each pig area. As well, the output feature image was fed into two branches, one was the region proposal networks (RPN), the other was region of interest align (ROIAlign) processing. The first branch outputted the regions of interest, and then the second one aligned each pig area and produced class of the whole pig area and the background area and bounding boxes of each pig regions. A binary cross entropy loss function was utilized to calculate the loss of each mask to correct the class layer and the location of ROIs. Here, the ROIAlign was used to align the candidate region and convolution characteristics through the bilinear difference, and which would not lose information by quantization, making the segmentation more accurate, and FCN of the mask branch used average binary cross entropy as the loss function to process each mask, which avoided the competition among pig masks. After all, the ROI was labeled with different colors. Totally 2000 images captured from previous five days of a 28day experiments were taken as the training set, and 500 images from the next 6th to 7th day were validation set. The results showed that the accuracy of the PigNet on training set was 86.15% and on validation set was 85.40%. The accuracies on each data set were very close, which showed that the model had effective generalization performance and high precision. The cooperation between the PigNet, Mask R-CNN (ResNet101-FPN) and its improvement showed the PigNet surpassed the other two algorithms in accuracy. Meanwhile, the PigNet run faster than the Mask R-CNN. However, the times of three algorithms spent on 500 samples of the validation set were similar. The algorithm can be used to separate individual pig from grouppig images with different behaviors and severe adhesion situation. The PigNet network model adopted the GPU operation mode, and used the three branches of class, regression box and mask to compute parallel processing time, which made the processing time of single image quick, only 2.12s. To a certain degree, the PigNet could reduce convolution parameters and simplify the network structure. The research provided a new segmentation method for adhesive grouppig images, which would increase the possibility of grouppig tracing and monitoring.

    参考文献
    相似文献
    引证文献
引用本文

高云,郭继亮,黎煊,雷明刚,卢军,童宇.基于深度学习的群猪图像实例分割方法[J].农业机械学报,2019,50(4):179-187.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2018-10-17
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2019-04-10
  • 出版日期: