基于语义分割的食品标签文本检测
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家重点研发计划项目(2018YFC1603305、2018YFC1603302)


Text Detection of Food Labels Based on Semantic Segmentation
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    食品包装上的标签文本含有生产日期、营养成分、生产厂家等食品相关信息,这些不仅为消费者购买食品提供了重要依据,也有助于食品监督抽检机构发现潜在的食品安全问题。食品标签文本检测是食品标签自动识别的前提,有助于降低人工录入成本、提高数据处理效率。基于食品包装图像构建数据集,提出了一种基于语义分割的距离场模型,以检测食品标签。该模型包含像素分类和距离场回归两类任务,其中像素分类任务分割处理图像中的文本区域,距离场回归任务预测文本区域内的像素点到该区域边界的归一化距离。为提升模型的检测性能,在回归预测模块中通过增加注意力模块优化模型结构,并针对距离场回归任务损失值过小、影响模型训练优化问题对其损失函数进行了改进。消融实验结果表明,增加注意力模块和损失函数的改进使得模型的准确率分别提高了4.39、3.80个百分点,有效提高了检测准确率。食品包装图像数据集的对比实验表明,采用本文模型检测食品标签文本具有较好的性能,其召回率、准确率分别达到87.61%、76.50%。

    Abstract:

    The label texts on food package include some information like production date, nutrition facts and production corporation etc. The information provides important foundation for consumers to buy food. It also can help the food supervision and inspection administrations to discover the potential problems of food safety. Food label detection is the groundwork of food label recognition. It can help to decrease the heavy workload of manual inputting and advance efficiency of data processing. The dataset of food label was constructed firstly, and then a semantic segmentation based distance field model (DFM) was proposed. In DFM two tasks were included: pixel classification and distance field regression. The pixel classification task was used to segment the text from background regions, and the distance field regression task was used to predict the normalized distance from the pixel located in the text region to the boundary of text region. For effectively using the correlation of two tasks, an attention module was added into DFM to optimize the model structure. In addition, the loss function was improved to resolve the loss value of the distance field regression as it was too small to train smoothly. The results of ablation experiment showed that the accuracy of the proposed model was increased by 4.39 percentage points and 3.80 percentage points respectively according to the improvement of attention module and loss function. The comparative experiments of different model methods showed that DFM had good performance in detecting the text of food labels, and the recall rate and precision were 87.61% and 76.50%, respectively.

    参考文献
    相似文献
    引证文献
引用本文

田萱,王子亚,王建新.基于语义分割的食品标签文本检测[J].农业机械学报,2020,51(8):336-343. TIAN Xuan, WANG Ziya, WANG Jianxin. Text Detection of Food Labels Based on Semantic Segmentation[J]. Transactions of the Chinese Society for Agricultural Machinery,2020,51(8):336-343.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2020-05-13
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2020-08-10
  • 出版日期: 2020-08-10