基于注意力池化和堆叠式结构的病虫害文献识别模型
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家重点研发计划项目(2016YFD0300710)


Diseases and Pests Articles Identification Model Based on Attention Pooling and Stacked Structure
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为解决病虫害文献识别过程中存在语义特征学习不够、上下文信息不能充分利用等问题,以病虫害相关文献摘要为研究对象,提出一种基于注意力池化策略和堆叠式双向长短期记忆(Bi-directional long-short term memory, BiLSTM)的神经网络模型(AP-LSTM)。该模型采用堆叠式长短期记忆结构,提高了对语义特征的学习能力,在进行堆叠操作时,通过将输入向量与输出向量拼接,进一步加强了对语义信息的表征;然后采用基于注意力机制的池化策略为不同的词分配不同权重,使模型在抓住重点的同时能够充分利用上下文信息。本文在包含1439条正例、1061条负例的自标注数据集上进行了实验,所提出的AP-LSTM模型在该数据集上的精确率、召回率、〖JP2〗F1值和准确率分别为92.67%、97.20%、94.88%和94.00%,实验结果表明,AP-LSTM模型能够有效识别病虫害文献。

    Abstract:

    Diseases and pests articles identification is an important pre-task of natural language processing in the field of diseases and pests. It is of great significance to develop a fast and accurate method for diseases and pests articles identification. In order to solve the problems of insufficient learning of semantic features and insufficient use of context information in the process of diseases and pests articles identification, a neural network model of attention pooling based bi-directional long-short term memory (AP-LSTM) was proposed, which was based on attention pooling strategy and bi-directional long-short term memory (BiLSTM). The model adopted the stacked LSTM structure, which improved the learning ability of semantic features. In the stacking operation, the input vector and output vector were concatenated to further enhance the representation of semantic information. Then, a pooling strategy based on the attention mechanism was used to assign different weights to different words, so that the model can make full use of context information while grasping the keywords. The experiments were carried out on a self annotated dataset with 2500 labeled samples, including 1439 positive cases and 1061 negative cases. The precision, recall, F1 score, and accuracy of the proposed AP-LSTM model on the dataset were 92.67%, 97.20%, 94.88%, and 94.00%, respectively. The experimental results showed that the proposed AP-LSTM model can effectively identify pest literature.

    参考文献
    相似文献
    引证文献
引用本文

唐 詹,柏 召,刁 磊,郭旭超,周 晗,李 林.基于注意力池化和堆叠式结构的病虫害文献识别模型[J].农业机械学报,2021,52(S0):178-184. TANG Zhan, BAI Zhao, DIAO Lei, GUO Xuchao, ZHOU Han, LI Lin. Diseases and Pests Articles Identification Model Based on Attention Pooling and Stacked Structure[J]. Transactions of the Chinese Society for Agricultural Machinery,2021,52(S0):178-184.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2021-07-12
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2021-11-10
  • 出版日期: 2021-12-10