Joint Extraction Model of Multi-entity Relations for Poultry Diagnosis and Treatment Text

Fund Project:

National Key Research and Development Program of China (2016YFD0300607)

Abstract:

Traditional entity-relation extraction methods struggle to fuse subject features with sentence vectors, and the existing BIO annotation strategy cannot handle overlapping relations effectively. To address both problems, a joint entity-relation extraction model for poultry disease diagnosis and treatment text (JEER_PD), based on BERT and dual-pointer labeling, was proposed. JEER_PD uses a dual-pointer labeling (DPL) strategy that builds two pointer taggers, one for heads and one for tails, to mark the start and end positions of all entities in a single pass. It introduces a conditional layer normalization (CLN) layer to strengthen the connection between the subject extraction task and the joint object-relation extraction task, and it applies a probability balance strategy (PBS) to counter the imbalance between positive and negative labels and thereby accelerate model convergence. Experimental results showed that the precision, recall and F1 value of JEER_PD were 97.69%, 97.59% and 97.64%, respectively. All three indicators improved significantly over existing methods, indicating that JEER_PD can quickly and accurately extract entity-relation triples from complex knowledge text on the diagnosis and treatment of poultry diseases.
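The dual-pointer labeling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state how heads and tails are paired at decoding time, so the nearest-following-tail rule below is an assumption borrowed from common pointer-tagging practice, and the function names, the 0.5 threshold, and the example sentence are all hypothetical.

```python
from typing import List, Sequence, Tuple

Span = Tuple[int, int]  # (start, end) token indices, both inclusive


def encode_dual_pointers(n_tokens: int, spans: List[Span]) -> Tuple[List[int], List[int]]:
    """Build the head and tail 0/1 label sequences for a list of entity spans."""
    head = [0] * n_tokens
    tail = [0] * n_tokens
    for start, end in spans:
        head[start] = 1  # head pointer marks where an entity begins
        tail[end] = 1    # tail pointer marks where an entity ends
    return head, tail


def decode_dual_pointers(
    head_probs: Sequence[float],
    tail_probs: Sequence[float],
    threshold: float = 0.5,  # assumed cut-off, not taken from the paper
) -> List[Span]:
    """Pair each predicted head with the nearest predicted tail at or after it."""
    heads = [i for i, p in enumerate(head_probs) if p > threshold]
    tails = [i for i, p in enumerate(tail_probs) if p > threshold]
    spans: List[Span] = []
    for h in heads:
        following = [t for t in tails if t >= h]
        if following:  # a head with no tail after it is dropped
            spans.append((h, following[0]))
    return spans


# Hypothetical example sentence with three entities.
tokens = ["Newcastle", "disease", "causes", "dyspnea", "and", "green", "diarrhea"]
gold_spans = [(0, 1), (3, 3), (5, 6)]

head_labels, tail_labels = encode_dual_pointers(len(tokens), gold_spans)
decoded = decode_dual_pointers(head_labels, tail_labels)
entities = [" ".join(tokens[s : e + 1]) for s, e in decoded]
```

Because both label sequences cover every token, all entity boundaries in the sentence are marked in one pass, which is the property the abstract attributes to DPL. The sketch covers span tagging only; the subject-conditioned object and relation tagging that the abstract describes is not modeled here.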

Cite this article

HU Bin, TANG Baohu, JIANG Haiyan, HUO Ao, HAN Wenxiao. Joint Extraction Model of Multi-entity Relations for Poultry Diagnosis and Treatment Text[J]. Transactions of the Chinese Society for Agricultural Machinery, 2021, 52(6): 268-276.

History
  • Received: 2020-09-02
  • Published online: 2021-06-10
  • Publication date: 2021-06-10