植物领域知识图谱构建中本体非分类关系提取方法
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家自然科学基金项目(61503386)


Research on Ontology Non-taxonomic Relations Extraction in Plant Domain Knowledge Graph Construction
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    采用本体学习的方法,以百度百科植物类词条内容的非结构和半结构化中文文本信息作为语料进行处理。使用一种有指导的基于依存句法分析的词汇-语法模式来获取植物领域的概念、分类和非分类关系,并分别利用基于词表过滤的方法和给模式添加限制的方法,较大程度地提高了关系抽取的精确度,完成在轻量级本体的基础上自动构建重量级本体。该方法建立了一个特定领域语料的概念层次,提高了最具代表性的分类和非分类关系的发现,并使用OWL语言形式化表达抽取结果。实验表明,该方法在非分类关系抽取上取得了较好的结果,为该领域知识图谱构建奠定了基础。

    Abstract:

    In order to provide more specific knowledge and technology of plant field, the main task of KG (knowledge graph) is to extract a wealth of concepts and relationships. Due to the relation extraction is the most difficult in KG construction, this paper makes use of ontology learning, and proposes a nontaxonomic relation learning method to obtain representative concepts and their relations from unstructured and semistructured texts of Baidu Encyclopedia entry content by using lexiconsyntactic patterns based on dependency grammar analysis. Moreover, the methods of adding constraint models and words filtering were adopted to build heavy weight ontology automatically based on a lightweight ontology and greatly improved the precision of the relation extraction. The approach established a concept structure from the plant domain corpus, ameliorated the discovery of the most representative non-taxonomic relation, and formalized them in the standardized OWL 2.0. A set of experiments was performed using the approach implemented in the plant domain. The results indicated that extraction by patterns should be performed directly after natural language processing, which has a comparatively high accuracy compared to the former algorithms, and this approach can extract non-taxonomic relations with high effectiveness, which lays the foundation for KG construction of plant field.

    参考文献
    相似文献
    引证文献
引用本文

赵明,杜亚茹,杜会芳,张家军,王红说,陈瑛.植物领域知识图谱构建中本体非分类关系提取方法[J].农业机械学报,2016,47(9):278-284.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2016-03-09
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2016-09-10
  • 出版日期: 2016-09-10