赵明,杜亚茹,杜会芳,张家军,王红说,陈瑛.植物领域知识图谱构建中本体非分类关系提取方法[J].农业机械学报,2016,47(9):278-284.
Zhao Ming,Du Yaru,Du Huifang,Zhang Jiajun,Wang Hongshuo,Chen Ying.Research on Ontology Non-taxonomic Relations Extraction in Plant Domain Knowledge Graph Construction[J].Transactions of the Chinese Society for Agricultural Machinery,2016,47(9):278-284.
摘要点击次数: 945
全文下载次数: 819
植物领域知识图谱构建中本体非分类关系提取方法   [下载全文]
Research on Ontology Non-taxonomic Relations Extraction in Plant Domain Knowledge Graph Construction   [Download Pdf][in English]
投稿时间:2016-03-09  
DOI:10.6041/j.issn.1000-1298.2016.09.038
中文关键词:  植物领域本体  知识图谱  非分类关系  本体学习  百度百科
基金项目:国家自然科学基金项目(61503386)
作者单位
赵明 中国农业大学 
杜亚茹 中国农业大学 
杜会芳 中国农业大学 
张家军 中国农业大学 
王红说 中国农业大学 
陈瑛 中国农业大学 
中文摘要:采用本体学习的方法,以百度百科植物类词条内容的非结构和半结构化中文文本信息作为语料进行处理。使用一种有指导的基于依存句法分析的词汇-语法模式来获取植物领域的概念、分类和非分类关系,并分别利用基于词表过滤的方法和给模式添加限制的方法,较大程度地提高了关系抽取的精确度,完成在轻量级本体的基础上自动构建重量级本体。该方法建立了一个特定领域语料的概念层次,提高了最具代表性的分类和非分类关系的发现,并使用OWL语言形式化表达抽取结果。实验表明,该方法在非分类关系抽取上取得了较好的结果,为该领域知识图谱构建奠定了基础。
Zhao Ming  Du Yaru  Du Huifang  Zhang Jiajun  Wang Hongshuo  Chen Ying
China Agricultural University,China Agricultural University,China Agricultural University,China Agricultural University,China Agricultural University and China Agricultural University
Key Words:plant domain ontology  knowledge graph  non taxonomic relation  ontology learning  Baidu Encyclopedia
Abstract:In order to provide more specific knowledge and technology of plant field, the main task of KG (knowledge graph) is to extract a wealth of concepts and relationships. Due to the relation extraction is the most difficult in KG construction, this paper makes use of ontology learning, and proposes a non taxonomic relation learning method to obtain representative concepts and their relations from unstructured and semi structured texts of Baidu Encyclopedia entry content by using lexicon syntactic patterns based on dependency grammar analysis. Moreover, the methods of adding constraint models and words filtering were adopted to build heavy weight ontology automatically based on a lightweight ontology and greatly improved the precision of the relation extraction. The approach established a concept structure from the plant domain corpus, ameliorated the discovery of the most representative non-taxonomic relation, and formalized them in the standardized OWL 2.0. A set of experiments was performed using the approach implemented in the plant domain. The results indicated that extraction by patterns should be performed directly after natural language processing, which has a comparatively high accuracy compared to the former algorithms, and this approach can extract non-taxonomic relations with high effectiveness, which lays the foundation for KG construction of plant field.

Transactions of the Chinese Society for Agriculture Machinery (CSAM), in charged of China Association for Science and Technology (CAST), sponsored by CSAM and Chinese Academy of Agricultural Mechanization Science(CAAMS), started publication in 1957. It is the earliest interdisciplinary journal in Chinese which combines agricultural and engineering. It always closely grasps the development direction of agriculture engineering disciplines and the published papers represent the highest academic level of agriculture engineering in China. Currently, nearly 8,000 papers have been already published. There are around 3,000 papers contributed to the journal each year, but only around 600 of them will be accepted. Transactions of CSAM focuses on a wide range of agricultural machinery, irrigation, electronics, robotics, agro-products engineering, biological energy, agricultural structures and environment and more. Subjects in Transactions of the CSAM have been embodied by many internationally well-known index systems, such as: EI Compendex, CA, CSA, etc.

   下载PDF阅读器