Analysis of Extraction of Semantic Feature in Agricultural Question and Answer Based on Convolutional Model

Fund Project: National Natural Science Foundation of China (61571051), Beijing Natural Science Foundation (4172024), and the Beijing Academy of Agriculture and Forestry Sciences 2018 Scientific Research Innovation Platform Construction Project (PT2018-25)

    Abstract:

    Internet-based agricultural technology extension communities generate nearly ten thousand question-and-answer pairs per second. These massive data carry implicit part-of-speech, sentiment, and redundant-vector features, and aggregating the data while reducing redundant data blocks remains a difficult problem in this field. A convolutional neural network model is proposed for extracting and analyzing sentiment polarity features from agricultural question-and-answer data. The dataset is segmented with an agricultural word segmentation dictionary and converted into 256-dimensional word vectors with the Skip-gram model; a batch-normalized convolutional neural network is then trained on it to obtain the model parameters used to identify part-of-speech and sentiment similarity between questions and answers in the extension community. Experimental results show that the method accurately identifies redundant queues in the test sample set. Compared with five other text classification methods, it shows clear advantages on every metric, and its semantic feature extraction accuracy on the test set reaches 82.7%.
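
The pipeline named in the abstract (Skip-gram word vectors feeding a batch-normalized convolutional network) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not give filter widths, layer counts, or training details, so the window size, the filter width of 3, the random vectors standing in for trained embeddings, and the example tokens are all assumptions made for illustration. Only the 256-dimensional vector size comes from the abstract. The learnable scale and shift of batch normalization are omitted for brevity.

```python
import math
import random


def skipgram_pairs(tokens, window=2):
    """Return the (center, context) pairs that a Skip-gram model is trained on."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        pairs.extend((center, tokens[j]) for j in range(lo, hi) if j != i)
    return pairs


def conv1d(sentence, kernel):
    """Valid 1-D convolution of one filter over a (length x dim) sentence matrix."""
    width = len(kernel)
    out = []
    for start in range(len(sentence) - width + 1):
        out.append(sum(
            w * x
            for k in range(width)
            for w, x in zip(kernel[k], sentence[start + k])
        ))
    return out


def batch_norm(feature_maps, eps=1e-5):
    """Normalize one filter's activations over every sentence and position in the batch."""
    flat = [v for fmap in feature_maps for v in fmap]
    mean = sum(flat) / len(flat)
    var = sum((v - mean) ** 2 for v in flat) / len(flat)
    scale = 1.0 / math.sqrt(var + eps)
    return [[(v - mean) * scale for v in fmap] for fmap in feature_maps]


def relu_max_pool(feature_maps):
    """Apply ReLU, then keep the strongest activation per sentence (max-over-time)."""
    return [max(max(v, 0.0) for v in fmap) for fmap in feature_maps]


# Hypothetical segmented question; a real system would use the agricultural dictionary.
tokens = ["wheat", "leaf", "yellowing", "how", "control"]
pairs = skipgram_pairs(tokens, window=1)

random.seed(0)
DIM = 256  # word-vector size stated in the abstract


def random_vector():
    # Stand-in for a trained Skip-gram embedding (assumption).
    return [random.gauss(0.0, 1.0) for _ in range(DIM)]


# A batch of three sentences with 5, 6, and 7 words, and one filter spanning 3 words.
batch = [[random_vector() for _ in range(n)] for n in (5, 6, 7)]
kernel = [[random.gauss(0.0, 0.05) for _ in range(DIM)] for _ in range(3)]

feature_maps = [conv1d(sentence, kernel) for sentence in batch]
normalized = batch_norm(feature_maps)
pooled = relu_max_pool(normalized)
print(len(pairs), [len(f) for f in feature_maps], pooled)
```

Each sentence ends up as one pooled value per filter; a full model would stack many filters and pass the pooled vector to a classifier layer.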

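Cite this article

Zhang Mingyue, Wu Huarui, Zhu Huaji. Analysis of Extraction of Semantic Feature in Agricultural Question and Answer Based on Convolutional Model[J]. Transactions of the Chinese Society for Agricultural Machinery (农业机械学报), 2018, 49(12): 203-210.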

History
  • Received: 2018-05-23
  • Published online: 2018-12-10