基于Multi-probe LSH的菊花花型相似性计算
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家自然科学基金项目(61502236)和中央高校基本科研业务费专项资金项目(KYZ201752、KJQN201651)


Chrysanthemum Petal Similarity Evaluation Based on Multi-probe Locality Sensitive Hashing
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对海量高维菊花图像相似性计算带来的挑战,研究了基于多探测局部位置敏感哈希技术的菊花表型相似性计算方法。针对菊花图像,采用SIFT技术提取菊花图像特征,并采用BoVW模型进行建模。由于图像特征的高维性质,海量的菊花表型相似性计算效率不高,为了提高计算效率,提出采用近似相似性技术中的多探测局部位置敏感哈希技术,用此方法构建菊花图像数据的哈希数据结构,在菊花相似性查询方面提高了计算效率,并确保了计算结果的质量。在菊花数据集上进行了计算效率和查询质量两方面的测试,并与典型的方法进行了试验对比和分析。结果表明,相比线性式扫描,平均查询成功概率达到0.90以上,平均加速比为3.3~19.8。本文方法能够在查询质量和计算效率两方面通过参数设置提供灵活的优化选择,并对参数的选择提供了参考范围,可为海量菊花花型相似性计算提供参考。

    Abstract:

    Plant phenotyping is an important research topic in the field of botany. The similarity of plant phenotypes is widely used in plant taxonomy, ecology and digital agriculture etc. It is one of the important contents of plant phenotype research. Chrysanthemum is an important plant in China as well as in the world, and the phenotype similarity evaluation of chrysanthemum plays an important role in chrysanthemum classification and phenotypic research. The feature of high-dimension of massive chrysanthemum data brings great challenge for chrysanthemum phenotype analysis, from this point of view, the chrysanthemum phenotypic similarity query and evaluation were studied based on multiprobe locality sensitive hashing technique. For evaluating the similarity of chrysanthemum image, the SIFT features of the chrysanthemum images were extracted and clustered based on the K-means method. Hereafter, the bag of visual words (BoVW) model was built. Due to the high-dimensional nature of the image features, especially for the massive chrysanthemum images, the computing efficiency of the query was a big challenge for the high dimensional problem. The multi-probe locality sensitive hashing (LSH) was applied for chrysanthemum phenotype similarity computing. The multiprobe locality sensitive hashing technique was an optimization technique for high-dimensional data similarity query. By means of the technique, a hash data structure of chrysanthemum image data was constructed, which improved query efficiency in chrysanthemum similarity query and ensured the query result quality. The theory of the multi-probe locality sensitive hashing was analyzed, in addition to this, extensive experiments were conducted and important results were gained as well. Experiments showed that compared with linear scanning, the average success probability of the query can reach above 090, and the average acceleration ratio was 3.3~19.8,furthermore, it was also compared with the typical method in the aspects of query quality and query efficiency, and the results demonstrated that the method was better than the entropy based LSH in quality and performance. The experimental results revealed that the query quality and query efficiency could be tuned flexibly through the parameter settings of hash function number and the hash tables, which provided an elastic way for the choice for tuning the quality and efficiency. In addition, it can provide technical reference for massive chrysanthemum phenotypic similarity calculation.

    参考文献
    相似文献
    引证文献
引用本文

袁培森,翟肇裕,钱淑韵,徐焕良.基于Multi-probe LSH的菊花花型相似性计算[J].农业机械学报,2019,50(7):208-215.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2019-01-06
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2019-07-10
  • 出版日期: