面向立木识别的有效K均值聚类算法研究
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家林业局林业科学技术推广项目(2016-29)和中央高校基本科研业务费专项资金项目(2015ZCQ-GX-01)


Effective K-means Clustering Algorithm for Tree Trunk Identification
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对林区自动对靶施药过程中,当立木生长密集时,获取的点云数据聚类准确率低、效率低的问题,提出优化后的K均值聚类算法,数据获取方式基于2D激光扫描。针对立木点云信息聚类前需对相关数据进行滤波,提出窗口滤波算法,选取产生混合像素点的树干边缘,提取3次连续扫描的混合像素及其近邻点组成滤波窗口,进行最大阈值滤波,结果显示50次试验中仅有2个混合像素点未被滤除,混合噪声的滤除率高。在K均值算法优化方面,针对算法需预先确定聚类数和初始聚类中心的不足,提出利用斜率变化确定聚类数的方法,试验对5个不同距离下5组立木分别进行100次测量,结果显示错误测量次数仅为3次,并可在试验前期通过人工方式去除,算法合理有效;对哈夫曼树法确定立木扫描点聚类中心的性能进行了试验分析,3种不同树干分布类型下分别运用随机抽样法和哈夫曼树法进行K均值聚类,前者平均正确率仅为76.4%,后者则为95.5%;同时分析了Ⅰ型分布下2种算法聚类的迭代次数和耗时,5个不同距离下,随机抽样法的平均迭代次数明显高于哈夫曼树法,平均运行耗时上,哈夫曼树法则高于随机抽样法,前者变化范围为120~220ms,后者为50~85ms,该范围为林区测绘的可接受范围。试验证明,基于斜率变化确定聚类数和基于哈夫曼树法确定聚类中心的K均值算法是林区立木点云聚类的有效算法,可应用于林区的立木检测。

    Abstract:

    In the process of automatic targeted spray in forest region at present, the accuracy and efficiency of point cloud data are all low when the tree trunks grow intensively, aimed at which the optimized K-means clustering algorithm was put forward, and data acquisition method was based on 2D laser detection. In view of the relevant data needed to be filtered before clustering analysis for trunk scanning spots, application of window filtering algorithm was put forward. The edge of trunk which generated mixed pixels was selected, and then the mixed pixels deriving from three adjacent scans and the scanning spots deriving from two scanning angles near the mixed pixel were extracted, finally, the maximum threshold filtering processing for the neighbor spots was done. Through 50 times of extractions and analyses of test data, only two mixed pixels were not filtered, which indicated that the filtering rate of mixed noises was high. Aimed at the defects of cluster number and initial cluster centers for Kmeans algorithm needed to be predetermined, the method of slope variation used to determine the clustering number was firstly proposed. Five groups of trunks were respectively measured for 100 times at five different distances in the test, and results showed that the number of error measurements was only three times, which could be removed by artificial way at the early stage of the test, and it indicated that the slope variation algorithm was reasonable and effective. The performance of Huffman tree method, which was used to determine the clustering centers for the trunk scanning spots, was analyzed in another test, and K-means clustering was carried out by using random sampling method and Huffman tree method under three trunk distribution types. The average correct rate of former was only 76.4%, while that of the latter was 95.5%. Meanwhile, iterations and time-consuming using the two above-mentioned algorithms at type I distribution were analyzed, and the average number of iterations of random sampling method was obviously higher than that of Huffman tree method at five different distances, but the average time-consuming of Huffman tree method was higher than that of random sampling method. The variation range of former was 120~220ms and it was 50~85ms for the latter, which were all in acceptable ranges on forest surveying and mapping. Experiments proved that the determining methods for clustering number based on the slope variation algorithm and clustering centers based on Huffman tree method were effective algorithms for the clustering of trunk scanning spots in forest region during using K-means algorithm, which could be applied to tree trunk detection for actual forest region.

    参考文献
    相似文献
    引证文献
引用本文

王亚雄,康峰,李文彬,文剑,郑永军.面向立木识别的有效K均值聚类算法研究[J].农业机械学报,2017,48(3):230-237.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2016-11-15
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2017-03-10
  • 出版日期: