CORC  > 北京大学  > 信息科学技术学院
A new fuzzy decision tree classification method for mining high-speed data streams based on binary search trees
Li, Zhoujun ; Wang, Tao ; Wang, Ruoxue ; Yan, Yuejin ; Chen, Huowang
2007
英文摘要Decision tree construction is a well-studied problem in data mining. Recently, there has been much interest in mining data streams. Domingos and Hulten have presented a one-pass algorithm for decision tree constructions. Their system using Hoeffding inequality to achieve a probabilistic bound on the accuracy of the tree constructed. Gama et al. have extended VFDT in two directions. Their system VFDTc can deal with continuous data and use more powerful classification techniques at tree leaves. Peng et al. present soft discretization method to solve continuous attributes in data mining. In this paper, we revisit these problems and implemented a system sVFDT for data stream mining. We make the following contributions: 1) we present a binary search trees (BST) approach for efficiently handling continuous attributes. Its processing time for values inserting is O(nlogn), while VFDTs processing time is O(n 2). 2) We improve the method of getting the best split-test point of a given continuous attribute. Comparing to the method used in VFDTc, it decreases from O(nlogn) to O (n) in processing time. 3) Comparing to VFDTc, sVFDT s candidate split-test number decrease from O(n) to O(logn).4)Improve the soft discretization method to increase classification accuracy in data stream mining. ? Springer-Verlag Berlin Heidelberg 2007.; EI; 0
语种英语
内容类型其他
源URL[http://ir.pku.edu.cn/handle/20.500.11897/409583]  
专题信息科学技术学院
推荐引用方式
GB/T 7714
Li, Zhoujun,Wang, Tao,Wang, Ruoxue,et al. A new fuzzy decision tree classification method for mining high-speed data streams based on binary search trees. 2007-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace