首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于PCA的决策树优化算法
引用本文:谢霖铨,徐 浩,陈希邦,赵 楠.基于PCA的决策树优化算法[J].教育技术导刊,2019,18(9):69-71.
作者姓名:谢霖铨  徐 浩  陈希邦  赵 楠
作者单位:江西理工大学 理学院,江西 赣州 341000
基金项目:国家自然科学基金项目(61762047);国家重点研发计划重点专项项目(2016YFB0800700)
摘    要:为了改善传统ID3算法在分类属性选择上存在多值偏向性的不足,提出基于PCA的决策树优化算法。在普通基于PCA 的决策树改进算法中,存在数据经降维处理后代表性不强的问题,导致算法需经过多次数据运行后,准确率才能小幅提升。在ID3算法基础上,在分类前两次提取属性特征值,并计算了需要分类的数据量,也即对原始数据进行最重要的属性选择。在子树建立之后,再进行数据的降维合并选择。采用UCI数据库中的3个数据集对改进算法进行验证,结果表明改进算法的平均准确率达到94.6%,相比传统ID3算法与普通PCA决策树优化算法分别提升了1.6%和0.6%。因此,基于PCA的决策树算法能在一定程度上提升结果准确率,具备一定的应用价值。

关 键 词:决策树算法  ID3  PCA算法  
收稿时间:2018-12-26

PCA-based Decision Tree Optimization Algorithm
XIE Lin-quan,XU Hao,CHEN Xi-bang,ZHAO Nan.PCA-based Decision Tree Optimization Algorithm[J].Introduction of Educational Technology,2019,18(9):69-71.
Authors:XIE Lin-quan  XU Hao  CHEN Xi-bang  ZHAO Nan
Institution:College of Science, Jiangxi University of Science and Technology,Ganzhou 341000, China
Abstract:In this paper,the problem of the multi-valued bias of the traditional ID3 algorithm in classification attribute selection is improved. A PCA-based decision tree optimization algorithm is proposed. In the ordinary PCA-based decision tree improvement algorithm, there are data after dimension reduction processing. The problem of low representation is that the improved algorithm needs to pass through multiple data to bring the accuracy to increase slightly. Therefore, based on the ID3 algorithm, the feature values are extracted twice before classification, and the classification needs to be calculated. The amount of data, that is, the most important attribute selection for the original data, after the subtree is established, the data is reduced and merged and selected. In the experimental stage, the improved algorithm was verified by three data sets in the UCI database. The results showed that the average accuracy rate in the three data sets reached 94.6%, and the traditional ID3 algorithm and the ordinary PCA decision tree optimization algorithm were improved by 1.6% and 0.6%, which proves the algorithm has certain practical significance.
Keywords:decision tree algorithm  ID3  PCA algorithm  
点击此处可从《教育技术导刊》浏览原始摘要信息
点击此处可从《教育技术导刊》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号