首页 | 本学科首页   官方微博 | 高级检索  
     检索      

Comparison of Attribute Reduction Methods for Coronary Heart Disease Data by Decision Tree Classification
作者姓名:郑刚  黄亚楼  王鹏涛  舒光复
作者单位:[1]Department of Computer Science, Tianjin University of Technology, Tianjin 300191 , China [2]College of Information Technical Science, Nankai University, Tianjin 300071 , China [3]College of Software, Nankai University, Tianjin 300071, China [4]System Institute, Chinese Academy of Sciences, Beijing 100864, China
基金项目:Supported by Ministry of Education of China ( No. 02038 ) , Asian Research Center of Nankai University ( No. AS0405) , and Tianjin Higher Education Science Development Fund( No. 20030621 ).
摘    要:Attribute reduction is necessary in decision making system. Selecting right attribute reduction method is more important. This paper studies the reduction effects of principal components analysis (PCA) and system reconstruction analysis , SRA) on coronary heart disease data. The data set contains 1723 records, and 71 attributes in each record. PCA and SRA are used to reduce attributes number (less than 71 ) in the data set. And then decision tree algorithms. C4.5, classification and regression tree ( CART), and chi-square automatic interaction detector ( CHAID ), are adopted to analyze the raw data and attribute reduced data. The parameters of decision tree algorithms, including internal node number, maximum tree depth, leaves number, and correction rate are analyzed. The result indicates that. PCA and SRA data can complete attribute reduction work. and the decision-making rate on the reduced data is quicker than that on the raw data: the reduction effect of PCA is better than that of SRA. while the attribute assertion of SRA is better than that of PCA. PCA and SRA methods exhibit good performance in selecting and reducing attributes.

关 键 词:冠心病  主成份分析  系统重建分析  决策树
收稿时间:2005-08-15

Comparison of Attribute Reduction Methods for Coronary Heart Disease Data by Decision Tree Classification
ZHENG Gang,HUANG Yalou,WANG Pengtao,SHU Guangfu.Comparison of Attribute Reduction Methods for Coronary Heart Disease Data by Decision Tree Classification[J].Transactions of Tianjin University,2005,11(6):463-468.
Authors:ZHENG Gang  HUANG Yalou  WANG Pengtao  SHU Guangfu
Abstract:Attribute reduction is necessary in decision making system. Selecting right attribute reduc-tion method is more important. This paper studies the reduction effects of principal components analysis ( PCA) and system reconstruction analysis (SRA) on coronary heart disease data. The data set contains 1 723 records, and 71 attributes in each record. PCA and SRA are used to reduce attributes number (less than 71 ) in the data set. And then decision tree algorithms, C4.5, classifi-cation and regression tree (CART). and chi-square automatic interaction detector ( CHAID), are adopted to analyze the raw data and attribute reduced data. The parameters of decision tree algo-rithms, including internal node number, maximum tree depth, leaves number, and correction rate are analyzed. The result indicates that, PCA and SRA data can complete attribute reduction work, and the decision-making rate on the reduced data is quicker than that on the raw data; the reduction effect of PCA is better than that of SRA. while the attribute assertion of SRA is better than that of PCA. PCA and SRA methods exhibit good performance in selecting and reducing attributes.
Keywords:principal components analysis ( PCA)  system reconstruction analysis ( SRA)  attribute reduction  decision tree
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号