首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于角度的变系数多分类支持向量机
作者姓名:康文佳  林文辉  张三国
作者单位:1. 中国科学院大学数学科学学院, 北京 100049; 2. 航天信息股份有限公司技术研究院, 北京 100195
基金项目:Supported by the open project of Hubei Collaborative Innovation Center for Early Warning and Emergency Response Technology (JD20150402)
摘    要:支持向量机作为机器学习中一个经典的分类算法,一直广受数据科学家的喜爱。无论是处理线性可分还是非线性可分数据,传统的支持向量机能够很好地解决二分类问题。针对给定的样本,支持向量机通过最大化最小间隔得到最佳的决策分界面,从而实现对新样本的类别预测。然而现实中的数据更为复杂多样,一方面数据的类别往往多于两个,近年不乏有优秀的多分类支持向量机算法出现;另一方面不同领域的数据的特征集中可能存在相对特殊的变量(称之为主变量,targeted variable),需要将其挑选出来并加以特殊处理,以保持主变量对最终分类结果的重要影响。考虑这两个方面,提出基于角度的变系数多分类支持向量机(TLAMSVM)模型以解决含有主变量的多分类问题。它使用具备更好几何解释能力的基于角度的间隔最大分类框架完成多分类,并引入变系数模型,通过选择合适的局部光滑函数处理主变量对模型的影响。把基于角度的变系数多分类支持向量机分别应用到模拟数据集和真实数据集上。数值结果显示,相比没有使用变系数思想或基于角度的多分类框架的多分类支持向量机,TLAMSVM模型具有更高的预测准确度。

关 键 词:局部光滑  多分类支持向量机  基于角度的间隔最大分类框架  
收稿时间:2017-11-27
修稿时间:2018-04-23

Targeted local angle-based multi-category support vector machine
Authors:KANG Wenjia  LIN Wenhui  ZHANG Sanguo
Institution:1. School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; 2. Technology Research Institute, Aisino Corporation, Beijing 100195, China
Abstract:The support vector machine(SVM) is one of the most concise and efficient classification methods in machine learning. Traditional SVMs mainly handle with binary classification problems by maximizing the smallest margins. However, the real-world data are much more complicated. On the one hand, the label set usually has more than two categories, so SVMs need to be generalized for solving multi-category problems reasonably. On the other hand, there may exist one special variable which should be singled out to preserve its effect on the final results from other variables such as age in bioscience field. We name such a special variable as targeted variable. In this work, in order to take both aspects mentioned above into consideration, targeted local angle-based multi-category support vector machine(TLAMSVM) is proposed. This new model not only solves multi-category problems but also pays special attention to targeted variable. Moreover, TLAMSVM solves multi-classification in the framework of angle-based method, which provides a better interpretation from the geometrical viewpoint, and it uses local smoothing method to pool the information of targeted variable. In order to validate the classification effect of TLAMSVM model, we apply it to both simulated and real data sets, respectively, and get the expected results in numerical experiments.
Keywords:local smoothing                                                                                                                        multi-category support vector machine                                                                                                                        angle-based maximum margin classification framework
点击此处可从《》浏览原始摘要信息
点击此处可从《》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号