首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Time-series classification with SAFE: Simple and fast segmented word embedding-based neural time series classifier
Institution:1. College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou, China 450002;2. School of Cyber Science and Engineering, Wuhan University, Wuhan, China 430079;3. School of Information Engineering, North China University of Water Resources and Electric Power, Zhengzhou, China 450045;4. Henan Key Laboratory of Cyberspace Situation Awareness, Zhengzhou 450001;1. School of Information Management, Nanjing University, Nanjing 210023, China;2. School of Computer Science and Engineering, Southeast University, Nanjing 210096, China;3. School of Economics and Management, Nanjing University of Science and Technology, Nanjing 210094, China;1. West China Biomedical Big Data Center, West China Hospital, Sichuan University, No.37 Guoxue Alley, Chengdu 610041, China;2. Department of Radiology, West China Hospital, Sichuan University, No.37 Guoxue Alley, Chengdu 610041, China;3. West China Periodicals, West China Hospital, Sichuan University, No.37 Guoxue Alley, Chengdu 610041, China;4. Department of Bile Duct Surgery, West China Hospital, Sichuan University, No.37 Guoxue Alley, Chengdu 610041, China
Abstract:Dictionary-based classifiers are an essential group of approaches in the field of time series classification. Their distinctive characteristic is that they transform time series into segments made of symbols (words) and then classify time series using these words. Dictionary-based approaches are suitable for datasets containing time series of unequal length. The prevalence of dictionary-based methods inspired the research in this paper. We propose a new dictionary-based classifier called SAFE. The new approach transforms the raw numeric data into a symbolic representation using the Simple Symbolic Aggregate approXimation (SAX) method. We then partition the symbolic time series into a sequence of words. Then we employ the word embedding neural model known in Natural Language Processing to train the classifying mechanism. The proposed scheme was applied to classify 30 benchmark datasets and compared with a range of state-of-the-art time series classifiers. The name SAFE comes from our observation that this method is safe to use. Empirical experiments have shown that SAFE gives excellent results: it is always in the top 5%–10% when we rank the classification accuracy of state-of-the-art algorithms for various datasets. Our method ranks third in the list of state-of-the-art dictionary-based approaches (after the WEASEL and BOSS methods).
Keywords:Time series  Classification  Word embedding  Neural network
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号