首页 | 本学科首页   官方微博 | 高级检索  
     检索      

关键词共现方法识别领域研究热点过程中的数据清洗方法
引用本文:潘玮,牟冬梅,李茵,刘鹏.关键词共现方法识别领域研究热点过程中的数据清洗方法[J].图书情报工作,2017,61(7):111-117.
作者姓名:潘玮  牟冬梅  李茵  刘鹏
作者单位:1. 蚌埠医学院卫生管理系 蚌埠 233030; 2. 吉林大学公共卫生学院 长春 130021
基金项目:本文系国家自然科学面上项目"嵌入式知识服务驱动下的领域多维知识库构建"(项目编号:71573102)和蚌埠医学院人文社科基金重点项目"医药专利研究领域的知识图谱绘制与分析"(项目编号:BYKY16110skZD)研究成果之一。
摘    要:目的/意义] 针对关键词共现方法识别领域研究热点过程中数据清洗进行理论研究与探索,以辅助科研工作者准确识别领域研究热点。方法/过程] 在文献调研的基础上,阐述数据清洗的定义和对象,并分析脏数据产生的原因和影响,进而制定数据清洗的步骤和方案,并采用实证研究方法对数据清洗的效果和方案的可行性进行验证。结果/结论] 研究结果表明该数据清洗方案能够提高研究热点识别的准确性,从而证明了该方案的可行性。

关 键 词:关键词共现  研究热点  研究领域分析  数据清洗  数据挖掘  
收稿时间:2017-01-03

Data Cleaning in the Process of Identifying Research Hotpot Based on Keywords Co-occurrence
Pan Wei,Mu Dongmei,Li Yin,Liu Peng.Data Cleaning in the Process of Identifying Research Hotpot Based on Keywords Co-occurrence[J].Library and Information Service,2017,61(7):111-117.
Authors:Pan Wei  Mu Dongmei  Li Yin  Liu Peng
Institution:1. Department of Health Management, Bengbu Medical College, Bengbu 233030; 2. School of Public Health, Jilin University, Changchun 130021
Abstract:Purpose/significance] In order to efficiently aid researchers to identify research hotpot, this paper aims to explore theoretical basis and practical guidance of data cleaning in the process of identifying research hotpots based on keywords co-occurrence. Method/process] On the basis of literature research, it firstly defines the conception and the objects of data cleaning. Then it analyses the reasons and influences of dirty data. Finally, it proposes the procedures of data cleaning, which is verified by empirical research method. Result/conclusion] The result indicates that the procedures of data cleaning which are proved to be feasible can increase the accuracy of identification of research hotpot.
Keywords:keywords co-occurrence  research hotpot  research area analysis  data cleaning  data mining  
点击此处可从《图书情报工作》浏览原始摘要信息
点击此处可从《图书情报工作》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号