首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于多源数据的专业领域热点探测模型研究
引用本文:王晓光,王宏宇,黄菡.基于多源数据的专业领域热点探测模型研究[J].图书情报工作,2019,63(14):52-61.
作者姓名:王晓光  王宏宇  黄菡
作者单位:1. 武汉大学信息资源研究中心 武汉 430072; 2. 中南财经政法大学信息与安全工程学院 武汉 430072
基金项目:本文系国家自然科学基金面上项目"基于大规模开放科学知识图谱的学科新兴趋势探测研究"(项目编号:71874129)和国家社会科学基金重大项目"基于认知计算的学术论文评价理论与方法研究"(项目编号:17ZDA292)研究成果之一。
摘    要:目的/意义]面向出版业进行专业领域出版时的选题决策问题,对互联网上公开的资讯动态进行多源整合,通过多维度的情报分析探测专业领域内的热点,实现数据驱动的出版选题决策,为出版业的数字化转型与发展奠定坚实基础。方法/过程]设计一个情报分析模型,面向出版选题决策进行专业领域的热点探测。模型包含热点发现与热度评价两个过程。热点发现过程,通过词频统计和词增长速度算法对专业领域内的热点进行识别;热度评价过程,从内容层面和传播层面两个维度设计并计算一系列指标,对识别到的热点进行热度评价与排序。结果/结论]以2018年1月至4月的36 550条信息、通讯和技术领域多源中文信息为样本进行热点探测实验,实验结果表明,设计的热点探测模型可以有效地探测专业领域内的热点,辅助出版业科学地进行专业领域选题决策。

关 键 词:选题决策  热点探测  热点发现  热度计算  热度评价  
收稿时间:2018-12-12

Towards Professional Publishing: Research on Hotspot Detection Model Based on Multi-source Data
Wang Xiaoguang,Wang Hongyu,Huang Han.Towards Professional Publishing: Research on Hotspot Detection Model Based on Multi-source Data[J].Library and Information Service,2019,63(14):52-61.
Authors:Wang Xiaoguang  Wang Hongyu  Huang Han
Institution:1.Center for Studies of Information Resources, Wuhan University, Wuhan 430072;2.School of Information and Safety Engineering, Zhongnan University of Economic and Law, Wuhan 430072
Abstract:Purpose/significance] In order to solve the problem of topic selection for professional fields in publishing industry, this paper integrates multisource dynamic information on the Internet to detect the hotspots for professional fields through multi-dimensional intelligence analysis. The data-driven topic selection is realized to lay a solid foundation for the digitization transformation and development of publishing industry.Method/process] A intelligence analysis model towards topic selection was proposed to detect hotspots in professional fields. The model was divided into two steps:the hotspot discovery and the hotness evaluation. The hotspot discovery in this model identified hotspots in professional fields through word frequency statistics and the algorithm of word growth rate. Then, in the step of hotness evaluation, a series of indices in the dimension of content and spread were designed to calculate and evaluate the hotness of the hotspots identified in the last step.Result/conclusion] A hotspots detecting experiment was conducted with 36,550 pieces of Chinese multisource dynamic information in the area of ICT collected from January to April of 2018, which verified the effectiveness of the proposed model. This model can be used in publishing industry to complete the step of topic selection scientificallyn
Keywords:topic selection  hotspot tracking  hotspot detection  hotness calculate  hotness evaluation  
点击此处可从《图书情报工作》浏览原始摘要信息
点击此处可从《图书情报工作》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号