首页 | 本学科首页   官方微博 | 高级检索  
     检索      

人文社科专题数据库建设的主题选择研究
引用本文:刘雨农,吴柯烨,权昭瑄.人文社科专题数据库建设的主题选择研究[J].现代情报,2009,39(12):11.
作者姓名:刘雨农  吴柯烨  权昭瑄
作者单位:南京大学信息管理学院, 江苏 南京 210023
基金项目:国家社会科学基金重大项目"人文社科专题数据库建设规范化管理研究"(项目编号:18ZDA326)。
摘    要:目的/意义] 探索一种融入数据驱动思维的人文社科专题数据库建设主题选择方法,为相关主体在建库主题的遴选、比较和确定等工作提供决策参考。方法/过程] 从政策、用户两个维度出发,提出基于政策文本与检索数据的人文社科专题数据库主题筛选框架。以Fulink平台为例,基于政策文本LDA主题分类建模和检索数据的词频统计归类,确定专题数据库建设备选主题,最后通过比对筛选将主题进行分类。结果/结论] 本文构建的主题选择框架,能够有效提升相关主题选择工作的全面性、准确性、科学性,为人文社科专题数据库建设的项目规划等提供了良好的思路。

关 键 词:人文社科  专题数据库  主题选择  LDA  

Research on Topic Selection of Humanities and Social Sciences Thematic Database
Authors:Liu Yunong  Wu Keye  Quan Zhaoxuan
Institution:School of Information Management, Nanjing University, Nanjing 210023, China
Abstract:Purpose/Significance] This paper explored a topic selection method for Humanities and Social Sciences database with data-driven thinking,which can provide reference for relevant institution to make decisions on the topic selection of database.Method/Process] On the basis of requirement analysis,the topic selection framework of Humanities and Social Sciences thematic database based on policy texts and retrieval data was proposed from two dimensions:policy and user.Based on LDA topic classification modeling of policy texts and word frequency statistical classification of retrieval data,alternative topics for thematic database construction were determined,and finally,topics were classified through comparative selection.Results/Conclusion] The theme selection framework constructed in this paper provided a new idea and method for project planning of database,which could effectively improve the comprehensiveness of topic selection.
Keywords:Humanities and Social Sciences  thematic database  topic selection  LDA  
点击此处可从《现代情报》浏览原始摘要信息
点击此处可从《现代情报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号