首页 | 本学科首页   官方微博 | 高级检索  
     检索      

面向学术文献的知识挖掘方法研究
引用本文:王凯,孙济庆,李楠.面向学术文献的知识挖掘方法研究[J].现代情报,2017,37(5):47.
作者姓名:王凯  孙济庆  李楠
作者单位:华东理工大学科技信息研究所, 上海 200237
基金项目:2015年度教育部人文社会科学研究青年项目"面向语义出版的富语义模型构建与应用研究"(项目编号:15YJC870014);中央高校基本科研业务费专项资金资助项目"信息环境下的数字文献内涵语义聚合研究"(项目编号:WR1522001)
摘    要:学术文献中包含的大量有价值的知识往往无法在摘要中体现出来。本文提出一种基于位置加权的核心知识挖掘方法,旨在以句为知识处理粒度,抽取正文中的核心句子作为独立的知识单元。该方法通过量化句子间的关联,将正文表示成一个以句子为节点,句子间关联为边的文本关系网络,提出基于章节的位置加权算法,结合社会网络分析方法,挖掘出文本中核心知识单元部分的句子。实验结果表明,该方法可以实现对文章核心章节中重要句子的抽取,达到初步预期效果。

关 键 词:学术文献  知识挖掘  方法  位置加权  知识抽取  文本网络  社会网络分析  

Research on Method of Knowledge Minning in Academic Documents
Authors:Wang Kai  Sun Jiqing  Li Nan
Institution:Institute of Science and Technology Information, East China University of Science and Technology, Shanghai 200237, China
Abstract:There is abundant valuable knowledge inside academic documents that is not revealed in abstracts. This paper promoted a method of core knowledge discovery based on position weights, aiming to extract the core sentences as separated knowledge units in the main text with the processing size of sentence. By measuring the connection between sentences, the paper transformed main text into a text network that considers sentences as dots and connection between sentences as sides. An algorithm to compute position weights based on chapters was promoted in this paper. With the help of social network analysis, the paper could find sentences that revealed the core knowledge of the text. The result of the experiment showed that this method could realize the extraction of key sentences in the core chapter from the text, which is primarily expected.
Keywords:academic documents  knowledge minning  method  position weight  knowledge extraction  text network  social network analysis  
点击此处可从《现代情报》浏览原始摘要信息
点击此处可从《现代情报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号