
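Automatic Acquisition of Keyword-Indexing Knowledge Based on a Category-Labeled Corpus
Cite this article: Liu Hua. Automatic Acquisition of Keyword-Indexing Knowledge Based on a Category-Labeled Corpus [J]. Library and Information Service, 2007, 51(7): 41-43.
Author: Liu Hua
Affiliation: College of Chinese Language and Culture (华文学院), Jinan University; Overseas Chinese Language Research Center (海外华语研究中心)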
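Abstract: Drawing on a large-scale, hierarchically classified corpus, keywords that experts have already indexed on web pages are extracted to form a keyword list. To address two characteristics of keywords, their uneven distribution across domains and their neighboring domains, a subject degree that measures how strongly a keyword represents the subject of a text is proposed and computed by simulation. With the keywords and their subject degrees serving as domain knowledge, combined with statistical methods, a keyword auto-indexing system that integrates knowledge and statistics is built.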

Keywords: keyword indexing; classified corpus; subject degree
Received: 2006-06-11
Revised: 2006-06-10; 2006-12-18

Knowledge Repository Acquire for Keywords Auto-Indexing System Based on Labeled and Classed Corpus
Liu Hua.Knowledge Repository Acquire for Keywords Auto-Indexing System Based on Labeled and Classed Corpus[J].Library and Information Service,2007,51(7):41-43.
Authors:Liu Hua
Abstract: Keywords that indexing specialists have labeled on web pages are extracted from a large-scale classified corpus to form a keyword list. Based on two characteristics of keywords, their uneven distribution across fields and the existence of a range edge, a subject degree is proposed for each word and calculated with a statistical model. The subject degree expresses how strongly a word represents the subject concept of a text's content. Based on the subject degree, a keyword auto-indexing system is constructed.
Keywords: keywords indexing; classed corpus; subject degree
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).