首页 | 本学科首页   官方微博 | 高级检索  
     检索      

条件随机场标引模型的性能影响因素分析
引用本文:章成敏,许鑫,章成志.条件随机场标引模型的性能影响因素分析[J].现代图书情报技术,2008,24(6):34-40.
作者姓名:章成敏  许鑫  章成志
作者单位:1. 南京大学信息管理系,南京,210093;中国药科大学图书馆,南京,210009
2. 华东师范大学信息学系,上海,200241
3. 南京理工大学信息管理系,南京,210094;中国科学技术信息研究所,北京,100038
摘    要:利用条件随机场模型进行自动标引研究,对文本分词性能、训练集的规模、特征的个数、模型本身的参数设置等影响模型标引性能的因素进行实验和分析。

关 键 词:自动标引  关键词提取  条件随机场  机器学习
收稿时间:2008-01-31
修稿时间:2008-03-06

Analysis of the Factors Affecting the Performance of CRF-based Keywords Extraction Model
Zhang Chengmin,Xu Xin,Zhang Chengzhi.Analysis of the Factors Affecting the Performance of CRF-based Keywords Extraction Model[J].New Technology of Library and Information Service,2008,24(6):34-40.
Authors:Zhang Chengmin  Xu Xin  Zhang Chengzhi
Institution:(Department of Information Management, Nanjing University, Nanjing 210093,China) (Library of China Pharmaceutical University, Nanjing 210009,China) (Department of Informatics, East China Normal University, Shanghai  200241,China) (Department of Information Management, Nanjing University of Science &; Technology, Nanjing 210094,China) (Institute of Scientific &; Technical Information of China, Beijing 100038,China)
Abstract: The CRF model can use the features of documents more sufficiently and effectively. Keywords extraction based on CRF is proposed and implemented. The factors affecting the performance of the CRF-based keyword extraction model are analyzed. The factors include: the performance of text segmentation, the scale of training corpus, the number of figure and the parameters setting of the CRF model.
Keywords:Automatic indexing  Keywords extraction  Conditional random fields  Machine learning
本文献已被 万方数据 等数据库收录!
点击此处可从《现代图书情报技术》浏览原始摘要信息
点击此处可从《现代图书情报技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号