首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于本体的汉语领域命名实体识别
引用本文:史树敏,冯冲,黄河燕,刘东升,王树梅.基于本体的汉语领域命名实体识别[J].情报学报,2009,28(6).
作者姓名:史树敏  冯冲  黄河燕  刘东升  王树梅
作者单位:1. 南京理工大学计算机科学与技术学院,南京,210094;内蒙古师范大学计算机与信息工程学院,呼和浩特,010022
2. 中国科学院计算机语言信息工程研究中心,北京,100097
3. 内蒙古师范大学计算机与信息工程学院,呼和浩特,010022
4. 南京理工大学计算机科学与技术学院,南京,210094
基金项目:基金项目:本文得到国家863,国家自然科学基金 
摘    要:命名实体识别是众多自然语言处理任务的核心内容之一,也是近年来的领域研究热点.本文将命名实体分为两大类:常规命名实体和领域命名实体.基于已经构建的领域本体MPO,本文提出一种基于本体知识规则与统计方法相结合的领域命名实体识别方法.该方法通过本体化实例,获取实体构成词性规则模板,结合CRFs机器学习模型,进行领域命名实体识别.实验结果表明:相比运用单一统计方法而言,该方法能使领域实体的识别性能显著提高,F值达到92.36%.同时表明本体化知识规则的有效运用,能够在领域实体边界和特殊形式领域实体识别的准确率上发挥积极作用.

关 键 词:领域实体  领域命名实体识别  本体  词性规则模板

Recognition of Chinese Domain Named Entities Based on Ontology
Shi Shumin,Feng Chong,Huang Heyan,Liu Dongsheng,Wang Shumei.Recognition of Chinese Domain Named Entities Based on Ontology[J].Journal of the China Society for Scientific andTechnical Information,2009,28(6).
Authors:Shi Shumin  Feng Chong  Huang Heyan  Liu Dongsheng  Wang Shumei
Abstract:Named Entity Recognition ( NER) is one of kernel task in many Natural Language Processing ( NLP) applications,which has recently become the hot spot of research. Named Entities are classified into General Named Entities ( GNEs) and Domain Named Entities (DNEs) in this paper. We put forward a method of Chinese Domain Named Entity Recognition (DNER) which combining Conditional Random Field ( CRF) with the rule templates of POS based on formalized instances that acquired from domain ontology constructed already. Results of experiments indicate that such a method can improve effectively the performance on DNER and F-measure has reached 92.36 % . Experimental data also show that ontological knowledge can make great effect in recognizing the boundaries of DNEs and DNEs with special forms.
Keywords:CRFs
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号