首页 | 本学科首页   官方微博 | 高级检索  
     检索      

自然语言理解心理学在短文本分类中的实证研究
引用本文:盛宇,刘俊熙,郭金兰,龙怡.自然语言理解心理学在短文本分类中的实证研究[J].现代情报,2009,29(8):4-7.
作者姓名:盛宇  刘俊熙  郭金兰  龙怡
作者单位:上海政法学院计算机教研室,上海,201701
基金项目:该项目基于上海政法学院计算机实验室决策支持系统项目 
摘    要:目前对文本分类研究多数集中在对大规模语料基础上的特征选择或分类器算法的研究。本文是建立在训练样本少且样本长度短的基础上,根据人脑对自然语言理解的心理学原理"人们总是根据已知的最熟悉的、最典型的例子进行判断,只有在该方法不奏效的时候才使用频率这一概念,并且使用的是十分简单的频率"从该角度进行短文本分类的实证研究。以心理学中的"熟悉原理"、"典型原理"等为模型建立特殊词库和典型案例词库,改进了传统文本分类的实验步骤,同时提出了该方法的优势和局限性。

关 键 词:文本分类  短文本  特征选择  自然语言  心理学

Research of Natural Language Understanding of Psychology in the Short Text Classification
Authors:Sheng Yu  Liu Junxi  Guo Jinlan  Long Yi
Institution:Department of Computer, Shanghai University of Political Science and Law, Shanghai 201701, China
Abstract:The current research of classification of most text focused on large - scale corpus on the basis of choice of the characteristics or classification algorithm. Tiffs article is built on less training samples with short length, according to the human brain's understanding of the psychology principle of natural language "People always make a judgment according to the most familiar, the most typical example. They only use the concept of frequency when this method is not effective. It is also a very simple freguency." We do research from this perspective of the short text classification. We establish a special vocabulary and a typical vocabulary based on "familiar principle", "typical principle" which are known in psychology. We improve the experimental steps of the traditional classification of the text and mention the advantages and limitations of this method.
Keywords:text categorization  short text  feature selection  natural language  psychology
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《现代情报》浏览原始摘要信息
点击此处可从《现代情报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号