共查询到20条相似文献,搜索用时 69 毫秒
1.
基于既定词表的自适应汉语分词技术研究 总被引:3,自引:0,他引:3
提出一种汉语分词算法,在给定的分词词表的基础上进行汉语分词时,不但能成功切分出分词词表中已有的词,而且能同时自动识别出分词词表中没有的词,即未登录词。与逆向最长匹配法以及其他未登录词识别算法进行的测试比较表明,该分词算法可以有效地解决大多数未登录词的识别问题,并且能减少分词错误,同时对分词算法的效率基本没有影响。 相似文献
2.
全二分快速自动分词算法构建 总被引:1,自引:0,他引:1
张海营 《现代图书情报技术》2007,2(4):52-55
分析现有分词算法存在的不足,在此基础上提出一种新的分词词典,通过为分词词典建立首字Hash表和词索引表两级索引,使得该分词词典支持全二分最大匹配分词算法,利用该分词算法进行自动分词,其时间复杂度实现了大的改善。 相似文献
3.
基于EM算法的汉语自动分词方法 总被引:9,自引:1,他引:8
汉语自动分词是中文信息处理中的基础课题。本文首先对汉语分词的基本概念与应用 ,以及汉语分词的基本方法进行了概述。接着引出一种根据词的出现概率、基于极大似然原则构建的汉语自动分词的零阶马尔可夫模型 ,并重点剖析了EM(Expectation Maximization)算法 ,对实验结果进行了分析。最后对算法进行了总结与讨论。 相似文献
4.
本文讨论了书面汉语的人工辅助分词和自动分词,并以汉语语言学为依据归纳了用汉语词素构词的类型。就书面汉语自动分词的复杂性和依赖于汉语词素构词法的自动分词的可行性进行了分析。本文给出了该自动切分方法分层处理的基本构思和程序框图。 相似文献
5.
6.
基于EMM中文抽词算法的XMARC主题信息挖掘 总被引:4,自引:0,他引:4
本文在分词词典上采用区间最大词长,改进正向减字最大匹配法为“词首 长词匹配 短词推进”自动标引方法,从而有效地减少领域的分词歧义性和缩短标引时间。最后将该研究付诸于XMARC主题信息的挖掘与检索的实现,并证明其在时间和质量综合性能上的优越性。 相似文献
7.
三字歧义链自动分词方法 总被引:3,自引:0,他引:3
歧义问题是自动分词系统中要解决的主要问题之一。本文介绍一种在最大匹配法基础上,根据大量的真实语料中出现的歧义现象,把可能产生歧义切分的词进行特性分类,对每类确定一组规则进行处理 相似文献
8.
汉语自动分词研究的现状与新思维 总被引:17,自引:2,他引:15
汉语自动分词是机器翻译、文献标引、智能检索、自然语言理解与处理的基础。本文对十余年来的汉语自动分词的研究方法与成果进行了综合论述, 分析了现有分词方法的特点, 提出了把神经网络和专家系统结合起来建立集成式汉语自动分词系统的新思维。 相似文献
9.
基于反序词典的中文逆向最大匹配分词系统设计* 总被引:6,自引:0,他引:6
介绍几种常见的分词算法,在改进传统的反序词典、优化逆向最大匹配算法的基础上,设计并实现基于逆向最大匹配的中文分词系统,试验证明速度和精度都有显著提高。 相似文献
10.
11.
基于神经网络的汉语自动分词系统的设计与分析 总被引:15,自引:1,他引:14
应用神经网络进行汉语自动分词研究是中文信息处理领域的重要课题。本文从分析神经网络的一个主要模型和算法入手,阐述了基于神经网络的汉语自动分词系统的设计方法,较详细地介绍了该系统的实验结果,并给出了必要的分析。 相似文献
12.
13.
14.
Milan Grba 《Slavic & East European Information Resources》2017,18(3-4):152-164
This article surveys a sample of sources of the information about Romania available to British readers in nineteenth century British newspapers and periodicals. It traces first contacts between the Romanian lands and Britain after the union of the principalities of Wallachia and Moldavia in 1859, then after their independence from the Ottoman Empire. The article highlights an increased Romanian interest in British periodicals, which reported and reviewed Romanian literature and scholarship. The article concludes that nineteenth century British newspapers and periodicals offer a great variety and wealth of new material previously unavailable or unknown to researchers. It also states that only a portion of a large quantity of this material has been indexed and is therefore available via the bibliographic sources mentioned in the article. The author argues for the need of a new and updated British-Romanian bibliography, which can draw on new online resources offering access to thousands of new newspapers and periodical records. 相似文献
15.
《Slavic & East European Information Resources》2013,14(4):25-33
ABSTRACT The paper looks at library approval plans for material published in Slavic, East European, and Eurasian countries from the selector's point of view. Reasons why a selector would or would not want one are examined. Success with approval plans requires monitoring receipts, as well as good and ongoing communication among the selector, the acquistions department, and the vendor. A preliminary list of vendors offering approval plans for the countries of the region appears in the appendix. 相似文献
16.
为进一步提升武汉科技信息共享服务平台使用效率,本文从平台资源建设、资源应用、供需对接方式和供需特点等方面分析了武汉科技信息资源服务现状;基于需求和利用的角度,结合平台管理实践和走访用户、问卷调查等研究方法,从信息资源需求主体和平台自身建设管理两个维度,找出制约科技资源供需对接的主要因素;以市场化和制度化为创新理念,从政策创新、机制创新、市场化服务、环境营造、人才培养等方面提出平台建设由“资源集聚”向“需求导向”转变的对策建议. 相似文献
17.
《Slavic & East European Information Resources》2013,14(1-2):123-143
ABSTRACT The article examines the most important periodicals of ethnic minorities in Poland. After 1989, many ethnic groups (e.g., Germans and Romanies) were allowed to publish journals and newspapers for the first time since the end of World War II. The publications examined show the rich cultural life of the various ethnic groups as well as their current status in Poland. In addition to popular titles, some scholarly publications are also discussed. 相似文献
18.
Jon C. Giullian 《Slavic & East European Information Resources》2013,14(3):278-284
The author answers a reference question on bibliographic sources for the Ukrainian periodical press 1840–1850. Helpful publications include bibliographies, guides, and library catalogs. These potentially make mention of revolutionary developments in Hungary (such as the Twelve Points paragraph of the Demands of the Hungarian Nation in March 1848, the subsequent April Laws, and Hungary's declaration of independence in April 1949), and elsewhere in the Hapsburg Empire. 相似文献
19.
20.
Kelly Ahlfeld 《International Information and Library Review》2017,49(4):285-289
Chromebooks and the G Suite group of products, like Google Search, Gmail, and Google Docs, have rapidly expanded in American schools during the past 5 years. The impact of one-to-one Chromebook devices and the pervasive use of Google's software products in American education cannot be overstated. This article explores some of the influences of these products on research, based on the experiences of a librarian and technology coordinator at an elementary education level. The author has several suggestions for effective research with these products in mind. 相似文献