首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到17条相似文献,搜索用时 109 毫秒
1.
学术文献引文上下文自动识别研究   总被引:2,自引:2,他引:0  
[目的/意义] 引文内容分析能够帮助揭示文献引用关系的深层语义内涵,而引文上下文识别作为引文内容分析的基础显得尤为重要。[方法/过程] 梳理已有引文上下文研究的现状,总结当前引文上下文识别的不足,在此基础上归纳引文上下文识别的5类特征,并采用文本分类和序列标注两种方法开展引文上下文自动识别实验。[结果/结论] 实验结果表明,本文提出的特征能够很好地提升引文上下文识别效果,且基于文本分类的SVM分类效果要优于基于序列标注的CRF。  相似文献   

2.
[目的/意义] 将引用内容分析与引文著录分析进行比较,旨在引起国内学者、出版商、图书馆、信息服务等相关人员和部门对引用内容研究的关注,促进引文分析理论和实践的全面发展。[方法/过程] 首先对引文著录与引用内容的相关概念进行分析,然后从引用频次的计算、引用极性的分类、引用动机的分析、引文网络中文献之间的关系、应用发展5个角度将引用内容分析与引文著录分析进行比较,得出引用内容分析相对引文著录分析的优势所在。[结果/结论] 与引文著录分析相比较,引用内容分析在科学评价和揭示科学技术发展的特征方面更合理,是引文著录分析的有力补充。  相似文献   

3.
徐琳宏  丁堃  陈娜  李冰 《情报学报》2020,39(1):25-37
基于内容的引文情感分析克服了传统基于引用频次的引用同一化问题,是引文内容分析领域一个重要的研究热点。然而引文情感分析依赖于带标注的数据集,目前大规模高质量的引文情感语料资源匮乏,严重制约了该领域的研究。因此,本文在分析引文情感表达方式的基础上提出了一套适用于引文情感表示的标注体系,并详细阐述了语料库建设的技术和方法。采用人机结合的标注策略,借助完善的引文标注系统,构建了规模较大的中文文献的引文情感语料库。统计结果显示,在中文信息处理和科技管理领域情感褒义和贬义总的引用的占比分别为22%和6%,引文情感标注kappa值达到0.852,表明该语料库能够客观地反映作者的情感倾向性,可为论文评价、引文网络分析和情感分析等相关领域的研究提供数据支撑。  相似文献   

4.
基于引用频次开展的引文分析忽略了引文对施引文献的效用差异,依此展开资源评估、学术影响力判断难免受到“无效”引用干扰。为排除“无效”引文,提升测度及评价数据质量,文章从引文效用的角度,选择引文属性、功能、对象、情感等典型特征,建立引文标注框架。测试逻辑回归与支持向量机等方式对引文自动分类的效果,择优构建“效用视角下核心引文识别模型”。其中,为探索自动化分类方式,在数据标注阶段,除人工标引,还使用大规模预训练语言模型的对话系统ChatGPT进行自动分类标注,并进行建模效果测试,以期为引文自动分类方法及应用实践提供新思路。  相似文献   

5.
参考文献引用分类标注与科技期刊和论文的评价   总被引:6,自引:0,他引:6  
董建军 《编辑学报》2006,18(6):406-409
为了规范参考文献的引用,减小引文问题对科技期刊和论文评价所造成的影响,使得以引文为基础的各种分析方法在科技期刊和论文的评价中更加合理、更加科学,提出了参考文献引用分类标注的概念.依据被引文献对论文作用的不同和引用意义的差别,将论文所引用参考文献分为正相关性引用类、负相关性引用类和平行相关性引用类,建议在论文写作中标注出分类的标志,编辑和审稿人在编审时可据此核实参考文献引用的正确性.在论文评价时分类进行检索统计,有利于进一步规范引文分析的各项指标,变笼统的引文分析为细化的据类分析评价,使引文分析评价体系更加科学、更加可靠.  相似文献   

6.
关于学术论文中的引文问题   总被引:8,自引:1,他引:7  
在学术期刊的评价中,影响因子是制约评价的因素。从影响因子的计算公式得知,引文是决定影响因子的重要因素。本文采用例证法,阐述了适宜在正文中标注参与文献的九项内容,并阐述了不必在正文中标注参考文献的五项内容以及不宜间接引用的三项内容。同时认为,参考文献不等同于引用文献,只有引用了为论题所需的内容的文献并在正文中标注,才能在文后加以著录。  相似文献   

7.
基于被引次数的引文分析无法直接揭示论文的研究内容,利用关键词或从标题、摘要和全文中抽取的主题词很难客观反映论文的被引原因。本文以碳纳米管纤维研究领域的高被引论文为研究对象进行引文内容抽取和主题识别,经人工判读验证:基于引文内容分析的高被引论文识别的核心主题能够较好地揭示高被引论文的被引原因(引用动机),而且与论文的研究内容相符合;与基于全文、基于标题和摘要的主题识别相比,在引文内容分析基础上识别的主题具有更好的主题代表性,能够有效揭示被引文献的研究内容,是对原文相关信息的重要补充。本文的实验表明基于引文内容分析的高被引论文主题识别是可行而且有效的。图4。表4。参考文献31。  相似文献   

8.
科学文献的相互引用关系是引文分析的主要依据.引文分析是以文献引用数据为基础,用以揭示其数量特征和规律的一种文献计量分析方法.作者对<大学图书情报学刊>1998年与2004年的引文数量进行了量化分析,指出了刊物在引文数量、引文文献类型、引文语种、引文主题、引文原始来源期刊的发展和变化情况,提出了值得重视的有关问题.  相似文献   

9.
关于图书馆学情报学期刊引文中的一些问题   总被引:5,自引:0,他引:5  
针对图书馆情报学期刊在引文标注、引文标号、引用内容、引用量、引文必要性五个方面存在的问题,在《中国图书馆学报》、《情报学报》等12种核心期刊中进行了抽样调查,最后根据参考文献的标注方法,对调查结果进行了简要的评述。  相似文献   

10.
学术论文中的引文失范问题刍议   总被引:1,自引:0,他引:1  
不同的引用动因和目的导致各种各样的引文失范。引文的标注形式和主体内容决定了引文的形式失范和内容失范。同时,学术道德规范的缺失又引发大量的虚假模糊引文。厘清引文的内涵与外延以及它与参考文献的联系与区别,正确对待现有学术标准和规范,并在运用中使之日臻完善;责任编辑应树立正确的引文价值观,恪守职业操行,公正、合理、专业、规范地处理引文问题;作者要掌握正确应用引文的原则与方法,增强文献情报利用的能力,避免"主观故意"的引文失范。  相似文献   

11.
Wide differences in publication and citation practices make impossible the direct comparison of raw citation counts across scientific disciplines. Recent research has studied new and traditional normalization procedures aimed at suppressing as much as possible these disproportions in citation numbers among scientific domains. Using the recently introduced IDCP (Inequality due to Differences in Citation Practices) method, this paper rigorously tests the performance of six cited-side normalization procedures based on the Thomson Reuters classification system consisting of 172 sub-fields. We use six yearly datasets from 1980 to 2004, with widely varying citation windows from the publication year to May 2011. The main findings are the following three. Firstly, as observed in previous research, within each year the shapes of sub-field citation distributions are strikingly similar. This paves the way for several normalization procedures to perform reasonably well in reducing the effect on citation inequality of differences in citation practices. Secondly, independently of the year of publication and the length of the citation window, the effect of such differences represents about 13% of total citation inequality. Thirdly, a recently introduced two-parameter normalization scheme outperforms the other normalization procedures over the entire period, reducing citation disproportions to a level very close to the minimum achievable given the data and the classification system. However, the traditional procedure of using sub-field mean citations as normalization factors yields also good results.  相似文献   

12.
为解决各引文数据库的检索方法、检索结果展示方式不一、引文著录不规范等问题,解放军医学图书馆采用 NET 框架作为系统的开发平台,采用 C/S 结构模式,在生物医学引文数据库的基础上构建生物医学文献引文集成检索整合平台,实现了统一平台一站式检索、自动去重、自动分组和便捷输出规范的引文报告等个性化功能,提高了生物医学文献查引效率。  相似文献   

13.
引文分析存在的问题及其原因探究   总被引:5,自引:2,他引:3       下载免费PDF全文
目前引文分析已成为文献计量学的主要研究内容之一,但少有对其存在的问题进行分析.本文从引文分析理论(基础理论和引用动机)的不完善,引用过程中存在的不足,引文分析方法、工具和数据库的缺陷,引文分析应用与实践(科学评价和科学交流)的局限四个方面较系统地总结引文分析存在的问题及原因.目前应该以谨慎的态度看待引文分析,对其方法和结果作客观评价,在充分了解其优缺点的基础上进行完善与发展.  相似文献   

14.
数字科研时代的引文分析-基于被引频次分析的实证研究   总被引:3,自引:0,他引:3  
从期刊被引频次的角度出发,采取实证研究的方法,选择国际权威的引文数据库Web of Science和著名的搜索引擎GoogleScholar,以《美国信息科学和技术学会杂志》为文献源进行相关分析,得出在数字科研时代引文分析有必要采取多个引文分析工具,使得引文分析能跟上时代发展步伐的结论。  相似文献   

15.
As the volume of scientific articles has grown rapidly over the last decades, evaluating their impact becomes critical for tracing valuable and significant research output. Many studies have proposed various ranking methods to estimate the prestige of academic papers using bibliometric methods. However, the weight of the links in bibliometric networks has been rarely considered for article ranking in existing literature. Such incomplete investigation in bibliometric methods could lead to biased ranking results. Therefore, a novel scientific article ranking algorithm, W-Rank, is introduced in this study proposing a weighting scheme. The scheme assigns weight to the links of citation network and authorship network by measuring citation relevance and author contribution. Combining the weighted bibliometric networks and a propagation algorithm, W-Rank is able to obtain article ranking results that are more reasonable than existing PageRank-based methods. Experiments are conducted on both arXiv hep-th and Microsoft Academic Graph datasets to verify the W-Rank and compare it with three renowned article ranking algorithms. Experimental results prove that the proposed weighting scheme assists the W-Rank in obtaining ranking results of higher accuracy and, in certain perspectives, outperforming the other algorithms.  相似文献   

16.
The adoption of e-governing practices has revolutionised the administrative machinery of governments worldwide by improving efficiency, transparency, and accountability. Researchers and administrators often aim to identify emerging research fronts and the timeline of the evolution to forecast and implement technology. In this work, we systematically investigate the trajectory of the global evolution and emerging research fronts as well as the prospects for e-governance using citation network analysis. The growth curve fitted to the number of articles published per year shows that the research activities are still in the ascendant phase. We visualise the global main path of the citation network and investigate the patterns to trace the knowledge diffusion path, major milestones, and emerging research fronts. The cluster analysis identifies the major topics of research as administration and information system management, e-governance framework design, efficiency or quality evaluation, and the application of social networks and open data leading to e-democracy. The adoption of open data and social networking for user interactions with government that leads to participatory governance are the emerging research trends. We also identify research that can have a future impact based on network parameters. The results contribute to the literature by setting the focus of future research, and assisting administrators in selecting suitable models and methodologies, and manufacturers with the development of required technical devices suitable for the upcoming phase of symbiosis.  相似文献   

17.
Scientific production is steadily growing, exhibiting 4% annual growth in publications and 1.8% annual growth in the number of references per publication, together producing a 12-year doubling period in the total supply of references, i.e. links in the science citation network. This growth has far-reaching implications for how academic knowledge is connected, accessed and evaluated. Against this background, we analyzed a citation network comprised of 837 million references produced by 32.6 million publications over the period 1965–2012, allowing for a detailed analysis of the ‘attention economy’ in science. Our results show how growth relates to ‘citation inflation’, increased connectivity in the citation network resulting from decreased levels of uncitedness, and a narrowing range of attention – as both very classic and very recent literature are being cited increasingly less. The decreasing attention to recent literature published within the last 6 years suggests that science has become stifled by a publication deluge destabilizing the balance between production and consumption. To better understand these patterns together, we developed a generative model of the citation network, featuring exponential growth, the redirection of scientific attention via publications’ reference lists, and the crowding out of old literature by the new. We validate our model against several empirical benchmarks, and then use perturbation analysis to measure the impact of shifts in citing behavior on the synthetic system's properties, thereby providing insights into the functionality of the science citation network as an infrastructure supporting the memory of science.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号