首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In this paper, we focus on the problem of automatically generating amplified scientific paper’s abstract which represents the most influential aspects of scientific paper. The influential aspects can be illustrated by the target scientific paper’s abstract and citation sentences discussing the target paper, which are provided in papers citing the target paper. In this paper, we extract representative sentences through data-weighted reconstruction approach(DWR) by jointly leveraging target scientific paper’s abstract and citation sentences’ content and structure. In our study, we make two-folded contributions.Firstly, sentence’s weight was learned by exploiting regularization for ranking on heterogeneous bibliographic network. Specially, Sentences-similar-Sentences relationship was identified by language modeling-based approach and added to the bibliographic network. Secondly, a data-weighted reconstruction objective function is optimized to select the most representative sentences which reconstructs the original sentence set with minimum error. In this process, sentences’ weight plays a critical role. Experimental evaluation over real dataset confirms the effectiveness of our approach.  相似文献   

2.
Citation analysis does not tell the whole story about the innovativeness of scientific papers. Works by prominent authors tend to receive disproportionately many citations, while publications by less well-known researchers covering the same topics may not attract as much attention. In this paper we address the shortcomings of traditional scientometric approaches by proposing a novel method that utilizes a classifier for predicting publication years based on latent topic distributions. We then calculate real-number innovation scores used to identify potential breakthrough papers and turnaround years. The proposed approach can complement existing citation-based measures of article importance and author contribution analysis; it opens as well novel research direction for time-based, innovation-centered research scientific output evaluation. In our experiments, we focus on two corpora of research papers published over several decades at two well-established conferences: The World Wide Web Conference (WWW) and the International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), containing around 3500 documents in total. We indicate significant years and demonstrate examples of highly-ranked papers, thus providing a novel insight on the evolution of the two conferences. Finally, we compare our results to citation analysis and discuss how our approach may complement traditional scientometrics.  相似文献   

3.
Research evaluation, which is an increasingly pressing issue, invariably relies on citation counts. In this contribution we highlight two concerns that the research community needs to pay attention to. One, in the world of search engine facilitated research, factors such as ease of Web discovery, ease of access, and content relevance, rather than quality, influence what gets read and cited. Two, research evaluation based on citation counts works against many types of high-quality works. We also elaborate on the implications of these points by examining a recent nationwide evaluation of researchers performed in Italy. We focus on our discipline (computer science), but we believe that our observations have relevance for a broad audience.  相似文献   

4.
引用认同分析:引文分析的新视角   总被引:1,自引:0,他引:1  
作者的引用认同是该作者所引用过的所有作者的集合。与传统的引文分析方法不同,引用认同分析以作者(引用者和被引者)为研究对象。本文选取图书情报界3位著名学者作为研究对象,进行实证研究,探讨引用认同的分布规律以及作者的引用风格。
Abstract:
An author's citation identity is a set of authors that he cites. Different from traditional citation analysis,citation identity analysis takes authors ( citers and citees) as the object of study. This paper selects 3 well-known scholars in library and information science circles as the object of study to conduct an empirical analysis with special focus on the distribution law of citation identity and the authors' citing styles.  相似文献   

5.
Previous studies have confirmed that citation mention and location reveal different contributions of the cited articles, and that both are significant in scientific research evaluation. However, traditional citation count prediction only focuses on predicting citation frequency. In this paper, we propose a novel fine-grained citation count prediction task (FGCCP), which aims to predict in-text citation count from each structural function of a paper separately. Specifically, we treated this task as a “sequence to sequence” issue and a multi-task learning job, in which both the inputs and the outputs are based on the sequence pattern of citations from different structural functions. To fulfill FGCCP, we proposed a transformer-based model (i.e. MTAT) in which a novel among-attention mechanism is employed. Based on an empirical study of full-text documents from PubMed Central Open Access Subset, our model achieves satisfactory prediction accuracy, and surpasses common machine learning and deep learning models on FGCCP. Moreover, we also discuss the potential role of the among-attention mechanism and the reason why our proposed model outperforms state-of-the-art strategies. FGCCP may provide more detailed decision-making evidence and evaluation basis for researchers in scientific research evaluation. In addition, MTAT is a general model which can be easily deployed in other multi-task learning jobs.  相似文献   

6.
Mining linkage information from the citation graph has been shown to be effective in identifying important literatures. However, the question of how to utilize linkage information from the citation graph to facilitate literature retrieval still remains largely unanswered. In this paper, given the context of biomedical literature retrieval, we first conduct a case study in order to find out whether applying PageRank and HITS algorithms directly to the citation graph is the best way of utilizing citation linkage information for improving biomedical literature retrieval. Second, we propose a probabilistic combination framework for integrating citation information into the content-based information retrieval weighting model. Based on the observations of the case study, we present two strategies for modeling the linkage information contained in the citation graph. The proposed framework provides a theoretical support for the combination of content and linkage information. Under this framework, exhaustive parameter tuning can be avoided. Extensive experiments on three TREC Genomics collections demonstrate the advantages and effectiveness of our proposed methods.  相似文献   

7.
《Research Policy》2019,48(7):1855-1865
Quantitative research evaluation requires measures that are transparent, relatively simple, and free of disciplinary and temporal bias. We document and provide a solution to a hitherto unaddressed temporal bias – citation inflation – which arises from the basic fact that scientific publication is steadily growing at roughly 4% per year. Moreover, because the total production of citations grows by a factor of 2 every 12 years, this means that the real value of a citation depends on when it was produced. Consequently, failing to convert nominal citation values into real citation values produces significant mis-measurement of scientific impact. To address this problem, we develop a citation deflator method, outline the steps to generalize and implement it using the Web of Science portal, and analyze a large set of researchers from biology and physics to demonstrate how two common evaluation metrics – total citations and h-index – can differ by a remarkable amount depending on whether the underlying citation counts are deflated or not. In particular, our results show that the scientific impact of prior generations is likely to be significantly underestimated when citations are not deflated, often by 100% or more of the nominal value. Thus, our study points to the need for a systemic overhaul of the counting methods used evaluating citation impact – especially in the case of researchers, journals, and institutions – which can span several decades and thus several doubling periods.  相似文献   

8.
何星星  武夷山 《情报杂志》2012,31(8):98-102
传统期刊论文评价工作关注的是论文内部特征和引用情况,从新的视角提出以文献的利用数据(包括网页点击量、浏览量、下载量)及调整指标(点击下载率、下载引用率)来综合评价一篇文章的表现力,并利用《PLoS Biology》与F1000系统数据做了实证分析,证明了上述指标的可行性,其表现也优于被引这一单一指标.  相似文献   

9.
人才培养是高校的核心任务。高校的各项工作都要紧密围绕这个任务,努力提升在人才培养中的贡献度。毕业生就业状况作为衡量专业人才培养质量的重要指标历来受到学校和各个院系的高度重视。近年来,在学校招生毕业处、学工处、宣传部和团委等相关部门的大力支持下,湖北经济学院经济学系连续三年开展了毕业生跟踪调查,本文拟从毕业生跟踪调查的视角浅谈学生工作如何提升在人才培养中的贡献度。  相似文献   

10.
This paper looks at how citations are perceived among scientists. Based on a questionnaire survey it traces the repertoire of views and experiences about citations that could be found among Norwegian scientists that had published highly cited papers. Their views circle around three issues: the relation between the quality (or importance or significance) of a paper and its citation history; the importance of visibility and how different sorts of factors play a role in determining citation in general and high citation in particular; and the fairness (or lack of fairness) of the system. Taken together, the respondents’ answers and comments offer an informal (and fragmented) sociology of citations and their role in the world of science. In the final section we discuss the relevance of our findings in respect to the increasing use of citation indicators in science policy and research evaluations.  相似文献   

11.
郑继来  郑德俊  周露 《情报杂志》2012,31(8):74-78,97
引用认同是引文研究的新视角,是从引用者本身出发,包含引用认同和被引网络图两个方面.以22位普赖斯奖获得者(见表1)的2895篇论文、53405条引文为研究对象,借助Histcite、Bibexcel、CitespaceⅡ等科学计量和信息可视化工具,了解他们的学术社会网络关系,揭示了国际科学计量学领域的关键文献、权威期刊和关键词情况,为国内相关学者了解本领域的国际研宄现状提供参考.  相似文献   

12.
我国医学情报研究专业水平的现状分析   总被引:3,自引:0,他引:3  
兰小筠  裴新宇 《情报科学》2000,18(11):1054-1056
通过对1989-1998年《医学情报工作》发表的327篇医学情报研究方面的论文进行统计分析,从论文的年代分布、主题分布、著名分布及引文分析等角度研究了我国医学情报研究的现状、存在的问题及对策。  相似文献   

13.
[目的/意义] 在信息检索、科技论文评价和知识结构演化方面,引文分析都起着至关重要的作用。随着格式化全文数据库的出现,引文分析迈入了4.0时代——全文引文分析阶段。但是,目前还没有中文的格式化全文数据库,这极大地制约了全文引文分析在我国科技文献中的研究和应用。[方法/过程] 在本文中我们提出建立高效的中文全文引文分析依赖的数据集和检索平台的方法,主要包括:1)提出了基于规则和SVM分类方法的论文元数据和引用提取方法;2)提出基于Spark平台的实现高效引文内容分析标准化数据集生成方法;3)提出建立引用内容的科技文献检索平台。[结果/结论] 引文内容分析标准化数据集的建立将全面提升全文引文分析在我国科技领域中的研究效能,提高科技文献查找精度。  相似文献   

14.
马仁杰  路思 《现代情报》2014,34(10):50-56
本文从论文年代、来源期刊、基金项目、作者总体情况、核心作者、作者所在系统及高产机构、作者合作度、引文分析等方面进行文献计量分析,并运用关键词词频分析法对我国OAIS领域现阶段研究重点与热点进行剖析,以期对推动我国OAIS理论的进一步发展有所帮助.  相似文献   

15.
两种农业学报引文分析   总被引:6,自引:0,他引:6  
唐圣琴 《情报科学》1998,16(5):444-448
本文采用文献计量学方法,对两种农业学报(《西北农业学报》和《西南农业学报》)中刊载论文的引文情况进行统计分析。结果表明:两种农业学报的论文著者引用文献均以期刊为主,占总引文量的65%以上;引文语种主要是中文;论文的篇均引文数较低,自引率较高;引文的时间跨度较大,中、外文期刊的半衰期分别为5.1年和11.67年。  相似文献   

16.
The Web is revolutionizing the entire scholarly communication process and changing the way that researchers exchange information. In this paper, we analyze two views of information production and use in computer-related research based on citation analysis of PDF and Postcript formatted publications on the Web using autonomous citation indexing (ACI), and a parallel citation analysis of the journal literature indexed by the Institute for Scientific Information (ISI) in SCISEARCH. Our goal is to establish a baseline profile of computer science “literature” as it appears in the published journals and as it appears on the publicly available Web. From this starting point, we hope to identify additional research areas dealing with information dissemination and citation practices in computer science and the utility of autonomous citation indexing on the Web as an adjunct to commercial indexing  相似文献   

17.
Research evaluating models of scientific productivity require coherent metrics that quantify various key relations among papers as revealed by patterns of citation. This paper focuses on the various conceptual problems inherent in measuring the degree to which papers tend to cite other papers written by authors of the same nationality. We suggest that measures can be given a degree of assurance of coherence by being based on mathematical models describing the citation process. A number of such models are developed.  相似文献   

18.
本文利用2000-2008年scientometrics期刊上所刊载的论文在2000-2008年的引文数据为研究对象,在被引次数的基础上同时加入施引期刊的学术质量指标和引证时差权重,计算每篇论文的引文加权值,通过被引次数和引文加权值的比较得出引文加权对论文的评价更为合理,同时以2000-2008年在scientometrics发文量最多的10名作者为研究对象,探讨引文加权用于作者评价的可行性。  相似文献   

19.
Numbers of publications and citation ratings have recently been used as measures of scientific growth. The present paper discusses a number of presumed weaknesses of these measures. First, a distinction is made among scientific activity, scientific productivity, and scientific progress. Then it is suggested that the above measures might depend on the particular field of science, on the speed whereby research front information becomes archival, on the phenomena of wrong papers and of ‘also ran’ papers, on the geographical differences in communication patterns, on whether we want to measure activity, productivity, or progress, and on the temporal variations in scientific communication patterns. Though some examples are given, the quantitative substantiation of the proposed effects must await further research.  相似文献   

20.
We present a study exploring the connection between social networks and collaborative process. We focus on exploring academics' network position and its effect on their collaborative networks. In this paper, we discuss two types of networks of collaboration—(i) citation; and, (ii) co authorship. We explore the effects of social networks on these two types of collaborative process. By defining network position in this way, we develop a social network that uses the academics as nodes within the network instead of each published paper. We obtained the collaboration data through archival records (i.e. Web of Science) and examined the interactions among different actors from the archival records for determining the existence and strength of relations between actors.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号