首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
基于GATE语义标注的Web信息的自动抽取   总被引:1,自引:0,他引:1  
重点研究基于语义标注样本的Web信息自动抽取的实现方法。借助自然语言处理框架GATE,首先引入领域本体对样本网页内容进行语义标注,精确定位出待抽取的语义项,并据此将样本网页解析为S DOM树。从S DOM树中抽取出语义项的特征描述,形成样本实例并采用机器学习算法归纳抽取规则,自动生成包装器。抽取过程中,通过比较网页结构的相似度,系统能够感知网页的变化,主动学习并扩展规则库。试验结果表明,由于精确定位保障了学习样本的质量,小样本学习生成的包装器能够达到较为理想的查全率和查准率。  相似文献   

2.
This study is the first investigation into the types of contents in young adult (YA) web pages in public library websites in Japan. The study reveals that YA web pages, in general, place more emphasis on providing guidance on YA services, on helping young adults with regular learning, on the use of information resources for reference services, and on improving the communication abilities for young adults, rather than on providing research assistance to adults on YA services. Furthermore, an IRIS (Information Reference Instructional Sharing) Contents Model is proposed for YA web pages, whereas an IRIR (Information Reference Instructional Research) Contents Model is presented for children’s web pages, based on the differences between the contents of YA web pages and those of children’s web pages.  相似文献   

3.
This article provides an overview of open government data. It outlines what it is, provides examples, and summarizes library engagement with these data. The professional and academic literature and other web sources were examined along with government policies and portals. Open government data is a fairly recent and evolving phenomenon that promotes government transparency and invites citizen participation and innovative reuses of public data. Libraries have been responding to this release of data in a number of ways including offering data literacy instruction and special services and programs.  相似文献   

4.
A progenitor article for this work can be found in Public Library Quarterly 27, no. 4. This study is the first to report on the types of contents in children's web pages and the characteristics of Web-OPACs for children in public library websites in Japan. This study reveals that children's web pages, in general, place more emphasis on providing guidance on library services and on the use of information resources for reference services, rather than on helping children with regular learning or providing research assistance to adults on children's services. Other findings are reported as well.  相似文献   

5.
Providing government information, services, products and transactions electronically has the potential benefit of accessibility for a wider audience, political and administrative transparency, and improved service delivery. By using e-government websites, citizens can conveniently access government information and services and gain greater opportunities to participate in democratic processes. The present study aims to evaluate National Portal of India, which provides single window access to 601 e-government portals and websites in India. A total of 1576 online services are provided by these portals responding to information needs of the citizens. Ranking of the state and union territory portals has been done based on the number of online services they provide. The paper also focuses on the digitization of documents, acts, rules and schemes of central and state government departments and their availability and accessibility through government portals.  相似文献   

6.
黄黄 《图书情报工作》2011,55(17):105-111
采用网页调查方式收集整理美国16所公共图书馆网站的志愿者网页。通过对这些网页链接点设置、整体框架结构、信息组织方式的调查及对网页具体信息内容、链接方式与内容等的分析与研究,总结出公共图书馆在志愿者网页整体建设、信息用词、志愿者表彰和其他社会组织链接的四点启示,可用于我国公共图书馆网站对志愿者网页的建设。  相似文献   

7.
Providing government information, services, products and transactions electronically has the potential benefit of accessibility for a wider audience, political and administrative transparency, and improved service delivery. By using e-government websites, citizens can conveniently access government information and services and gain greater opportunities to participate in democratic processes. The present study aims to evaluate National Portal of India, which provides single window access to 601 e-government portals and websites in India. A total of 1576 online services are provided by these portals responding to information needs of the citizens. Ranking of the state and union territory portals has been done based on the number of online services they provide. The paper also focuses on the digitization of documents, acts, rules and schemes of central and state government departments and their availability and accessibility through government portals.  相似文献   

8.
面对搜索引擎基于关键词全文检索导致检索准确度低和学科信息门户加工描述只到站点级别的问题,作者提出了将搜索引擎和学科信息门户结合构建智能学科门户搜索引擎的建议--在经过学科专家筛选的、学科信息门户目录中的高质量网站中自动收集网页,形成网页索引,利用自动标引与自动分类方法对收集到的网页进行标引和分类,最后通过分类浏览目录与主题词检索的方式,向用户提供学术资源网页的查找.文章重点介绍了智能学科门户搜索引擎的网页采集、网页自动标引与自动分类及用户接口的设计与实现,并对该搜索引擎存在的问题进行了分析和讨论.  相似文献   

9.
The emergence of standardized open data software platforms has provided a similar set of features to sustain the lifecycle of open data practices, which includes storing, managing, publishing, and visualizing data, in addition to providing an out-of-the-box solution for data portals. Accordingly, the dissemination of data portals that implement such platforms has paved the way for automation, wherein (meta)data extraction supplies the demand for quantity-oriented metrics, mainly for benchmark purposes. This has given rise to an issue regarding how to survey data portals globally, especially reducing the manual efforts, while covering a wide variety of sources that may not implement standardized solutions. Thus, this study raises two main problems: searching for standardized open data software platforms and identifying specific developed web-based software operated as data portals. This study aims to develop a method that deeply searches each web page on the internet and formalizes a machine learning classification model to improve the identification of data portals, irrespective of how these data portals implement a standardized open data software platform and comply with the open data technical guidelines. The contributions of this work have been demonstrated through a list of 1,650 open data portals generalized in a training model that makes it feasible to distinguish between a data portal (that may or may not implement a standardized platform) and an ordinary web page. The results provide new insights on how machine-readable, publicly available data are affected by artificial intelligence, with special focus on how it can be used to understand data openness worldwide.  相似文献   

10.
在学术期刊非法网站仍然存在的现状下,提出了一种应对非法网站的可行方法——使用学术期刊网址导航网。设计建设了一个学术期刊网址导航网(网址为http://www.cujc.cn),并已将其投入实际使用。投稿者可以通过该导航网查询到学术期刊的真实网址,从而避免了浏览到非法网站上的学术期刊虚假网页的可能。使用学术期刊网址导航网可以帮助投稿者远离学术期刊非法网站的侵扰,是应对非法网站的一种可行方法。  相似文献   

11.
The purpose of this research is to investigate the current state and trend of government website information cited by social science and humanities (SS&H) journal articles in China. The Chinese Social Science Citation Index (CSSCI) was used as the benchmark and the Social Science Citation Index (SSCI) journals as the reference samples. It analyzed 204,019 web citations (N = 5,063,237) found in 925,506 articles that were published in CSSCI journals during the 1998–2009 period. The findings unveil that web citations accounted for only 4.03% of the total number of citations (N = 5,063,237), and that citations of Chinese government websites constituted 6.6% of the total number of web citations (N = 204,019). The study disclosed detailed information regarding citations derived from ministries and commissions directly under the State Council websites (N = 69), government online media (N = 7), government website citation subjects (N = 21), and various types of government website information (N = 5). Although government website information has limited influence on SS&H, their impact is currently growing rapidly. In comparison with international research community, influence of government web information on Chinese social science is higher, while its influence on humanities is lower. Essentially, Chinese scholars put emphasis on citing information from authoritative central government websites or highly visible state-owned media information as supporting evidences in their articles. In general, the citation of information from Chinese government website tends to hot social issues of society. Finally, it is necessary to promote the visibility of local government websites, to develop policies and guidelines to encourage the disclosure and the diversity of data, so that there will be more citation balances between social and technological topics.  相似文献   

12.
One of the most rapidly growing professional social networks is GitHub, an online space to share code. GitHub is based on free and open-source software called Git, a version control system used in many digital projects, from library websites to government data portals to scientific research. For projects that involve developing code and collaborating with others, Git is an invaluable tool; it also creates a backup system and structured documentation. In this article, we examine version control, the particulars of Git, the burgeoning social network of GitHub, and how Git can be an archival tool.  相似文献   

13.
基于链接分析的网站评价研究   总被引:10,自引:1,他引:10       下载免费PDF全文
从排名前50位的美国商学院中,随机抽取20个,把它们主页所在网站作为研究对象, 以指向网站的网页数和网络影响因子(Web-IF)作为测定核心网站的依据。研究表明:用这两种依据来测定核心网站,所得结果基本一致;Web-IF对评价网站质量和测定核心网站具有重要价值;在计算Web-IF时应以其他网站指向被研究对象的网页数和网站在该时刻可访问到的网页数作为依据;布拉德福定律可能适用于核心网站的研究。表3。参考文献11。  相似文献   

14.
It is well documented that government agencies, at all levels, continue to have problems ensuring that government web sites follow laws related to web accessibility for people with disabilities. Although there are a number of published studies on government web accessibility that are point-in-time, there are no published studies consisting of a longitudinal analysis of state-level government web site accessibility. This paper contributes to the research literature in three ways: 1) an accessibility inspection of 25 Maryland state government homepages in 2012 which involved 150 human inspections of web pages, 2) a comparison of the results from 2012 to a similar accessibility evaluation in 2009, and 3) a discussion of the role of a web page template, which was introduced in Maryland state government shortly after the 2009 evaluation. The data from this longitudinal evaluation leads to the conclusion that web page templates do tend to result in more accessible sites within state government.  相似文献   

15.
统计了刊载H5N1型禽流感信息的网页和网站,根据布拉德福定律确定出核心网站,并对网站类型及域名进行分析,为H5N1禽流感方面研究人员提供网络信息。  相似文献   

16.
Information quality and community municipal portal use   总被引:1,自引:0,他引:1  
This paper presents an in-depth research investigation of the role information quality plays in the use of community municipal portals. These portals are a new type of website spearheaded by local governments and community agencies in response to citizen calls for a more user-friendly, comprehensive, and convenient way of accessing community-based and local government information via a single entry point. Prior empirical evidence on electronic government adoption and use shapes the study's theoretical model. The model was tested via a survey completed by 1279 respondents across five community municipal portal sites in the province of Ontario, Canada. The survey polled respondents' uptake and perceptions of these portals. Cross-validation using structural equation modeling analysis indicates that information quality plays a critical but indirect role in influencing a person's use of a community municipal portal. In addition, other end-user factors, namely perceived ease of use and compatibility, also affect usage. Importantly, the need to pay attention to the information quality of community municipal portals is raised as a means of rallying citizen response to this new type of website specifically, as well as to local government websites more generally.  相似文献   

17.
以国内7大类共275个图书情报学网站为研究样本,依据"相关网页数量"、"总入链数"、"网络影响因子"等定量指标,采用布拉德福定律方法、百分比补偿法和网络影响因子方法等信息计量方法,对我国图书情报学科的核心网站进行测定,并对不同方法所得的结果进行分析和讨论。实证研究的结果表明,网络影响因子等网络信息计量指标完全可以应用到网络信息资源评价当中。  相似文献   

18.
The currently existing webometric rankings and methods of their analysis are focused primarily on the quantitative measurement of the contents of websites and almost completely ignore the study of the user audience (web traffic). In a pilot project the traffic of ten websites of scientific organizations has been studied with the emphasis on web-traffic sources and the analysis of the traffic of pages with scientific content. It is shown that the direct visits to the site are an indicator of the regular audience of an organization website. This audience consists mainly of the organization’s staff and their immediate colleagues, while new visitors come mainly from search engines. It was revealed that the most visited pages are the ones with information about staff and laboratories, as well as news pages if they are regularly updated. It was found that there is no strong relationship between webometric rankings and website traffic. The rank correlation is moderate and traffic from external links on other websites is weak despite the fact that such links are a key webometric indicator. The results of the study can be used to optimize the structures of the websites of scientific organizations and the analysis of their user audience.  相似文献   

19.
State government websites are a main information portal for people. The primary objective of this study is to examine 50 U.S. state government websites to evaluate the status of their accessibility in comparison with federal government and randomly selected commercial websites. The results show a significant difference among the three groups (F(2, 101) = 11.81, p < 0.001) with respect to accessibility. In particular, the state and federal government websites provide more accessible service to their users than the commercial websites (p < 0.01). The most frequent barriers to accessibility found on state government websites are also listed here for web designers and developers to enable them to improve their quality of service in the future.  相似文献   

20.
语义Web门户基于本体技术,采用一系列的语义Web技术实现了语义查询、语义浏览、语义发布和语义个性化。文章重点调研了相对完善的语义Web门户,并在此基础上归纳总结了语义Web门户相关的实施方案以及它对科研活动的支持。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号