首页 | 本学科首页   官方微博 | 高级检索  
     检索      

A sustainable development OCR system in CADAL application
作者姓名:黄晨  赵继海  胡晓
作者单位:Zhejiang University Libraries,Zhejiang University,Zhejiang University Libraries,Zhejiang University,Graduate School of Library and Information Science,University of Illinois at Urbana-Champaign,IL 61801,USA Hangzhou 310027,China,Hangzhou 310027,China
基金项目:Project supported by China-US Million Books Digital Library Project
摘    要:INTRODUCTION China-US Million Books Digital Library Project is a research and development project proposed by Chinese and American scientists, aiming at creating a universally free access digital library containing over one million scanned books, using optical character recognition (OCR) whenever possible to support full text searching (http://www.cadal.cn). It is one of the key projects of the Ministry of Education of China for the Tenth Five Year Plan, and called China-America D…

关 键 词:OCR系统  光学性质识别  数字图书馆  百万图书计划
收稿时间:2005-08-05
修稿时间:2005-09-10

A sustainable development OCR system in CADAL application
Huang Chen,Zhao Ji-hai,Hu Xiao.A sustainable development OCR system in CADAL application[J].Journal of Zhejiang University Science,2005,6(11):1312-1317.
Authors:Huang Chen  Zhao Ji-hai  Hu Xiao
Institution:(1) Zhejiang University Libraries, Zhejiang University, 310027 Hangzhou, China;(2) Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign, 61801, IL, USA
Abstract:This paper briefly introduces the main ideas of a sustainable development OCR system based on open architecture techniques and then describes the construction of an optical character recognition (OCR) center built on computer clusters, for the purpose of dynamically improving the recognition precision of the digitized texts of a million volumes of books produced by the China-US Million Books Digital Library (CADAL) Project. The practice of this center will provide helpful reference for other digital library projects.
Keywords:Sustainable Development  Digital Library  optical character recognition (OCR)  China-US Million Books Digital Library (CADAL)
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号