首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Text Mining and Subject Analysis for Fiction; or,Using Machine Learning and Information Extraction to Assign Subject Headings to Dime Novels
Authors:Matthew Short
Institution:1. University Libraries, Northern Illinois University, DeKalb, Illionis, USAmshort@niu.edu
Abstract:Abstract

This article describes multiple experiments in text mining at Northern Illinois University that were undertaken to improve the efficiency and accuracy of cataloging. It focuses narrowly on subject analysis of dime novels, a format of inexpensive fiction that was popular in the United States between 1860 and 1915. NIU holds more than 55,000 dime novels in its collections, which it is in the process of comprehensively digitizing. Classification, keyword extraction, named-entity recognition, clustering, and topic modeling are discussed as means of assigning subject headings to improve their discoverability by researchers and to increase the productivity of digitization workflows.
Keywords:Subject analysis  text mining  cataloging digital resources  cataloguing popular fiction  dime novels
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号