Finding related sentence pairs in MEDLINE |
| |
Authors: | Larry H Smith W John Wilbur |
| |
Institution: | (1) Computational Biology Branch, National Center for Biotechnology Information, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA |
| |
Abstract: | We explore the feasibility of automatically identifying sentences in different MEDLINE abstracts that are related in meaning.
We compared traditional vector space models with machine learning methods for detecting relatedness, and found that machine
learning was superior. The Huber method, a variant of Support Vector Machines which minimizes the modified Huber loss function,
achieves 73% precision when the score cutoff is set high enough to identify about one related sentence per abstract on average.
We illustrate how an abstract viewed in PubMed might be modified to present the related sentences found in other abstracts
by this automatic procedure. |
| |
Keywords: | |
本文献已被 PubMed SpringerLink 等数据库收录! |
|