Paper
29 January 2007 Identification of comment-on sentences in online biomedical documents using support vector machines
Author Affiliations +
Proceedings Volume 6500, Document Recognition and Retrieval XIV; 65000O (2007) https://doi.org/10.1117/12.704423
Event: Electronic Imaging 2007, 2007, San Jose, CA, United States
Abstract
MEDLINE(R) is the premier bibliographic online database of the National Library of Medicine, containing approximately 14 million citations and abstracts from over 4,800 biomedical journals. This paper presents an automated method based on support vector machines to identify a "comment-on" list, which is a field in a MEDLINE citation denoting previously published articles commented on by a given article. For comparative study, we also introduce another method based on scoring functions that estimate the significance of each sentence in a given article. Preliminary experiments conducted on HTML-formatted online biomedical documents collected from 24 different journal titles show that the support vector machine with polynomial kernel function performs best in terms of recall and F-measure rates.
© (2007) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
In Cheol Kim, Daniel X. Le, and George R. Thoma "Identification of comment-on sentences in online biomedical documents using support vector machines", Proc. SPIE 6500, Document Recognition and Retrieval XIV, 65000O (29 January 2007); https://doi.org/10.1117/12.704423
Lens.org Logo
CITATIONS
Cited by 7 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Biomedical optics

Medicine

Databases

Pattern recognition

Binary data

Error analysis

Feature extraction

Back to Top