Paper
16 July 2002 Classification performance results of various medical diagnostic data sets
Author Affiliations +
Abstract
In this paper, the Bayesian Data Reduction Algorithm is applied to a collection of medical diagnostic data sets found at the University of California at Irvine's Repository of Machine Learning databases. The algorithm works by finding the best performing quantization complexity of the feature vectors, and this makes it necessary to discretize all continuous valued features. Therefore, results are given by showing the quantization of the continuous valued features that yields best performance. Further, the Bayesian Data Reduction Algorithm is also compared to a conventional linear classifier, which does not discretize any feature values. In general, the Bayesian Data reduction Algorithm is shown to outperform the linear classifier by obtaining a lower probability of error, as averaged over all data sets.
© (2002) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Robert S. Lynch Jr. and Peter K. Willett "Classification performance results of various medical diagnostic data sets", Proc. SPIE 4733, Component and Systems Diagnostics, Prognostics, and Health Management II, (16 July 2002); https://doi.org/10.1117/12.475497
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Quantization

Binary data

Medical diagnostics

Error analysis

Feature selection

Breast cancer

Data modeling

Back to Top