Outlier detection in high dimensional data is one of the hot areas of data mining. The existing outlier detection methods are based on the distance in Euclidean space. In high-dimensional data, these methods are bound to deteriorate due to the notorious "dimension disaster" which leads to distance measure cannot express the original physical meaning and the low computational efficiency. This paper improves the method of angle-based outlier factor and proposes the method of variance of angle-based outlier factor outlier in mining high dimensional. It introduces the related theories to guarantee the reliability of the method. The empirical experiments on synthetic data sets show the method is efficiency and scalable to high-dimensional data sets.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.