A New Local Distance-Based Outlier Detection Approach for Scattered Real-World Data

Computer Science – Learning

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

15 LaTeX pages, 7 figures, 2 tables, 1 algorithm, 2 theorems

Scientific paper

Detecting outliers which are grossly different from or inconsistent with the remaining dataset is a major challenge in real-world KDD applications. Existing outlier detection methods are ineffective on scattered real-world datasets due to implicit data patterns and parameter setting issues. We define a novel "Local Distance-based Outlier Factor" (LDOF) to measure the {outlier-ness} of objects in scattered datasets which addresses these issues. LDOF uses the relative location of an object to its neighbours to determine the degree to which the object deviates from its neighbourhood. Properties of LDOF are theoretically analysed including LDOF's lower bound and its false-detection probability, as well as parameter settings. In order to facilitate parameter settings in real-world applications, we employ a top-n technique in our outlier detection approach, where only the objects with the highest LDOF values are regarded as outliers. Compared to conventional approaches (such as top-n KNN and top-n LOF), our method top-n LDOF is more effective at detecting outliers in scattered data. It is also easier to set parameters, since its performance is relatively stable over a large range of parameter values, as illustrated by experimental results on both real-world and synthetic datasets.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

A New Local Distance-Based Outlier Detection Approach for Scattered Real-World Data does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with A New Local Distance-Based Outlier Detection Approach for Scattered Real-World Data, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A New Local Distance-Based Outlier Detection Approach for Scattered Real-World Data will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-648097

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.