Abstract
We propose a measure, spatial local outlier measure (SLOM), which captures the local behaviour of datum in their spatial neighbourhood. With the help of SLOM, we are able to discern local spatial outliers that are usually missed by global techniques, like "three standard deviations away from the mean". Furthermore, the measure takes into account the local stability around a data point and suppresses the reporting of outliers in highly unstable areas, where data are too heterogeneous and the notion of outliers is not meaningful. We prove several properties of SLOM and report experiments on synthetic and real data sets that show that our approach is novel and scalable to large datasets.
Original language | English |
---|---|
Pages (from-to) | 412-429 |
Number of pages | 18 |
Journal | Knowledge and Information Systems |
Volume | 9 |
Issue number | 4 |
DOIs | |
Publication status | Published - Apr 2006 |
Externally published | Yes |
Keywords
- Complexity
- Oscillating parameter
- R-trees index
- Spatial local outlier
- Spatial neighbourhood