Distance Boxplot for Unsupervised Outlier Detection

Atiq Ur Rehman, Samir Brahim Belhaouari

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Anomaly detection is one of the key step in data preprocessing. Moreover, it is also vital in decision processing with applications such as intrusion detection and spam filtering. In this paper, an unsupervised anomaly detection method based on boxplot approach is proposed. In the proposed approach, a distance metric is integrated with the classical boxplot which results in a powerful hybrid approach capable of detecting both, either a single lying outlier or a cluster of outliers. Integration of distance metric takes into account the neighborhood estimation along with the density estimation. The concept realizes the compactness of data along with the position of underlying data points. This makes it suitable even for detecting small clusters of outlying data points which can be ignored as inlier data points because of their existence in a close proximity and high density. The usefulness of the proposed approach is demonstrated by some complex synthetic datasets.

Original languageEnglish
Title of host publicationICSCA 2022 - 2022 11th International Conference on Software and Computer Applications
PublisherAssociation for Computing Machinery
Pages74-77
Number of pages4
ISBN (Electronic)9781450385770
DOIs
Publication statusPublished - 24 Feb 2022
Event11th International Conference on Software and Computer Applications, ICSCA 2022 - Virtual, Online, Malaysia
Duration: 24 Feb 202226 Feb 2022

Publication series

NameACM International Conference Proceeding Series

Conference

Conference11th International Conference on Software and Computer Applications, ICSCA 2022
Country/TerritoryMalaysia
CityVirtual, Online
Period24/02/2226/02/22

Keywords

  • Anomaly detection
  • Boxplot
  • Euclidean distance
  • Outlier detection
  • Unsupervised learning

Fingerprint

Dive into the research topics of 'Distance Boxplot for Unsupervised Outlier Detection'. Together they form a unique fingerprint.

Cite this