On integrated clustering and outlier detection

Lionel Ott, Linsey Pang, Fabio Ramos, Sanjay Chawla

Research output: Contribution to journalConference articlepeer-review

32 Citations (Scopus)

Abstract

We model the joint clustering and outlier detection problem using an extension of the facility location formulation. The advantages of combining clustering and outlier selection include: (i) the resulting clusters tend to be compact and semantically coherent (ii) the clusters are more robust against data perturbations and (iii) the outliers are contextualised by the clusters and more interpretable. We provide a practical subgradient-based algorithm for the problem and also study the theoretical properties of algorithm in terms of approximation and convergence. Extensive evaluation on synthetic and real data sets attest to both the quality and scalability of our proposed method.

Original languageEnglish
Pages (from-to)1359-1367
Number of pages9
JournalAdvances in Neural Information Processing Systems
Volume2
Issue numberJanuary
Publication statusPublished - 2014
Externally publishedYes
Event28th Annual Conference on Neural Information Processing Systems 2014, NIPS 2014 - Montreal, Canada
Duration: 8 Dec 201413 Dec 2014

Fingerprint

Dive into the research topics of 'On integrated clustering and outlier detection'. Together they form a unique fingerprint.

Cite this