TY - GEN
T1 - Mining causal outliers using gaussian Bayesian networks
AU - Babbar, Sakshi
AU - Chawla, Sanjay
PY - 2012
Y1 - 2012
N2 - Outliers are often identified as data points which are ''rare'', ''isolated'', or far away from their nearest neighbours. In this paper we demonstrate that meaningful outliers, i.e., outliers which perhaps encode important or new information are those which violate causal relationships. We first build a Bayesian network which encode causal relationships between attributes and then identify those points as outliers which violate these causal relationships. Experiments on several data sets confirm that the outliers identified in this fashion are in some sense ''genuine'' as they reveal new information about the underlying data generating process.
AB - Outliers are often identified as data points which are ''rare'', ''isolated'', or far away from their nearest neighbours. In this paper we demonstrate that meaningful outliers, i.e., outliers which perhaps encode important or new information are those which violate causal relationships. We first build a Bayesian network which encode causal relationships between attributes and then identify those points as outliers which violate these causal relationships. Experiments on several data sets confirm that the outliers identified in this fashion are in some sense ''genuine'' as they reveal new information about the underlying data generating process.
KW - Bayesian networks
KW - Causality and Outliers
UR - http://www.scopus.com/inward/record.url?scp=84876931267&partnerID=8YFLogxK
U2 - 10.1109/ICTAI.2012.22
DO - 10.1109/ICTAI.2012.22
M3 - Conference contribution
AN - SCOPUS:84876931267
SN - 9780769549156
T3 - Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI
SP - 97
EP - 104
BT - Proceedings - 2012 IEEE 24th International Conference on Tools with Artificial Intelligence, ICTAI 2012
T2 - 2012 IEEE 24th International Conference on Tools with Artificial Intelligence, ICTAI 2012
Y2 - 7 November 2012 through 9 November 2012
ER -