TY - GEN
T1 - Network anomaly detection using a commute distance based approach
AU - Khoa, Nguyen Lu Dang
AU - Babaie, Tahereh
AU - Chawla, Sanjay
AU - Zaidi, Zainab
PY - 2010
Y1 - 2010
N2 - We propose the use of commute distance, a random walk metric, to discover anomalies in network traffic data. The commute distance based anomaly detection approach has several advantages over Principal Component Analysis (PCA), which is the method of choice for this task: (i) It generalizes both distance and density based anomaly detection techniques while PCA is primarily distance-based (ii) It is agnostic about the underlying data distribution, while PCA is based on the assumption that data follows a Gaussian distribution and (iii) It is more robust compared to PCA, i.e., a perturbation of the underlying data or changes in parameters used will have a less significant effect on the output of it than PCA. Experiments and analysis on simulated and real datasets are used to validate our claims.
AB - We propose the use of commute distance, a random walk metric, to discover anomalies in network traffic data. The commute distance based anomaly detection approach has several advantages over Principal Component Analysis (PCA), which is the method of choice for this task: (i) It generalizes both distance and density based anomaly detection techniques while PCA is primarily distance-based (ii) It is agnostic about the underlying data distribution, while PCA is based on the assumption that data follows a Gaussian distribution and (iii) It is more robust compared to PCA, i.e., a perturbation of the underlying data or changes in parameters used will have a less significant effect on the output of it than PCA. Experiments and analysis on simulated and real datasets are used to validate our claims.
KW - Commute distance based approach
KW - Density-based approach
KW - Distance-based approach
KW - Network anomaly detection
KW - Principal component analysis
UR - http://www.scopus.com/inward/record.url?scp=79951760070&partnerID=8YFLogxK
U2 - 10.1109/ICDMW.2010.90
DO - 10.1109/ICDMW.2010.90
M3 - Conference contribution
AN - SCOPUS:79951760070
SN - 9780769542577
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 943
EP - 950
BT - Proceedings - 10th IEEE International Conference on Data Mining Workshops, ICDMW 2010
T2 - 10th IEEE International Conference on Data Mining Workshops, ICDMW 2010
Y2 - 14 December 2010 through 17 December 2010
ER -