TY - JOUR
T1 - Multivariate fault detection and classification using interval principal component analysis
AU - Basha, Nour
AU - Nounou, Mohamed
AU - Nounou, Hazem
N1 - Publisher Copyright:
© 2018 Elsevier B.V.
PY - 2018/7
Y1 - 2018/7
N2 - Principal component analysis (PCA) is a linear data analysis tool that aims to reduce the dimensionality of a dataset, while retaining most of the variation found in it. It transforms the variables of a dataset into a new set, called the principal components, using linear combinations of the original variables. PCA is a powerful statistical technique used in research for fault detection, classification and feature extraction. Interval principal component analysis (IPCA) is an extension to PCA designed to apply PCA to large datasets using interval data generated from single-valued samples. In this paper, three IPCA methods are compared: centers IPCA, midpoint-radii IPCA, and symbolic covariance IPCA, and methods for fault detection and classification using interval data are described. Fault detection and classification applications are respectively carried out through two examples, one using synthetic and the other using real data, and the results are compared to those of the classical PCA. The results show that IPCA methods have a higher detection rate than classical PCA, for the same false alarm rate. Moreover, IPCA methods are capable of differentiating the type of fault to a high degree of accuracy, unlike classical PCA. Interval centers were capable of detecting changes in mean, while interval radii were capable of detecting changes in variance. Furthermore, for data classification, the results show that MRIPCA had a higher classification precision than other IPCA methods and classical PCA.
AB - Principal component analysis (PCA) is a linear data analysis tool that aims to reduce the dimensionality of a dataset, while retaining most of the variation found in it. It transforms the variables of a dataset into a new set, called the principal components, using linear combinations of the original variables. PCA is a powerful statistical technique used in research for fault detection, classification and feature extraction. Interval principal component analysis (IPCA) is an extension to PCA designed to apply PCA to large datasets using interval data generated from single-valued samples. In this paper, three IPCA methods are compared: centers IPCA, midpoint-radii IPCA, and symbolic covariance IPCA, and methods for fault detection and classification using interval data are described. Fault detection and classification applications are respectively carried out through two examples, one using synthetic and the other using real data, and the results are compared to those of the classical PCA. The results show that IPCA methods have a higher detection rate than classical PCA, for the same false alarm rate. Moreover, IPCA methods are capable of differentiating the type of fault to a high degree of accuracy, unlike classical PCA. Interval centers were capable of detecting changes in mean, while interval radii were capable of detecting changes in variance. Furthermore, for data classification, the results show that MRIPCA had a higher classification precision than other IPCA methods and classical PCA.
KW - Classification
KW - Fault detection
KW - Interval data
KW - Midpoint-radii
KW - Principal component analysis
KW - Symbolic covariance
UR - http://www.scopus.com/inward/record.url?scp=85046665124&partnerID=8YFLogxK
U2 - 10.1016/j.jocs.2018.04.017
DO - 10.1016/j.jocs.2018.04.017
M3 - Article
AN - SCOPUS:85046665124
SN - 1877-7503
VL - 27
SP - 1
EP - 9
JO - Journal of Computational Science
JF - Journal of Computational Science
ER -