Ensemble clustering algorithm with supervised classification of clinical data for early diagnosis of coronary artery disease

Kausar Noreen*, Abdullah Azween, Samir Brahim Belhaouari, Palaniappan Sellapan, Alghamdi Bandar Saeed, Dey Nilanjan

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

39 Citations (Scopus)

Abstract

Improving the accuracy with which heart anomalies are detected is essential for clinical diagnosis, yet it is complicated by irrelevant patient details and slow processing. This work aims to select relevant clinical features that improve classification performance in distinguishing abnormal from normal patients. For this purpose, Principal Component Analysis (PCA) is applied to reduce the attribute dimension, incorporating class identifiers to extract a minimal set of attributes that accounts for the largest portion of the total variance. The approach combines supervised and unsupervised learning methods, namely Support Vector Machines (SVM) and K-means clustering, for classification by adjusting their related parameters and measures. K-means clustering groups similar data patterns into clusters, each of which is classified individually; the overall accuracy is the average of the accuracies achieved across all clusters. SVMs have good generalization ability and can classify unseen test data with a model trained at the selected parameter values. Results on the University of California, Irvine (UCI) Cleveland Heart data set outperform earlier data mining approaches, owing to the method's processing time, classification optimized by tuning the associated parameters, and selection of relevant attributes. In future, this approach can be used for multi-class classification of different medical datasets.
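
The pipeline described in the abstract (dimension reduction, clustering, one classifier per cluster, averaged accuracy) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes scikit-learn, uses plain PCA rather than the class-aware variant the abstract mentions, and runs on synthetic data from `make_classification` instead of the UCI Cleveland Heart data set. The function name `clustered_svm_accuracy` and all parameter values (`n_components`, `n_clusters`, `C`, `gamma`) are hypothetical placeholders, not values from the paper.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def clustered_svm_accuracy(X, y, n_components=5, n_clusters=2, seed=0):
    """Return (per-cluster accuracies, their unweighted mean)."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, stratify=y, random_state=seed
    )

    # Standardize, then keep the leading principal components.
    # Assumption: ordinary PCA, fitted on the training split only.
    scaler = StandardScaler().fit(X_train)
    pca = PCA(n_components=n_components).fit(scaler.transform(X_train))
    Z_train = pca.transform(scaler.transform(X_train))
    Z_test = pca.transform(scaler.transform(X_test))

    # Group similar patterns; test points go to the nearest learned centroid.
    kmeans = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    train_labels = kmeans.fit(Z_train).labels_
    test_labels = kmeans.predict(Z_test)

    accuracies = []
    for c in range(n_clusters):
        tr, te = train_labels == c, test_labels == c
        if not te.any():
            continue  # no test points fell into this cluster
        if len(np.unique(y_train[tr])) < 2:
            # Only one class present in this cluster: predict that class.
            pred = np.full(te.sum(), y_train[tr][0])
        else:
            # One SVM per cluster; C and gamma are placeholder settings.
            svm = SVC(kernel="rbf", C=1.0, gamma="scale")
            pred = svm.fit(Z_train[tr], y_train[tr]).predict(Z_test[te])
        accuracies.append(float(np.mean(pred == y_test[te])))

    # Overall accuracy as the average of the per-cluster accuracies.
    return accuracies, float(np.mean(accuracies))


# Synthetic stand-in for a clinical table (sizes chosen arbitrarily).
X, y = make_classification(
    n_samples=300, n_features=13, n_informative=6, random_state=0
)
accs, mean_acc = clustered_svm_accuracy(X, y)
print("per-cluster accuracies:", accs)
print("average accuracy:", mean_acc)
```

The unweighted mean follows the abstract's wording; weighting each cluster by its number of test points would be a reasonable alternative when cluster sizes differ widely.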

Original language: English
Pages (from-to): 78-87
Number of pages: 10
Journal: Journal of Medical Imaging and Health Informatics
Volume: 6
Issue number: 1
Publication status: Published - Feb 2016
Externally published: Yes

Keywords

  • Clustering
  • Coronary Artery Disease (CAD)
  • Dimension Reduction
  • Feature Selection
  • Principal Component Analysis (PCA)
  • Support Vector Machines (SVM)
