TY - JOUR
T1 - Q-GEV Based Novel Trainable Clustering Scheme for Reducing Complexity of Data Clustering
AU - Elaziz, Mohamed Abd
AU - Zaid, Esraa Osama Abo
AU - Al-qaness, Mohammed A.A.
AU - Ali, Amjad
AU - Bashir, Ali Kashif
AU - Ewees, Ahmed A.
AU - Al-Otaibi, Yasser D.
AU - Al-Fuqaha, Ala
N1 - Publisher Copyright:
© 2025 John Wiley & Sons Ltd.
PY - 2025/4
Y1 - 2025/4
N2 - This paper presents a new data clustering technique aimed at enhancing the performance of the trainable path-cost algorithm and reducing the computational complexity of data clustering models. The proposed method facilitates the discovery of natural groupings and behaviours, which is crucial for effective coordination in complex environments. It identifies natural groupings within a set of features and detects the best clusters with similar behaviour in the data, overcoming the limitations of traditional state-of-the-art methods. The algorithm utilises a density peak clustering method to determine cluster centers and then extracts features from paths passing through these peak points (centers). These features are used to train the support vector machine (SVM) to predict the labels of other points. The proposed algorithm is enhanced using two key concepts: first, it employs Q-Generalised Extreme Value (Q-GEV) under power normalisation instead of traditional generalised extreme value distributions, thereby increasing modelling flexibility; second, it utilises the random vector functional link (RVFL) network rather than the SVM, which helps avoid overfitting and improves label prediction accuracy. The effectiveness of the proposed clustering algorithm is evaluated through various experiments, including those on UCI benchmark datasets and real-world data, demonstrating significant improvements across multiple performance metrics, including F1 measure, Jaccard index, purity, and accuracy, highlighting its capability in accurately identifying paths between similar clusters. Its average F1 measure, Jaccard index, purity, and accuracy is measured 76.87%, 56.29%, 80.29%, and 79.64%, respectively.
AB - This paper presents a new data clustering technique aimed at enhancing the performance of the trainable path-cost algorithm and reducing the computational complexity of data clustering models. The proposed method facilitates the discovery of natural groupings and behaviours, which is crucial for effective coordination in complex environments. It identifies natural groupings within a set of features and detects the best clusters with similar behaviour in the data, overcoming the limitations of traditional state-of-the-art methods. The algorithm utilises a density peak clustering method to determine cluster centers and then extracts features from paths passing through these peak points (centers). These features are used to train the support vector machine (SVM) to predict the labels of other points. The proposed algorithm is enhanced using two key concepts: first, it employs Q-Generalised Extreme Value (Q-GEV) under power normalisation instead of traditional generalised extreme value distributions, thereby increasing modelling flexibility; second, it utilises the random vector functional link (RVFL) network rather than the SVM, which helps avoid overfitting and improves label prediction accuracy. The effectiveness of the proposed clustering algorithm is evaluated through various experiments, including those on UCI benchmark datasets and real-world data, demonstrating significant improvements across multiple performance metrics, including F1 measure, Jaccard index, purity, and accuracy, highlighting its capability in accurately identifying paths between similar clusters. Its average F1 measure, Jaccard index, purity, and accuracy is measured 76.87%, 56.29%, 80.29%, and 79.64%, respectively.
KW - artificial intelligence
KW - continual learning
KW - data clustering
KW - density peak clustering
KW - generalised extreme value
KW - learning model
KW - machine learning
UR - http://www.scopus.com/inward/record.url?scp=85219632647&partnerID=8YFLogxK
U2 - 10.1111/exsy.70011
DO - 10.1111/exsy.70011
M3 - Article
AN - SCOPUS:85219632647
SN - 0266-4720
VL - 42
JO - Expert Systems
JF - Expert Systems
IS - 4
M1 - e70011
ER -