TY - JOUR
T1 - Long Term HbA1c Prediction Using Multi-Stage CGM Data Analysis
AU - Islam, Md Shafiqul
AU - Qaraqe, Marwa Khalid
AU - Belhaouari, Samir Brahim
AU - Petrovski, Goran
N1 - Publisher Copyright:
© 2001-2012 IEEE.
PY - 2021/7/1
Y1 - 2021/7/1
N2 - The glycated hemoglobin (HbA1c) is regarded as an essential biomarker for diabetes management. Having an elevated HbA1c level significantly increases the risk of developing diabetes-related health complications. Accurate prediction of HbA1c can greatly improve the way diabetic patients are treated and can potentially avoid related consequences. This study devises a framework to predict HbA1c levels 2-3 months in advance by using blood glucose data collected through continuous glucose monitoring (CGM) sensors and leveraging advanced feature extraction and machine learning techniques. The CGM data may often contain missing values due to sensor issues or not wearing the sensor for some period. Thus, in the paper, a novel missing data estimation method has been proposed for a single data point, multiple data points, and entire day CGM data imputation. The CGM data have been rigorously investigated, and pertinent features were created along with a multi-stage multi-class (MSMC) classification model to predict futuristic HbA1c levels. To evaluate the developed framework, a total of 150 patients' data were sourced from Sidra Medicine, Doha, Qatar, for analysis. The proposed three-staged and five-staged MSMC models predicted HbA1c levels 2-3 months in advance and obtained overall classification accuracies of 88.65% and 83.41%, respectively.
AB - The glycated hemoglobin (HbA1c) is regarded as an essential biomarker for diabetes management. Having an elevated HbA1c level significantly increases the risk of developing diabetes-related health complications. Accurate prediction of HbA1c can greatly improve the way diabetic patients are treated and can potentially avoid related consequences. This study devises a framework to predict HbA1c levels 2-3 months in advance by using blood glucose data collected through continuous glucose monitoring (CGM) sensors and leveraging advanced feature extraction and machine learning techniques. The CGM data may often contain missing values due to sensor issues or not wearing the sensor for some period. Thus, in the paper, a novel missing data estimation method has been proposed for a single data point, multiple data points, and entire day CGM data imputation. The CGM data have been rigorously investigated, and pertinent features were created along with a multi-stage multi-class (MSMC) classification model to predict futuristic HbA1c levels. To evaluate the developed framework, a total of 150 patients' data were sourced from Sidra Medicine, Doha, Qatar, for analysis. The proposed three-staged and five-staged MSMC models predicted HbA1c levels 2-3 months in advance and obtained overall classification accuracies of 88.65% and 83.41%, respectively.
KW - CGM sensor
KW - HbA1c prediction
KW - diabetes management
KW - feature extraction
KW - missing data estimation
UR - http://www.scopus.com/inward/record.url?scp=85104636023&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2021.3073974
DO - 10.1109/JSEN.2021.3073974
M3 - Article
AN - SCOPUS:85104636023
SN - 1530-437X
VL - 21
SP - 15237
EP - 15247
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
IS - 13
M1 - 9406945
ER -