TY - GEN
T1 - Identification of Potential Risk Factors of Diabetes for the Qatari Population
AU - Musleh, Saleh
AU - Alam, Tanvir
AU - Bouzerdoum, Abdesselam
AU - Belhaouari, Samir B.
AU - Baali, Hamza
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/2
Y1 - 2020/2
N2 - Large-scale cohorts are established in different regions of the world to identify the complex interaction of genetic, environmental, and lifestyle-related factors that may contribute to chronic diseases including diabetes. Qatar Biobank (QBB) is the largest repository for cohort study specific to the Qatari population. There are few studies based on the QBB cohort, which highlighted multiple risk factors responsible for diabetes in the Qatari population. However, no comprehensive research has been done using machine learning techniques to identify key factors that may contribute to diabetes specific to the Qatari population. We developed several machine-learning models using QBB data to classify diabetic patients from the non-diabetic participants forming the control group for this study. From the roster of several hundred measurements, we identified 25 potential risk factors that might be influential in distinguishing diabetic patients from nondiabetic participants. From the identified risk factors, we ranked HbAlc, Glucose, and LDL-Cholesterol as the most influential risk factors. Using these risk factors, we also developed several machine-learning models to classify diabetic subjects from healthy subjects. Overall, the classifiers achieved 0.85 F1-score in classifying diabetic subjects from non-diabetic subjects. Further investigation will pave the way for the inclusion of the identified risk factors into the standard diabetes screening process of the Qatari population.
AB - Large-scale cohorts are established in different regions of the world to identify the complex interaction of genetic, environmental, and lifestyle-related factors that may contribute to chronic diseases including diabetes. Qatar Biobank (QBB) is the largest repository for cohort study specific to the Qatari population. There are few studies based on the QBB cohort, which highlighted multiple risk factors responsible for diabetes in the Qatari population. However, no comprehensive research has been done using machine learning techniques to identify key factors that may contribute to diabetes specific to the Qatari population. We developed several machine-learning models using QBB data to classify diabetic patients from the non-diabetic participants forming the control group for this study. From the roster of several hundred measurements, we identified 25 potential risk factors that might be influential in distinguishing diabetic patients from nondiabetic participants. From the identified risk factors, we ranked HbAlc, Glucose, and LDL-Cholesterol as the most influential risk factors. Using these risk factors, we also developed several machine-learning models to classify diabetic subjects from healthy subjects. Overall, the classifiers achieved 0.85 F1-score in classifying diabetic subjects from non-diabetic subjects. Further investigation will pave the way for the inclusion of the identified risk factors into the standard diabetes screening process of the Qatari population.
KW - Diabetes
KW - Glucose
KW - HbA1c
KW - LDL-Cholesterol
KW - Qatar Biobank
KW - machine-learning
UR - http://www.scopus.com/inward/record.url?scp=85085480268&partnerID=8YFLogxK
U2 - 10.1109/ICIoT48696.2020.9089545
DO - 10.1109/ICIoT48696.2020.9089545
M3 - Conference contribution
AN - SCOPUS:85085480268
T3 - 2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies, ICIoT 2020
SP - 243
EP - 246
BT - 2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies, ICIoT 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies, ICIoT 2020
Y2 - 2 February 2020 through 5 February 2020
ER -