Abstract :
Diabetes mellitus is a prevalent metabolic disorder globally. Its primary etiologies encompass socioeconomic determinants, behavioral risk factors, and underlying comorbidities. Numerous epidemiological studies have investigated various diabetes phenotypes, impacting both sexes across the entire age spectrum. This study utilizes a dataset containing clinical profiles of 1,000 subjects assessed on multiple biometric and sociodemographic variables. The objective is to classify diabetes into type 1, type 2, and prediabetes using an array of deep learning and machine learning algorithms. Currently, artificial intelligence-driven diagnostic methods represent a state-of-the-art approach for disease stratification. This research evaluates the performance of six classification algorithms for determining glycemic status: random forest (RF), support vector machine (SVM), extreme gradient boosting (XGBoost), convolutional neural network (CNN), recurrent neural network (RNN), and long short-term memory (LSTM) network. Results demonstrate that the XGBoost classifier attained the highest predictive accuracy of 91% with a training duration of 20 seconds, surpassing the other models. These findings underscore the potential of advanced computational algorithms for precise diabetes phenotyping and risk assessment, offering significant implications for disease management and public health interventions.
Keywords :
Artificial Intelligence Models, Classification, Detection, diagnosis, Multiclass Diabetes.References :
- Z. Woldaregay et al., “Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes,” Artif. Intell. Med., vol. 98, no. August 2018, pp. 109–134, 2019, doi: 10.1016/j.artmed.2019.07.007.
- M. Carracher, P. H. Marathe, and K. L. Close, International Diabetes Federation 2017, vol. 10, no. 5. 2018. doi: 10.1111/1753-0407.12644.
- B. L. and B. R. Qahtan M. Yas, A. A. Zadain*, B. B. Zaidan, “Towards on Develop a Framework for the Evaluation and Benchmarking of Skin Detectors Based on Arti¯cial Intelligent Models using Multi-Criteria Decision-Making Techniques Qahtan,” Int. J. Pattern Recognit. Arti¯cial Intell., vol. 31, no. 2, p. 1759002, 2016, doi: 10.1142/S0218001417590029.
- M. Yas, B. N. Adday, and A. S. Abed, “Evaluation Multi Diabetes Mellitus Symptoms by Integrated Fuzzy-based MCDM Approach,” Turkish J. Comput. Math. Educ., vol. 12, no. 13, pp. 4069–4082, 2021.
- Williams et al., “Global and regional estimates and projections of diabetes-related health expenditure: Results from the International Diabetes Federation Diabetes Atlas, 9th edition,” Diabetes Res. Clin. Pract., vol. 162, 2020, doi: 10.1016/j.diabres.2020.108072.
- A. Zaidan, B. B. Zaidan, M. A. Alsalem, O. S. Albahri, A. S. Albahri, and M. Y. Qahtan, Multi-agent learning neural network and Bayesian model for real-time IoT skin detectors: a new evaluation and benchmarking methodology, vol. 32, no. 12. Springer London, 2020. doi: 10.1007/s00521-019-04325-3.
- Care and S. S. Suppl, “2. Diagnosis and Classification of Diabetes: Standards of Care in Diabetes—2025,” Diabetes Care, vol. 48, no. January, pp. S27–S49, 2025, doi: 10.2337/dc25-S002.
- Villar et al., “Association between fetal abdominal growth trajectories, maternal metabolite signatures early in pregnancy, and childhood growth and adiposity: prospective observational multinational INTERBIO-21st fetal study,” Lancet Diabetes Endocrinol., vol. 10, no. 10, pp. 710–719, 2022, doi: 10.1016/S2213-8587(22)00215-7.
- Bellou, L. Belbasis, I. Tzoulaki, and E. Evangelou, “Risk factors for type 2 diabetes mellitus: An exposure-wide umbrella review of meta-analyses,” PLoS One, vol. 13, no. 3, pp. 1–27, 2018, doi: 10.1371/journal.pone.0194127.
- Wu et al., “Worldwide estimates of incidence of type 2 diabetes in children and adolescents in 2021,” Diabetes Res. Clin. Pract., vol. 185, pp. 0–28, 2022, doi: 10.1016/j.diabres.2022.109785.
- Contreras and J. Vehi, “Artificial intelligence for diabetes management and decision support: Literature review,” J. Med. Internet Res., vol. 20, no. 5, pp. 1–21, 2018, doi: 10.2196/10775.
- A. Salman, A. Kalakech, and A. Steiti, “Random Forest Algorithm Overview,” Babylonian J. Mach. Learn., vol. 2024, pp. 69–79, 2024, doi: 10.58496/bjml/2024/007.
- Ü. Ağbulut, A. E. Gürel, and Y. Biçen, “Prediction of daily global solar radiation using different machine learning algorithms: Evaluation and comparison,” Sustain. Energy Rev., vol. 135, p. 110114, 2021, doi: 10.1016/j.rser.2020.110114.
- Y. Liew, N. Hameed, and J. Clos, “An investigation of XGBoost-based algorithm for breast cancer classification,” Mach. Learn. with Appl., vol. 6, no. September, p. 100154, 2021, doi: 10.1016/j.mlwa.2021.100154.
- Ayat et al., “NOVEL DIABETES CLASSIFICATION APPROACH BASED ON CNN-LSTM : ENHANCED PERFORMANCE AND ACCURACY,” vol. 25, no. 1, 2024, doi: 10.29354/diag/183633.
- Al-Bander, Q. M. Yas, H. Mahdi, and R. K. H. S. Al-Hamd, “Benchmarking of deep learning algorithms for skin cancer detection based on a hybrid framework of entropy and VIKOR techniques,” Turkish J. Electr. Eng. Comput. Sci., vol. 29, no. 8, pp. 2634–2648, 2021, doi: 10.3906/elk-2103-65.
- Dénes-Fazakas et al., “Physical Activity Detection for Diabetes Mellitus Patients Using Recurrent Neural Networks,” Sensors, vol. 24, no. 8, pp. 1–18, 2024, doi: 10.3390/s24082412.
- O. Eboka et al., “Journal of Future Artificial Intelligence Resolving Data Imbalance Using a Bi-Directional Long-Short Term Memory for Enhanced Diabetes Mellitus Detection,” 2025.
- Yas and Y. Kadthem, “Rainfall rate prediction using recurrent neural network with long short-term memory algorithm : Iraq case study Rainfall rate prediction using recurrent neural network with long short-term memory algorithm : Iraq case study Qahtan M . Yas * Younis Kadthem,” Int. J. Comput. Appl. Technol., vol. 74, no. 1–2, pp. 125–135, 2024, doi: 10.1504/IJCAT.2024.141352.
- Ellahham, N. Ellahham, and M. C. E. Simsekler, “Application of Artificial Intelligence in the Health Care Safety Context: Opportunities and Challenges,” Am. J. Med. Qual., vol. 35, no. 4, pp. 341–348, 2020, doi: 10.1177/1062860619878515.
- Thaiyalnayaki, “Classification of diabetes using deep learning and svm techniques,” Int. J. Curr. Res. Rev., vol. 13, no. 1, pp. 146–149, 2021, doi: 10.31782/IJCRR.2021.13127.
- M. Islam, M. J. Rahman, D. Chandra Roy, and M. Maniruzzaman, “Automated detection and classification of diabetes disease based on Bangladesh demographic and health survey data, 2011 using machine learning approach,” Diabetes Metab. Syndr. Clin. Res. Rev., vol. 14, no. 3, pp. 217–219, 2020, doi: 10.1016/j.dsx.2020.03.004.
- Kumari, D. Kumar, and M. Mittal, “An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier,” Int. J. Cogn. Comput. Eng., vol. 2, no. November 2020, pp. 40–46, 2021, doi: 10.1016/j.ijcce.2021.01.001.
- Gollapalli et al., “A novel stacking ensemble for detecting three types of diabetes mellitus using a Saudi Arabian dataset: Pre-diabetes, T1DM, and T2DM,” Comput. Biol. Med., vol. 147, no. April, p. 105757, 2022, doi: 10.1016/j.compbiomed.2022.105757.
- Y. Shams, Z. Tarek, and A. M. Elshewey, “A novel RFE-GRU model for diabetes classification using PIMA Indian dataset,” Sci. Rep., vol. 15, no. 1, pp. 1–22, 2025, doi: 10.1038/s41598-024-82420-9.
- “Multiclass Diabetes Dataset.” [Online]. Available: https://www.kaggle.com/datasets/yasserhessein/multiclass-diabetes-dataset
- Dharmarathne, T. N. Jayasinghe, M. Bogahawaththa, D. P. P. Meddage, and U. Rathnayake, “A novel machine learning approach for diagnosing diabetes with a self-explainable interface,” Healthc. Anal., vol. 5, no. December 2023, p. 100301, 2024, doi: 10.1016/j.health.2024.100301.
- Alazzawi, Q. M. Yas, and B. Rahmatullah, “A Comprehensive Review of Software Development Life Cycle methodologies : Pros , Cons , and Future Directions,” Iraqi J. Comput. Sci. Math., vol. 4, no. 4, pp. 173–190, 2023.
- M. Yas, A. A. Zaidan, and B. B. Zaidan, “Comprehensive insights into evaluation and benchmarking of real-time skin detectors: Review, open issues & challenges, and recommended solutions,” Meas. J. Int. Meas. Confed., vol. 114, pp. 243–260, 2018, doi: 10.1016/j.measurement.2017.09.027.
- Muawanah, U. Muzayanah, M. G. R. Pandin, M. D. S. Alam, and J. P. N. Trisnaningtyas, “Stress and Coping Strategies of Madrasah’s Teachers on Applying Distance Learning During COVID-19 Pandemic in Indonesia,” Qubahan Acad. J., vol. 3, no. 4, pp. 206–218, 2023, doi: 10.48161/Issn.2709-8206.
- M. Mijwil and M. Aljanabi, “A Comparative Analysis of Machine Learning Algorithms for Classification of Diabetes Utilizing Confusion Matrix Analysis,” Baghdad Sci. J., vol. 21, no. 5, pp. 1712–1728, 2024, doi: 10.21123/BSJ.2023.9010.
- Valero-Carreras, J. Alcaraz, and M. Landete, “Comparing two SVM models through different metrics based on the confusion matrix,” Comput. Oper. Res., vol. 152, no. December 2022, p. 106131, 2023, doi: 10.1016/j.cor.2022.106131.
- O. Nyangaresi, N. K. T. El-Omari, and J. N. Nyakina, “Efficient Feature Selection and ML Algorithm for Accurate Diagnostics,” J. Comput. Sci. Res., vol. 4, no. 1, pp. 10–19, 2022, doi: 10.30564/jcsr.v4i1.3852.
- Markoulidakis, I. Rallis, I. Georgoulas, G. Kopsiaftis, A. Doulamis, and N. Doulamis, “Multiclass Confusion Matrix Reduction Method and Its Application on Net Promoter Score Classification Problem,” Technologies, vol. 9, no. 4, 2021, doi: 10.3390/technologies9040081.
- Helmud, E. Helmud, F. Fitriyani, and P. Romadiana, “Classification Comparison Performance of Supervised Machine Learning Random Forest and Decision Tree Algorithms Using Confusion Matrix,” J. Sisfokom (Sistem Inf. dan Komputer), vol. 13, no. 1, pp. 92–97, 2024, doi: 10.32736/sisfokom.v13i1.1985.
- Theissler, M. Thomas, M. Burch, and F. Gerschner, “ConfusionVis: Comparative evaluation and selection of multi-class classifiers based on confusion matrices,” Knowledge-Based Syst., vol. 247, p. 108651, 2022, doi: 10.1016/j.knosys.2022.108651.

