An Intelligent and Precise Method Used for Detecting Gestational Diabetes in the Early Stages


  • Safa Hameed Department of Computer Science, Faculty of Engineering and Science, Bayan University, Erbil, Iraq
  • Alaa Badeea Ali Department of Computer Science, Faculty of Engineering and Science, Bayan University, Erbil, Iraq



Classifier, Feature Selection, Gestational Diabetes, Machine Learning, Naïve Bayes


This paper suggests a Naive Bayes classifier technique for identifying and categorizing gestational diabetes mellitus (GDM), GDM is a kind of diabetes mellitus that affects a small proportion of pregnant women but recovers to normal once the baby is born. The Pima Indians Diabetes Dataset was chosen for a comprehensive analysis of this critical and pervasive health disease because it contains 768 patient characteristics acquired from a machine learning source at the University of California, Irvine. The goal of the study is to apply smart technology to categorize diseases with high accuracy and precision, practically free of conceivable and potential faults, to provide satisfying findings. The approach is based on eight major characteristics that are present in the operations that are required to establish a precise and reliable categorization system. This approach involves training and testing on real data, as well as for deciding whether or not to construct a categorization model. The work was compared to earlier work and had a 96% accuracy rating.


S. Gupta, H. K. Verma and D. Bhardwaj. “Classification of diabetes using naïve Bayes and support vector machine as a technique”. Operations Management and Systems Engineering, pp. 365-376, 2020.

S. Prasanth, K. Banujan and K. Btgs. “Hyper Parameter Tuned Ensemble Approach for Gestational Diabetes Prediction”. International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT), IEEE. pp. 18-23, 2021.

N. Sneha and T. Gangil. “Analysis of diabetes mellitus for early prediction using optimal features selection”. Journal of Big Data, vol. 6, p. 13, 2019.

A. Sarwar, M. Ali, J. Manhas and V. Sharma. “Diagnosis of diabetes Type-II using hybrid machine learning based ensemble model”. International Journal of Information Technology, vol. 12, pp. 419-428, 2020.

N. Pradhana, G. Rania, V. Singh, V. S. Dhaka and R. C. Pooniab. “Diabetes prediction using artificial neural network”. In: Deep Learning Techniques for Biomedical and Health Informatics, ScienceDirect, pp. 327-339, 2020.

M. D. Okpor. “Prognostic diagnosis of gestational diabetes utilizing fuzzy classifier”. International Journal of Computer Science and Network Security, vol. 15, no. 6, pp. 44-48, 2015.

E. G. Filho, P. R. Pinheiro, M. C. D. Pinheiro, L. C. Nunes and L. B. G. Gom. “Heterogeneous methodology to support the early diagnosis of gestational diabetes. IEEE Access, vol. 99, p. 1, 2019.

M. Marozas, S. Sosunkevič, M. Francaitė-Daugėlienė, D. Veličkienė and A. Lukoševičius. “Algorithm for diabetes risk evaluation from past gestational diabetes data”. Technology and Health Care, vol, 26, no. 4, pp. 637-648, 2018.

Y. Resti, E. S. Kresnawati, N. R. Dewi, D. A. Zayanti and N. Eliyati. Diagnosis of diabetes mellitus in women of reproductive age using the prediction methods of naive bayes, discriminant analysis, and logistic regression. Science and Technology Indonesia, vol. 6, no. 2, pp. 96-104, 2021.

M.A. Islam and N. Jahan. Prediction of Onset Diabetes using Machine Learning Techniques. International Journal of Computer Applications, vol. 180, no. 5, pp. 7-11, 2017.

R. Saxena, S. K. Sharma and M. Gupta. “Analysis of machine learning algorithms in diabetes mellitus prediction”. Journal of Physics: Conference Series, vol. 1921, p. 012073, 2021.

N. Jayanthi, V. B. Babu and S. Rao. “Data mining techniques for CPD of diabetes”. International Journal of Engineering Computational Research and Technology, 2014.

K. Lakhwani, S. Bhargava, K. K. Hiran, M. M. Bundele and D. Somwanshi. “Prediction of the Onset of Diabetes Using Artificial Neural Network and Pima Indians Diabetes Dataset”. 5th IEEE International Conference on Recent Advances and Innovations in Engineering, pp. 1-6, 2020.

R. Zolfaghar. “Diagnosis of diabetes in female population of pima indian heritage with ensemble of BP neural network and SVM”. International Journal of Computational Engineering and Management, vol. 15, no. 4, pp. 115-121, 2012.

A. Kaushik, A. Sehgal, S. Vora, V. Palan and S. Patil. “Presaging The Signs Of Diabetes Using Machine Learning Algorithms”. 12th International Conference on Computing Communication and Networking Technologies, 2021.

R. Sanakal and S. T. Jayakumari. “Prognosis of diabetes using data mining approach-Fuzzy C Means clustering and support vector machine”. International Journal of Computer Trends and Technology, vol. 11, no. 2, pp. 94-98, 2014.

H. Naz and S. Ahuja. “Deep learning approach for diabetes prediction using PIMA Indian dataset”. Journal of Diabetes and Metabolic Disorders, vol. 19, no. 1, pp. 391-403, 2020.

V. Rawat and S. Suryakant. “A classification system for diabetic patients with machine learning techniques”. International Journal of Mathematical, Engineering and Management Sciences, vol. 4, no. 3, pp. 729-744, 2019.

P. Kaur and R. Kaur. “Comparative analysis of classification techniques for diagnosis of diabetes”. In: Jain, L., Virvou, M., Piuri, V. and Balas, V. (eds.), Advances in Bioinformatics, Multimedia, and Electronics Circuits and Signals Advances in Intelligent Systems and Computing. Vol. 1064. Springer, Singapore, 2020.

L. Jonk. “Chronic Disease Prevention a Vital Investment”. World Health Organization, Geneva, Switzerland, 2005.

L. Moon. “Prevention of Cardiovascular Disease, Diabetes and Chronic Kidney Disease: Targeting Risk Factors”. Vol. 118. AIHW, 2009. Available from: cfm. [Last accessed on 2022 Mar 09].

A. Rajivkannan and K. S. Aparna. “A survey on diabetes prediction using machine learning techniques”. International Journal of Research in Engineering, Science and Management, vol. 4, no. 11, pp. 51-54, 2021.

D. Lavanya and K. U. Rani. “Performance evaluation of decision tree classifiers on medical datasets”. International Journal of Computer Applications, vol. 26, no. 4, pp. 1-4, 2011.

R. Raja, I. Mukherjee and B. K. Sarkar. “A machine learning-based prediction model for preterm birth in Rural India”. Journal of Healthcare Engineering, vol. 2021, p. 6665573, 2021.

A. Saleha and F. Nasari. “Implementation of equal-width interval discretization in naive bayes method for increasing accuracy of students' majors prediction”. Lontar Komputer Jurnal Ilmiah Teknologi Informasi. Vol. 9, no. 2, pp. 104-113, 2018.

R. Dash, R. L. Paramguru and R. Dash. “Comparative analysis of supervised and unsupervised discretization techniques”. International Journal of Advances in Science and Technology, vol. 2, no. 3, pp. 29-37, 2011.

J. Dougherty, R. Kohavi and M. Sahami. “Supervised and Unsupervised Discretization of Continuous Features”. In: Proceedings of the Twelfth International Conference on International Conference on Machine Learning (ICML’95). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1995, pp. 194-202.

J. Han and M. Kambar. “Data Mining: Concepts and Techniques”. 2nd ed. Morgan Kaufmann Publisher, Burlington, Massachusetts, 2006.