Enhancing Diabetes Mellitus Prediction: Integrating Hybrid Deep Learning Model with Sampling Techniques

Authors

  • Shivanya Shomir Dutta School of Computer Science and Engineering (SCOPE), Vellore Institute of Technology, Chennai-600127, India
  • Aakash Kumar School of Computer Science and Engineering (SCOPE), Vellore Institute of Technology, Chennai-600127, India
  • Amutha S School of Computer Science and Engineering (SCOPE), Vellore Institute of Technology, Chennai-600127, India
  • R Dhanush School of Electronics Engineering (SENSE), Vellore Institute of Technology, Chennai-600127, India

DOI:

https://doi.org/10.63503/j.ijaimd.2024.8

Keywords:

SMOTE-ENN, Hybrid Deep learning model, BRFSS, Diabetes, Convolutional LSTM

Abstract

Diabetes, characterized by high blood glucose levels, is a leading cause of liver, eye, kidney, and heart diseases. This study evaluates various deep learning models, combined with machine learning classifiers, for predicting diabetes mellitus using the BRFSS dataset. The dataset's imbalance posed a challenge for binary classification, common in medical diagnostics. To address this, different sampling techniques were tested. Hybrid models combining Convolu-tional Long Short Term Memory (Conv LSTM) networks with traditional classi-fiers were also explored. The Conv LSTM model combined with Adaboost classifiers achieved the highest accuracy of 89.47% with SMOTE-ENN resampled data. These findings highlight the potential of integrating deep learning and traditional machine learning for effective diabetes prediction, aiding early diagnosis and intervention

References

Lukmanto, R. B., & Irwansyah, E. J. P. C. S. (2015). The early detection of diabetes mellitus (DM) using fuzzy hierarchical model. Procedia Computer Science, 59, 312-319. 5.

Swapna, G., Vinayakumar, R., & Soman, K. P. (2018). Diabetes detection using deep learning algorithms. ICT express, 4(4), 243-246.Refat, M. A. R., Al Amin, M., Kaushal, C., Yeasmin, M. N., & Islam, M. K. (2021, October). A comparative analysis of early stage diabetes prediction using machine learning and deep learning approach. In 2021 6th International Conference on Signal Processing, Computing and Control (ISPCC) (pp. 654-659). IEEE.

Larabi-Marie-Sainte, S., Aburahmah, L., Almohaini, R., & Saba, T. (2019). Current techniques for diabetes pre-diction: review and case study. Applied Sciences, 9(21), 4604.

Tan, Y., Chen, H., Zhang, J., Tang, R., & Liu, P. (2022). Early risk prediction of diabetes based on GA-Stacking. Applied Sciences, 12(2), 632.

Prajapati, Y. R., Hihoriya, D. G., & Verma, S. (2023, July). Early Detection and Prediction of Diabetes Using Ensemble Classifier. In 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT) (pp. 1-6). IEEE.

Ayon, S. I., & Islam, M. M. (2019). Diabetes prediction: a deep learning approach. International Journal of In-formation Engineering and Electronic Business, 13(2), 21.

Kopitar, L., Kocbek, P., Cilar, L., Sheikh, A., & Stiglic, G. (2020). Early detection of type 2 diabetes mellitus using machine learning-based prediction models. Scientific reports, 10(1), 11981.

Mahajan, S., Sarangi, P. K., Sahoo, A. K., & Rohra, M. (2023, May). Diabetes Mellitus Prediction using Supervised Machine Learning Techniques. In 2023 International Conference on Advancement in Computation & Computer Technologies (InCACCT) (pp. 587-592). IEEE.

Simaiya, S., Kaur, R., Sandhu, J. K., Alsafyani, M., Alroobaea, R., Margala, M., & Chakrabarti, P. (2022). A novel multistage ensemble approach for prediction and classification of diabetes. Frontiers in Physiology, 13, 1085240.

Teboul, A. (n.d.). Diabetes Health Indicators Dataset. Retrieved from https://www.kaggle.com/datasets/alexteboul/diabetes-health-indicators-dataset?select=diabetes_binary_health_indicators_BRFSS2015.csv

Lamari, M., Azizi, N., Hammami, N. E., Boukhamla, A., Cheriguene, S., Dendani, N., & Benzebouchi, N. E. (2021). SMOTE–ENN-based data sampling and improved dynamic ensemble selection for imbalanced medical data classification. In Advances on Smart and Soft Computing: Proceedings of ICACIn 2020 (pp. 37-49). Springer Singapore.

He, H., Bai, Y., Garcia, E. A., & Li, S. (2008, June). ADASYN: Adaptive synthetic sampling approach for imbal-anced learning. In 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence) (pp. 1322-1328). Ieee.

Prusa, J., Khoshgoftaar, T. M., Dittman, D. J., & Napolitano, A. (2015, August). Using random undersampling to alleviate class imbalance on tweet sentiment data. In 2015 IEEE international conference on information reuse and integration (pp. 197-202). IEEE.

Shi, X., Chen, Z., Wang, H., Yeung, D. Y., Wong, W. K., & Woo, W. C. (2015). Convolutional LSTM network: A machine learning approach for precipitation nowcasting. Advances in neural information processing systems, 28.

Guo, G., Wang, H., Bell, D., Bi, Y., & Greer, K. (2003). KNN model-based approach in classification. In On The Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE: OTM Confederated International Conferences, CoopIS, DOA, and ODBASE 2003, Catania, Sicily, Italy, November 3-7, 2003. Proceedings (pp. 986-996). Springer Berlin Heidelberg.

Webb, G. I., Keogh, E., & Miikkulainen, R. (2010). Naïve Bayes. Encyclopedia of machine learning, 15(1), 713-714.

Li, W., Yin, Y., Quan, X., & Zhang, H. (2019). Gene expression value prediction based on XGBoost algo-rithm. Frontiers in genetics, 10, 484931.

Ju, Y., Sun, G., Chen, Q., Zhang, M., Zhu, H., & Rehman, M. U. (2019). A model combining convolutional neural network and LightGBM algorithm for ultra-short-term wind power forecasting. Ieee Access, 7, 28309-28318.

Wang, R. (2012). AdaBoost for feature selection, classification and its relation with SVM, a review. Physics Procedia, 25, 800-807.

Downloads

Published

2024-07-31

How to Cite

Shivanya Shomir Dutta, Aakash Kumar, Amutha S, & R Dhanush. (2024). Enhancing Diabetes Mellitus Prediction: Integrating Hybrid Deep Learning Model with Sampling Techniques. International Journal on Engineering Artificial Intelligence Management, Decision Support, and Policies, 1(1), 29–40. https://doi.org/10.63503/j.ijaimd.2024.8

Issue

Section

Research Articles