A Comprehensive Ensemble Approach Using Blending and Stacking for Credit Card Fraud Detection

Authors

  • Bharti Chugh CSE, KIET Group of Institutions Delhi-NCR , Ghaziabad , India
  • Preeti Garg CSE, KIET Group of Institutions Delhi-NCR , Ghaziabad , India
  • Karnika Dwivedi CSE, KIET Group of Institutions Delhi-NCR , Ghaziabad , India

Keywords:

Fraud detection, Ensemble Learning, Balancing, Feature Selection, Voting, Credit card

Abstract

Detection of credit card fraud transactions is a severe problem, which requires analyzing large volumes of transaction data to identify fraud patterns. It re-quires finding, which transactions are fraudulent out of millions of daily transactions. As the amount of data is increasing, it is now difficult for an indi-vidual to detect meaningful patterns from transaction data, often character-ized by many samples, many dimensions, and online updates. As a result, there is a need for the best possible approach using machine learning that auto-mates the process of identifying fraudulent patterns from large volumes of data. Therefore, this study proposed a comprehensive ensemble approach us-ing Blending (Voting Classifier) and stacking for credit card fraud detection. As the dataset is imbalanced, the proposed method balanced the dataset resampling techniques. Working with selected features instead of all the features reduces the risk of over-fitting, improves accuracy, and decreases the training time. Afterward, three base classifiers with chosen features, an ensemble voting classifier and a Stacking Classifier have been developed. The computational results indicate that the suggested stacking ensemble is the best, and its Random Forest (RF) classifier has also the best performance among other base classifiers. The ensemble stacking classifiers lead to 82.5% recall,85% F1-score and 82.5 % AUC which has superiority over other ensemble-based methods.

References

Mallidi, Manoj Kumar Reddy, and Yeshwanth Zagabathuni. "Analysis of Credit Card Fraud Detection using Machine Learning models on balanced and imbalanced datasets." International Journal of Emerging Trends in Engineering Research 9.7 (2021)

Tiwari, Pooja, et al. "Credit Card Fraud Detection using Machine Learning: A Study." arXiv preprint arXiv:2108.10005 (2021)

Xie, Yalong, et al. "A Heterogeneous Ensemble Learning Model Based on Data Distribution for Credit Card Fraud Detection". Wireless Communications and Mobile Computing 2021 (2021)

Dal Pozzolo, A., Caelen, O., Le Borgne, Y.-A., Waterschoot, S. and Bontempi, G. “Learned lessons in credit card fraud detection from a practitioner perspective”. Expert Syst. Appl., 41, 4915–4928 (2014)

Díez-Pastor, J.F., Rodríguez, J.J., García-Osorio, C.I. and Kuncheva, L.I. “Diversity techniques improve the performance of the best imbalance learning ensembles”. Inf. Sci., 325, 98–117 (2015)

Kültür, Y. and Çaglayan, M.U. “Hybrid approaches for detecting credit card fraud”. Expert. Syst., 34, 1–13 (2017)

Fadaei Noghani, F., Moattar, M.H. “Ensemble classification and extended feature selection for credit card fraud detection”. J. AI Data Min. 5(2), 235–243 (2017)

Kuncheva, L.I. and Rodríguez, J.J. “A weighted voting framework for classifiers ensembles”. Knowl. Inf. Syst., 38, 259–275 (2014)

Akila, S., Reddy, U.S. “Cost-sensitive risk induced bayesian inference bagging (ribib) for credit card fraud detec-tion”. J. Comput. Sci. 27, 247–254 (2018)

Kim, E., Lee, J.; Shin, H., Yang, H., Cho, S.; Nam, S.k., Song, Y., Yoon, J.a., Kim, J.i., “Champion-challenger analysis for credit card fraud detection: Hybrid ensemble and deep learning”. Expert Syst. Appl. 128, 214–224 (2019)

Kalra M. and Patni J.. “Playing Doom with Deep Reinforcement Learning”. International Journal of Computer Applications, vol. 1, pp.14-20, (2019)

Singh, Ajeet, and Anurag Jain. "Adaptive credit card fraud detection techniques based on feature selection method." Advances in computer communication and computational sciences. Springer, Singapore, 167-178 (2019)

Bian, Y., Cheng, M., Yang, C., Yuan, Y., Li, Q., Zhao, J.L. and Liang, L. “Financial Fraud Detection: A New Ensemble Learning Approach for Imbalanced Data”. In Proc. PACIS 2016. Chiayi, Taiwan, June 27, PACIS, Taiwan (2016)

Wang, G., Sun, J., Ma, J., Xu, K. and Gu, J. “Sentiment classification: The contribution of ensemble learning”. Decis. Support. Syst., 57, 77–93 (2014)

Krawczyk, B. and Wozniak, M. “Untrained weighted classifier combination with embedded ensemble pruning”. Neurocomputing, 196, 14–22 (2016)

M. Zareapoor, P. Shamsolmoali, and others, “Application of credit card fraud detection: Based on bagging ensemble classifier”. Procedia Comput. Sci., vol. 48, pp. 679– 685, (2015). https://doi.org/10.1016/j.procs.2015.04.201

F. H. Chen, D.-J. Chi, and J.-Y. Zhu, “Application of Random Forest, Rough Set Theory, Decision Tree and Neural Network to Detect Financial Statement Fraud--Taking Corporate Governance into Consideration,” in International Conference on Intelligent Computing, pp. 221–234 (2014). https://doi.org/10.1007/978-3-319-09333-8_24

S. Patil, V. Nemade, and P. K. Soni, “Predictive Modelling For Credit Card Fraud Detection Using Data Analytics”. Procedia Comput. Sci., vol. 132, pp. 385–395, (2018). https://doi.org/10.1016/j.procs.2018.05.199

UCI Machine Learning Repository: Statlog (German Credit Data) Data Set Available at: https://archive.ics.uci.edu/dataset/144/statlog+german+credit+data

Correa Bahnsen, A., Aouada, D., Stojanovic, A. and Ottersten, B. “Feature engineering strategies for credit card fraud detection”. Expert Syst. Appl., 51, 134–142, (2016)

Singh, Ajeet, and Anurag Jain. "Adaptive credit card fraud detection techniques based on feature selection method." Advances in computer communication and computational sciences. Springer, Singapore, 167-178 (2019)

Sohony, Ishan, Rameshwar Pratap, and Ullas Nambiar. "Ensemble learning for credit card fraud detection." Proceedings of the ACM India Joint International Conference on Data Science and Management of Data. 2018

Patil, Suraj, Varsha Nemade, and Piyush Kumar Soni. "Predictive modelling for credit card fraud detection using data analytics." Procedia computer science, 132 385-395, (2018)

Alfaiz, Noor Saleh, and Suliman Mohamed Fati. "Enhanced Credit Card Fraud Detection Model Using Machine Learning." Electronics 11.4 :662, (2022)

Chugh, Bharti, and Nitin Malik. "Machine Learning Classifiers for Detecting Credit Card Fraudulent Transactions." Information and Communication Technology for Competitive Strategies (ICTCS 2021) ICT: Applications and Social Interfaces. Singapore: Springer Nature Singapore. 223-231, (2022)

Zou, Yao, and Changchun Gao. "Extreme Learning Machine Enhanced Gradient Boosting for Credit Scor-ing." Algorithms 15.5: 149, (2022)

Aung, Maung Hein, et al. "Random Forest Classifier for Detecting Credit Card Fraud based on Performance Metrics." 2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE). IEEE, 2020

He, H., Garcia, E.A. “Learning from imbalanced data”. IEEE Trans. Knowl. Data Eng, 21, 1263–1284, (2009)

Scikit-Learn-Contrib. Imbalanced-Learn. Available online: https://github.com/scikit-learn-contrib/imbalanced-learn (accessed on 22 January 2022)

Saheed, Y.K., Baba, U.A. and Raji, M.A. (2022), "Big Data Analytics for Credit Card Fraud Detection Using Super-vised Machine Learning Models", Sood, K., Balusamy, B., Grima, S. and Marano, P. (Ed.) Big Data Analytics in the Insurance Market (Emerald Studies in Finance, Insurance, and Risk Management), Emerald Publishing Limited, Leeds, pp. 31-56. https://doi.org/10.1108/978-1-80262-637-720221003

Estelami, H. and Liu, K. (2024), "Content analysis of American consumers’ credit card fraud complaints filed with the Consumer Financial Protection Bureau", Journal of Financial Crime, Vol. 31 No. 3, pp. 618-628. https://doi.org/10.1108/JFC-03-2023-0070

Chaudhari, Anagha. "Credit Card Fraud Detection Using Machine Learning and Predictive Models: A Comparative Study." Hybrid Intelligent Systems: 21st International Conference on Hybrid Intelligent Systems (HIS 2021), December 14-16, 2021. Vol. 420. Springer Nature, 2022

Taha, Altyeb Altaher, and Sharaf Jameel Malebary. "An intelligent approach to credit card fraud detection using an optimized light gradient boosting machine." IEEE Access 8: 25579-25587, (2020).

Downloads

Published

2024-07-31

How to Cite

Bharti Chugh, Preeti Garg, & Karnika Dwivedi. (2024). A Comprehensive Ensemble Approach Using Blending and Stacking for Credit Card Fraud Detection. International Journal on Smart & Sustainable Intelligent Computing, 1(1), 19–33. Retrieved from https://submissions.adroidjournals.com/index.php/ijssic/article/view/11

Issue

Section

Research Articles