Fraud Identification in Online Financial Transactions Combining SMOTE and Ensemble Classifiers

Authors

Volume: 16 | Issue: 2 | Pages: 33323-33333 | April 2026 | https://doi.org/10.48084/etasr.16615

Abstract

The increasing reliance on digital payment systems has intensified the exposure of financial institutions to credit card fraud, particularly under conditions of extreme class imbalance where fraudulent transactions account for less than 0.2% of total records. This study presents SMOVO, a Synthetic Minority Oversampling Technique (SMOTE)-based Voting Ensemble. It is a fraud detection framework designed to address this imbalance through the integration of three complementary components: Analysis of Variance (ANOVA) F-test-based feature selection, SMOTE-based data resampling, and a heterogeneous soft-voting ensemble comprising Random Forest (RF), XGBoost, LightGBM, and AdaBoost. Evaluated on the European Credit Card Fraud Dataset, SMOVO attained a recall of 0.962, F1-score of 0.864, Area Under the Curve (AUC) of 0.982, Matthews Correlation Coefficient (MCC) of 0.868, and Geometric mean (G-mean) of 0.980. In comparison with generative oversampling approaches such as ESMOTE-GAN, SMOVO exhibits more stable training behavior, reduced computational overhead, and improved model interpretability. These results indicate that SMOVO provides an effective and practically deployable solution for fraud detection in highly imbalanced financial transaction environments.

Keywords:

fraud detection, SMOTE, ensemble learning, voting classifier, imbalanced data, machine learning

Downloads

Download data is not yet available.

References

M. Barroso and J. Laborda, "Digital transformation and the emergence of the Fintech sector: Systematic literature review," Digital Business, vol. 2, no. 2, Jan. 2022, Art. no. 100028. DOI: https://doi.org/10.1016/j.digbus.2022.100028

H. Chitimira, "Overview analysis of the regulation of insider trading in Zimbabwe during the corona virus pandemic," Journal of Financial Crime, vol. 30, no. 6, pp. 1499–1516, July 2023. DOI: https://doi.org/10.1108/JFC-04-2023-0089

T. Widiyaningtyas, D. D. Prasetya, and H. W. Herwanto, "Time Loss Function-based Collaborative Filtering in Movie Recommender System," International Journal of Intelligent Engineering and Systems, vol. 16, no. 6, pp. 1021–1030, Oct. 2023. DOI: https://doi.org/10.22266/ijies2023.1231.84

P. Sood, C. Sharma, S. Nijjer, and S. Sakhuja, "Review the role of artificial intelligence in detecting and preventing financial fraud using natural language processing," International Journal of System Assurance Engineering and Management, vol. 14, no. 6, pp. 2120–2135, Dec. 2023. DOI: https://doi.org/10.1007/s13198-023-02043-7

M. A. K. Achakzai and J. Peng, "Detecting financial statement fraud using dynamic ensemble machine learning," International Review of Financial Analysis, vol. 89, Oct. 2023, Art. no. 102827. DOI: https://doi.org/10.1016/j.irfa.2023.102827

M. A. Alrasheedi, "Enhancing Fraud Detection in Credit Card Transactions: A Comparative Study of Machine Learning Models," Computational Economics, Aug. 2025. DOI: https://doi.org/10.1007/s10614-025-11071-3

J. Nicholls, A. Kuppa, and N.-A. Le-Khac, "Financial Cybercrime: A Comprehensive Survey of Deep Learning Approaches to Tackle the Evolving Financial Crime Landscape," IEEE Access, vol. 9, pp. 163965–163986, 2021. DOI: https://doi.org/10.1109/ACCESS.2021.3134076

E. Oztemel and M. Isik, "A Systematic Review of Intelligent Systems and Analytic Applications in Credit Card Fraud Detection," Applied Sciences, vol. 15, no. 3, Jan. 2025, Art. no. 1356. DOI: https://doi.org/10.3390/app15031356

J. T. Hancock, R. A. Bauder, H. Wang, and T. M. Khoshgoftaar, "Explainable machine learning models for Medicare fraud detection," Journal of Big Data, vol. 10, no. 1, Oct. 2023, Art. no. 154. DOI: https://doi.org/10.1186/s40537-023-00821-5

F. K. Alarfaj, I. Malik, H. U. Khan, N. Almusallam, M. Ramzan, and M. Ahmed, "Credit Card Fraud Detection Using State-of-the-Art Machine Learning and Deep Learning Algorithms," IEEE Access, vol. 10, pp. 39700–39715, 2022. DOI: https://doi.org/10.1109/ACCESS.2022.3166891

P. Gupta, A. Varshney, M. R. Khan, R. Ahmed, M. Shuaib, and S. Alam, "Unbalanced Credit Card Fraud Detection Data: A Machine Learning-Oriented Comparative Study of Balancing Techniques," Procedia Computer Science, vol. 218, pp. 2575–2584, Jan. 2023. DOI: https://doi.org/10.1016/j.procs.2023.01.231

S. Sendari, I. A. E. Zaeni, D. C. Lestari, and H. P. Hariyadi, "Opinion Analysis for Emotional Classification on Emoji Tweets using the Naïve Bayes Algorithm," Knowledge Engineering and Data Science, vol. 3, no. 1, pp. 50–59, June 2020. DOI: https://doi.org/10.17977/um018v3i12020p50-59

W. Yundong, A. Zhulev, and O. G. Ahmed, "Credit Card Fraud Identification using Logistic Regression and Random Forest," Wasit Journal of Computer and Mathematics Science, vol. 2, no. 3, pp. 1–8, Sept. 2023. DOI: https://doi.org/10.31185/wjcms.184

R. K. L. Kennedy, F. Villanustre, T. M. Khoshgoftaar, and Z. Salekshahrezaee, "Synthesizing class labels for highly imbalanced credit card fraud detection data," Journal of Big Data, vol. 11, no. 1, Mar. 2024, Art. no. 38. DOI: https://doi.org/10.1186/s40537-024-00897-7

M. E. Lokanan, "Predicting mobile money transaction fraud using machine learning algorithms," Applied AI Letters, vol. 4, no. 2, Apr. 2023, Art. no. e85. DOI: https://doi.org/10.1002/ail2.85

A. Hanae, B. Abdellah, E. Saida, and G. Youssef, "End-to-End Real-time Architecture for Fraud Detection in Online Digital Transactions," International Journal of Advanced Computer Science and Applications, vol. 14, no. 6, pp. 749–757, June 2023. DOI: https://doi.org/10.14569/IJACSA.2023.0140680

T. Albalawi and S. Dardouri, "Enhancing credit card fraud detection using traditional and deep learning models with class imbalance mitigation," Frontiers in Artificial Intelligence, vol. 8, Oct. 2025, Art. no. 1643292. DOI: https://doi.org/10.3389/frai.2025.1643292

Y. Wang, "Fraud detection based on FS-SMOTE model for credit card," Highlights in Science, Engineering and Technology, vol. 70, pp. 316–323, Nov. 2023. DOI: https://doi.org/10.54097/hset.v70i.12479

A.-A. Al-Maari, M. Abdulnabi, Y. Nathan, A. Ali, U. Ali, and M. Khan, "Optimized Credit Card Fraud Detection Leveraging Ensemble Machine Learning Methods," Engineering, Technology & Applied Science Research, vol. 15, no. 3, pp. 22287–22294, June 2025. DOI: https://doi.org/10.48084/etasr.10287

Z. Y. Lim, Y. H. Pang, K. Z. B. Kamarudin, S. Y. Ooi, and F. S. Hiew, "Bayesian optimization driven strategy for detecting credit card fraud with Extremely Randomized Trees," MethodsX, vol. 13, Dec. 2024, Art. no. 103055. DOI: https://doi.org/10.1016/j.mex.2024.103055

F. A. Ghaleb, F. Saeed, M. Al-Sarem, S. N. Qasem, and T. Al-Hadhrami, "Ensemble Synthesized Minority Oversampling-Based Generative Adversarial Networks and Random Forest Algorithm for Credit Card Fraud Detection," IEEE Access, vol. 11, pp. 89694–89710, 2023. DOI: https://doi.org/10.1109/ACCESS.2023.3306621

J. K. Afriyie et al., "A supervised machine learning algorithm for detecting and predicting fraud in credit card transactions," Decision Analytics Journal, vol. 6, Mar. 2023, Art. no. 100163. DOI: https://doi.org/10.1016/j.dajour.2023.100163

E. Esenogho, I. D. Mienye, T. G. Swart, K. Aruleba, and G. Obaido, "A Neural Network Ensemble With Feature Engineering for Improved Credit Card Fraud Detection," IEEE Access, vol. 10, pp. 16400–16407, 2022. DOI: https://doi.org/10.1109/ACCESS.2022.3148298

Z. Salekshahrezaee, J. L. Leevy, and T. M. Khoshgoftaar, "The effect of feature extraction and data sampling on credit card fraud detection," Journal of Big Data, vol. 10, no. 1, Jan. 2023, Art. no. 6. DOI: https://doi.org/10.1186/s40537-023-00684-w

A. R. Khalid, N. Owoh, O. Uthmani, M. Ashawa, J. Osamor, and J. Adejoh, "Enhancing Credit Card Fraud Detection: An Ensemble Machine Learning Approach," Big Data and Cognitive Computing, vol. 8, no. 1, Jan. 2024, Art. no. 6. DOI: https://doi.org/10.3390/bdcc8010006

P. Manorom, U. Detthamrong, and W. Chansanam, "Comparative Assessment of Fraudulent Financial Transactions using the Machine Learning Algorithms Decision Tree, Logistic Regression, Naïve Bayes, K-Nearest Neighbor, and Random Forest," Engineering, Technology & Applied Science Research, vol. 14, no. 4, pp. 15676–15680, Aug. 2024. DOI: https://doi.org/10.48084/etasr.7774

E. Ileberi, Y. Sun, and Z. Wang, "Performance Evaluation of Machine Learning Methods for Credit Card Fraud Detection Using SMOTE and AdaBoost," IEEE Access, vol. 9, pp. 165286–165294, 2021. DOI: https://doi.org/10.1109/ACCESS.2021.3134330

H. Hairani, T. Widiyaningtyas, and D. D. Prasetya, "Addressing Class Imbalance of Health Data: A Systematic Literature Review on Modified Synthetic Minority Oversampling Technique (SMOTE) Strategies," JOIV : International Journal on Informatics Visualization, vol. 8, no. 3, pp. 1310–1318, Sept. 2024. DOI: https://doi.org/10.62527/joiv.8.3.2283

T. Widiyaningtyas, I. Hidayah, and T. B. Adji, "Recommendation Algorithm Using Clustering-Based UPCSim (CB-UPCSim)," Computers, vol. 10, no. 10, Oct. 2021, Art. no. 123. DOI: https://doi.org/10.3390/computers10100123

M. M. Ismail and M. A. Haq, "Enhancing Enterprise Financial Fraud Detection Using Machine Learning," Engineering, Technology & Applied Science Research, vol. 14, no. 4, pp. 14854–14861, Aug. 2024. DOI: https://doi.org/10.48084/etasr.7437

M. A. Mim, N. Majadi, and P. Mazumder, "A soft voting ensemble learning approach for credit card fraud detection," Heliyon, vol. 10, no. 3, Feb. 2024, Art. no. e25466. DOI: https://doi.org/10.1016/j.heliyon.2024.e25466

A. R. Manga’, A. N. Handayani, H. W. Herwanto, R. A. Asmara, Y. I. Sulistya, and K. Kasmira, "Analysis of the Ensemble Method Classifier’s Performance on Handwritten Arabic Characters Dataset," ILKOM Jurnal Ilmiah, vol. 15, no. 1, pp. 186–192, Apr. 2023. DOI: https://doi.org/10.33096/ilkom.v15i1.1357.186-192

"Credit Card Fraud Detection." Kaggle, 2013, [Online]. Available: https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud.

F. Carcillo, A. Dal Pozzolo, Y.-A. Le Borgne, O. Caelen, Y. Mazzer, and G. Bontempi, "SCARFF: A scalable framework for streaming credit card fraud detection with spark," Information Fusion, vol. 41, pp. 182–194, May 2018. DOI: https://doi.org/10.1016/j.inffus.2017.09.005

O. Michael, O. Christopher, A. Maduabuchuku, N. O. Miracle, E.-O. chinenye Love, and O. C. Akachukwu, "Fraud Detection in Financial Transactions Using Ensemble Machine Learning Models," International Journal of Research and Scientific Innovation, vol. 12, no. 13, pp. 27–36, Oct. 2025. DOI: https://doi.org/10.51244/IJRSI.2025.1213CS003

C. Iscan, O. Kumas, F. P. Akbulut, and A. Akbulut, "Wallet-Based Transaction Fraud Prevention Through LightGBM With the Focus on Minimizing False Alarms," IEEE Access, vol. 11, pp. 131465–131474, 2023. DOI: https://doi.org/10.1109/ACCESS.2023.3321666

M. Alamri and M. Ykhlef, "Hybrid Undersampling and Oversampling for Handling Imbalanced Credit Card Data," IEEE Access, vol. 12, pp. 14050–14060, 2024. DOI: https://doi.org/10.1109/ACCESS.2024.3357091

A. A. Alhashmi, A. M. Alashjaee, A. A. Darem, A. F. Alanazi, and R. Effghi, "An Ensemble-based Fraud Detection Model for Financial Transaction Cyber Threat Classification and Countermeasures," Engineering, Technology & Applied Science Research, vol. 13, no. 6, pp. 12433–12439, Dec. 2023. DOI: https://doi.org/10.48084/etasr.6401

W. Rahayu et al., "Synthetic Minority Oversampling Technique (SMOTE) for Boosting the Accuracy of C4.5 Algorithm Model," Journal of Artificial Intelligence and Engineering Applications (JAIEA), vol. 3, no. 3, pp. 624–630, June 2024. DOI: https://doi.org/10.59934/jaiea.v3i3.469

O. A. Adepegba, S. A. Akinboro, S. O. Olabiyisi, O. G. Lala, A. A. Onamade, and A. A. Aroyehun, "Credit Card Fraud Detection Using Voter Ensemble Technique (VET)," Computing, Information Systems, Development Informatics & Allied Research Journa, vol. 13, no. 2, pp. 15–20, June 2022. DOI: https://doi.org/10.22624/AIMS/CISDI/V13N2P3

P. Purnawansyah et al., "Congestion Predictive Modelling on Network Dataset Using Ensemble Deep Learning," Journal of Applied Data Sciences, vol. 5, no. 4, pp. 1597–1613, Oct. 2024. DOI: https://doi.org/10.47738/jads.v5i4.333

D. Breskuvienė and G. Dzemyda, "Enhancing credit card fraud detection: highly imbalanced data case," Journal of Big Data, vol. 11, no. 1, Dec. 2024, Art. no. 182. DOI: https://doi.org/10.1186/s40537-024-01059-5

M. R. Baker, Z. N. Mahmood, and E. H. Shaker, "Ensemble Learning with Supervised Machine Learning Models to Predict Credit Card Fraud Transactions," Revue d’Intelligence Artificielle, vol. 36, no. 4, pp. 509–518, Aug. 2022. DOI: https://doi.org/10.18280/ria.360401

Y. Xie, A. Li, L. Gao, and Z. Liu, "A Heterogeneous Ensemble Learning Model Based on Data Distribution for Credit Card Fraud Detection," Wireless Communications and Mobile Computing, vol. 2021, no. 1, July 2021, Art. no. 2531210. DOI: https://doi.org/10.1155/2021/2531210

Downloads

How to Cite

[1]
J. X. Guterres, T. Widwaningtyas, and I. A. E. Zaeni, “Fraud Identification in Online Financial Transactions Combining SMOTE and Ensemble Classifiers”, Eng. Technol. Appl. Sci. Res., vol. 16, no. 2, pp. 33323–33333, Apr. 2026.

Metrics

Abstract Views: 160
PDF Downloads: 189

Metrics Information