EVALUATING THE PERFORMANCE OF RANDOM FOREST AND LOGISTIC REGRESSION MODELS IN DETECTING BANK TRANSACTION FRAUD: A COMPARATIVE STUDY

Authors

  • Lars Johansson, Department of Computer Science, Royal Institute of Technology (KTH), Stockholm, Sweden
  • Professor Sarah O'Connor, School of Computing, University College Dublin, Dublin, Ireland
  • Ahmed Khan, Department of Computer Science, University of New South Wales, Sydney, Australia

DOI:

https://doi.org/10.5281/zenodo.13833848

Keywords:

Fraud detection, Machine learning models, Random Forest, Logistic Regression, Bank transactions

Abstract

This study explores the development and evaluation of machine learning models for detecting fraudulent bank transactions. Features such as transaction type, amount, balance, and date are extracted from transaction data, and each transaction is labeled as genuine or fraudulent based on balance consistency and transaction limits. The dataset is split into training and testing sets, and two models, Random Forest and Logistic Regression, are trained on standardized features. The models are evaluated on accuracy, precision, recall, and F1-score. Results indicate that the Random Forest model outperforms Logistic Regression in accuracy, which is attributed to its ability to handle complex relationships within the data. Logistic Regression, however, offers valuable probabilistic insights. Challenges such as data imbalance and feature extraction quality are addressed with techniques such as the Synthetic Minority Over-sampling Technique (SMOTE) and advanced preprocessing methods. Prediction probabilities are visualized using Matplotlib for easier interpretation. Future work includes enhancing feature extraction, expanding the dataset, and exploring more advanced models to further improve performance. This study demonstrates the potential of combining multiple validation techniques and machine learning models with a user-friendly interface to create a robust solution for detecting fraudulent bank transactions, thereby enhancing financial security.
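The four evaluation metrics named in the abstract can be computed directly from confusion-matrix counts. The following is a minimal sketch, not the authors' code: the function name `classification_metrics` and the label vectors are hypothetical and made up purely for illustration, with 1 standing for a fraudulent transaction and 0 for a genuine one.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Return accuracy, precision, recall, and F1 for a binary task.

    Hypothetical helper for illustration; `positive` marks the fraud class.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn

    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    # Guard against division by zero when a class is never predicted or absent.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up, imbalanced example: 95 genuine and 5 fraudulent transactions.
y_true = [0] * 95 + [1] * 5

# A trivial predictor that labels every transaction as genuine.
baseline = classification_metrics(y_true, [0] * 100)

# A predictor that raises 2 false alarms and catches 4 of the 5 frauds.
y_pred = [1] * 2 + [0] * 93 + [1] * 4 + [0] * 1
improved = classification_metrics(y_true, y_pred)

print(baseline)
print(improved)
```

On this made-up data the trivial predictor reaches an accuracy of 0.95 while its recall is 0, whereas the second predictor reaches a precision of about 0.667, a recall of 0.8, and an F1-score of about 0.727. This illustrates why accuracy alone can mislead on imbalanced fraud data and why precision, recall, and F1-score are reported alongside it.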

Published

2024-09-24

How to Cite

Johansson, L., O'Connor, S., & Khan, A. (2024). EVALUATING THE PERFORMANCE OF RANDOM FOREST AND LOGISTIC REGRESSION MODELS IN DETECTING BANK TRANSACTION FRAUD: A COMPARATIVE STUDY. Ayden International Journal of Basic and Applied Sciences, 12(1), 1–9. https://doi.org/10.5281/zenodo.13833848

Section

Articles