Science, Technology, Engineering and Mathematics.
Open Access

DEEP LEARNING-BASED CREDIT RISK MODELING: ADDRESSING DATA IMBALANCE AND INVARIANCE

Download as PDF

Volume 2, Issue 2, Pp 1-8, 2025

DOI: https://doi.org/10.61784/jtfe3036

Author(s)

LiangYu Chen, Hao Lin*

Affiliation(s)

School of Management, Sun Yat-sen University, Guangzhou 510275, Guangdong, China.

Corresponding Author

Hao Lin

ABSTRACT

Credit risk modeling plays a crucial role in financial decision-making, helping lenders assess the likelihood of default and optimize lending strategies. Traditional credit risk assessment models, including logistic regression and decision trees, often struggle with data imbalance and invariance issues, leading to biased risk predictions and reduced generalization. The rapid advancement of deep learning (DL) techniques has introduced more sophisticated models capable of learning complex credit risk patterns. However, most DL-based credit scoring models still suffer from class imbalance in default prediction and fail to maintain fairness and stability across different demographic and economic conditions.

This study proposes a DL-based credit risk modeling framework designed to address data imbalance through advanced resampling techniques and generative modeling, while also incorporating adversarial learning to improve model invariance across diverse borrower segments. The proposed framework utilizes autoencoders, generative adversarial networks (GANs), and cost-sensitive learning techniques to enhance risk assessment accuracy while reducing bias. Additionally, domain adaptation techniques are introduced to ensure that the model remains robust across different financial environments.

Experiments on real-world credit datasets demonstrate that the proposed framework significantly improves credit risk prediction accuracy, enhances model fairness, and reduces sensitivity to class imbalance compared to traditional credit scoring approaches. The findings highlight the importance of integrating data-centric augmentation techniques with fairness-aware deep learning to improve the reliability of credit risk modeling in modern financial applications.

KEYWORDS

Credit risk modeling; Deep learning; Data imbalance; Invariance; Fairness; Generative models; Adversarial learning

CITE THIS PAPER

LiangYu Chen, Hao Lin. Deep learning-based credit risk modeling: addressing data imbalance and invariance. Journal of Trends in Financial and Economics. 2025, 2(2): 1-8. DOI: https://doi.org/10.61784/jtfe3036.

REFERENCES

[1] Mashrur A, Luo W, Zaidi N A, et al. Machine learning for financial risk management: a survey. IEEE Access, 2020, 8: 203203-203223.

[2] Sudjianto A, Zhang A. Model validation practice in banking: A structured approach for predictive models. 2024. DOI: 10.48550/arXiv.2410.13877.

[3] Addo P M, Guegan D, Hassani B. Credit risk analysis using machine and deep learning models. Risks, 2018, 6(2): 38.

[4] Bhatore S, Mohan L, Reddy Y R. Machine learning techniques for credit risk evaluation: a systematic literature review. Journal of Banking and Financial Technology, 2020, 4(1): 111-138.

[5] Hamori S, Kawai M, Kume T, et al. Ensemble learning or deep learning? Application to default risk analysis. Journal of Risk and Financial Management, 2018, 11(1): 12.

[6] Han X, Yang Y, Chen J, et al. Symmetry-Aware Credit Risk Modeling: A Deep Learning Framework Exploiting Financial Data Balance and Invariance. Symmetry, 2025, 17(3): 341.

[7] Bussmann N, Giudici P, Marinelli D, et al. Explainable machine learning in credit risk management. Computational Economics, 2021, 57(1): 203–216.

[8] Mhlanga D. Financial inclusion in emerging economies: The application of machine learning and artificial intelligence in credit risk assessment. International Journal of Financial Studies, 2021, 9(3): 39.

[9] Bazarbash M. Fintech in financial inclusion: machine learning applications in assessing credit risk. International Monetary Fund, 2019.

[10] Leo M, Sharma S, Maddulety K. Machine learning in banking risk management: A literature review. Risks, 2019, 7(1): 29.

[11] Moscato V, Picariello A, Sperlí G. A benchmark of machine learning approaches for credit score prediction. Expert Systems with Applications, 2021, 165: 113986.

[12] Munkhdalai L, Munkhdalai T, Namsrai O E, et al. An empirical comparison of machine-learning methods on bank client credit assessments. Sustainability, 2019, 11(3): 699.

[13] Berhane T, Melese T, Seid A M. Performance Evaluation of Hybrid Machine Learning Algorithms for Online Lending Credit Risk Prediction. Applied Artificial Intelligence, 2024, 38(1): 2358661.

[14] Kaisar S, Sifat S T. Explainable Machine Learning Models for Credit Risk Analysis: A Survey. Data Analytics for Management, Banking and Finance: Theories and Application. Cham: Springer Nature Switzerland, 2023: 51-72.

[15] Talaei Khoei T, Ould Slimane H, Kaabouch N. Deep learning: systematic review, models, challenges, and research directions. Neural Computing and Applications, 2023, 35(31): 23103-23124.

[16] Kim E, Lee J, Shin H, et al. Champion-challenger analysis for credit card fraud detection: Hybrid ensemble and deep learning. Expert Systems with Applications, 2019, 128: 214-224.

[17] Guo H, Ma Z, Chen X, et al. Generating artistic portraits from face photos with feature disentanglement and reconstruction. Electronics, 2024, 13(5): 955.

[18] Lee Z, Wu Y C, Wang X. Automated Machine Learning in Waste Classification: A Revolutionary Approach to Efficiency and Accuracy. Proceedings of the 2023 12th International Conference on Computing and Pattern Recognition, 2023, 299-303.

[19] Wang X, Wu Y C, Ma Z. Blockchain in the courtroom: exploring its evidentiary significance and procedural implications in US judicial processes. Frontiers in Blockchain, 2024, 7: 1306058.

[20] Seera M, Lim C P, Kumar A, et al. An intelligent payment card fraud detection system. Annals of Operations Research, 2024, 334(1): 445-467.

[21] Wang X, Wu Y C, Zhou M, et al. Beyond surveillance: privacy, ethics, and regulations in face recognition technology. Frontiers in Big Data, 2024, 7: 1337465.

[22] Liu Y, Wu Y C, Fu H, et al. Digital intervention in improving the outcomes of mental health among LGBTQ+ youth: a systematic review. Frontiers in Psychology, 2023, 14: 1242928.

[23] Ejiofor O E. A comprehensive framework for strengthening USA financial cybersecurity: integrating machine learning and AI in fraud detection systems. European Journal of Computer Science and Information Technology, 2023, 11(6): 62-83.

[24] Carneiro N, Figueira G, Costa M. A data mining based system for credit-card fraud detection in e-tail. Decision Support Systems, 2017, 95: 91-101.

[25] Al-Hashedi K G, Magalingam P. Financial fraud detection applying data mining techniques: A comprehensive review from 2009 to 2019. Computer Science Review, 2021, 40: 100402.

[26] Liang Y, Wang X, Wu Y C, et al. A study on blockchain sandwich attack strategies based on mechanism design game theory. Electronics, 2023, 12(21): 4417.

[27] Li X, Wang X, Chen X, et al. Unlabeled data selection for active learning in image classification. Scientific Reports, 2024, 14(1): 424.

[28] Kalluri K. Optimizing Financial Services Implementing Pega's Decisioning Capabilities for Fraud Detection. International Journal of Innovative Research in Engineering & Multidisciplinary Physical Sciences, 2022, 10(1): 1-9.

[29] Chen S, Liu Y, Zhang Q, et al. Multi-Distance Spatial-Temporal Graph Neural Network for Anomaly Detection in Blockchain Transactions. Advanced Intelligent Systems, 2025, 2400898.

[30] Sailusha R, Gnaneswar V, Ramesh R, et al. Credit card fraud detection using machine learning. 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), 2020: 1264-1270. IEEE.

[31] Cui Y, Han X, Chen J, et al. FraudGNN-RL: A Graph Neural Network With Reinforcement Learning for Adaptive Financial Fraud Detection. IEEE Open Journal of the Computer Society, 2025.

[32] Thennakoon A, Bhagyani C, Premadasa S, et al. Real-time credit card fraud detection using machine learning. 2019 9th International Conference on Cloud Computing, Data Science & Engineering (Confluence), 2019: 488-493. IEEE.

All published work is licensed under a Creative Commons Attribution 4.0 International License. sitemap
Copyright © 2017 - 2025 Science, Technology, Engineering and Mathematics.   All Rights Reserved.