Machine Learning-Based Credit Risk Early Warning System for Small and Medium-Sized Financial Institutions: An Ensemble Learning Approach with Interpretable Risk Indicators

Main Article Content

Yifei Li
Sida Zhang

Abstract

Small and medium-sized financial institutions encounter distinct challenges in implementing effective credit risk management systems due to limited resources and technological infrastructure. This study develops an ensemble learning framework tailored for early warning detection in credit portfolios, addressing the urgent need for cost-efficient risk assessment solutions. The proposed methodology combines multiple machine learning algorithms through a hierarchical voting mechanism, effectively handling imbalanced financial datasets with specialized resampling techniques. Experimental validation using real-world credit data from regional banks demonstrates strong performance, achieving 87.3% accuracy in default prediction over a 12-month forecast horizon. The framework also incorporates interpretable feature importance analysis, enabling risk managers to identify key indicators of portfolio deterioration, including debt-to-equity ratios, cash flow volatility patterns, and industry-specific economic signals. Implementation analysis indicates potential cost reductions of 34% compared to traditional risk assessment methods while maintaining compliance with regulatory standards. Furthermore, the system's modular architecture supports incremental deployment, allowing institutions to adopt machine learning capabilities without extensive infrastructure overhaul.

Article Details

Section

Articles

How to Cite

Machine Learning-Based Credit Risk Early Warning System for Small and Medium-Sized Financial Institutions: An Ensemble Learning Approach with Interpretable Risk Indicators. (2025). Journal of Science, Innovation & Social Impact, 1(1), 372-383. https://sagespress.com/index.php/JSISI/article/view/41

References

1. A. Mashrur, W. Luo, N. A. Zaidi, and A. Robles-Kelly, "Machine learning for financial risk management: A survey," IEEE Access, vol. 8, pp. 203203-203223, 2020. doi: 10.1109/access.2020.3036322

2. R. DeYoung, W. C. Hunter, and G. F. Udell, "The past, present, and probable future for community banks," Journal of Financial Services Research, vol. 25, no. 2, pp. 85-133, 2004. doi: 10.2139/ssrn.446961

3. X. Chen, X. Wang, and D. D. Wu, "Credit risk measurement and early warning of SMEs: An empirical study of listed SMEs in China," Decision Support Systems, vol. 49, no. 3, pp. 301-310, 2010. doi: 10.1016/j.dss.2010.03.005

4. J. Roeder, M. Palmer, and J. Muntermann, "Data-driven decision-making in credit risk management: The information value of analyst reports," Decision Support Systems, vol. 158, p. 113770, 2022. doi: 10.1016/j.dss.2022.113770

5. L. Yu, S. Wang, and K. K. Lai, "Credit risk assessment with a multistage neural network ensemble learning approach," Expert Systems with Applications, vol. 34, no. 2, pp. 1434-1444, 2008.

6. R. B. Avery, and K. A. Samolyk, "Bank consolidation and small business lending: The role of community banks," Journal of Financial Services Research, vol. 25, no. 2, pp. 291-325, 2004.

7. C. Wen, J. Yang, L. Gan, and Y. Pan, "Big data driven Internet of Things for credit evaluation and early warning in finance," Future Generation Computer Systems, vol. 124, pp. 295-307, 2021. doi: 10.1016/j.future.2021.06.003

8. W. R. Emmons, R. A. Gilbert, and T. J. Yeager, "Reducing the risk at small community banks: Is it size or geographic diversification that matters? Journal of Financial Services Research, 25(2), 259-281," 2004.

9. G. Du, Z. Liu, and H. Lu, "Application of innovative risk early warning mode under big data technology in Internet credit financial risk assessment," Journal of Computational and Applied Mathematics, vol. 386, p. 113260, 2021. doi: 10.1016/j.cam.2020.113260

10. L. Yu, W. Yue, S. Wang, and K. K. Lai, "Support vector machine based multiagent ensemble learning for credit risk evaluation," Expert Systems with Applications, vol. 37, no. 2, pp. 1351-1360, 2010.

11. K. Xiong, Z. Wu, and X. Jia, "Deepcontainer: A deep learning-based framework for real-time anomaly detection in cloud-native container environments," Journal of Advanced Computing Systems, vol. 5, no. 1, pp. 1-17, 2025. doi: 10.69987/jacs.2025.50101

12. J. F. Au Yeung, Z. K. Wei, K. Y. Chan, H. Y. Lau, and K. F. C. Yiu, "Jump detection in financial time series using machine learning algorithms," Soft Computing, vol. 24, no. 3, pp. 1789-1801, 2020.

13. B. Schwaab, S. J. Koopman, and A. Lucas, "Systemic risk diagnostics: Coincident indicators and early warning signals (No. 1327)," ECB Working Paper, 2011.

14. J. Verbesselt, P. Jonsson, S. Lhermitte, J. van Aardt, and P. Coppin, "Evaluating satellite and climate data-derived indices as fire risk indicators in savanna ecosystems," IEEE Transactions on Geoscience and Remote Sensing, vol. 44, no. 6, pp. 1622-1632, 2006. doi: 10.1109/tgrs.2005.862262

15. A. Midekisa, G. Senay, G. M. Henebry, P. Semuniguse, and M. C. Wimberly, "Remote sensing-based time series models for malaria early warning in the highlands of Ethiopia," Malaria Journal, vol. 11, no. 1, p. 165, 2012. doi: 10.1186/1475-2875-11-165

16. A. Kang, and K. Yu, "The impact of financial data visualization techniques on enhancing budget transparency in local government decision-making," Spectrum of Research, vol. 5, no. 2, 2025.

17. W. Liu, S. Fan, and G. Weng, "Multimodal deep learning framework for early Parkinson's disease detection through gait pattern analysis using wearable sensors and computer vision," Journal of Computing Innovations and Applications, vol. 1, no. 2, pp. 74-86, 2023.

18. W. Liu, S. Fan, and G. Weng, "Multi-modal deep learning framework for early Alzheimer's disease detection using MRI neuroimaging and clinical data fusion," Annals of Applied Sciences, vol. 6, no. 1, 2025.

19. D. Yuan, H. Wang, and L. Guo, "Cultural-behavioral network fingerprinting for Asia-Pacific cross-border securities trading," Academia Nexus Journal, vol. 4, no. 2, 2025.

20. X. Li, and R. Jia, "Energy-aware scheduling algorithm optimization for AI workloads in data centers based on renewable energy supply prediction," Journal of Computing Innovations and Applications, vol. 2, no. 2, pp. 56-65, 2024.

21. L. Yu, and X. Li, "Dynamic optimization method for differential privacy parameters based on data sensitivity in federated learning," Journal of Advanced Computing Systems, vol. 5, no. 6, pp. 1-13, 2025.

22. L. Guo, Z. Li, K. Qian, W. Ding, and Z. Chen, "Bank credit risk early warning model based on machine learning decision trees," Journal of Economic Theory and Business Management, vol. 1, no. 3, pp. 24-30, 2024.

23. C. Fan, W. Ding, K. Qian, H. Tan, and Z. Li, "Cueing flight object trajectory and safety prediction based on SLAM technology," Journal of Theory and Practice of Engineering Science, vol. 4, no. 05, pp. 1-8, 2024. doi: 10.53469/jtpes.2024.04(05).01

24. C. Fan, Z. Li, W. Ding, H. Zhou, and K. Qian, "Integrating artificial intelligence with SLAM technology for robotic navigation and localization in unknown environments," International Journal of Robotics and Automation, vol. 29, no. 4, pp. 215-230, 2024.

25. K. Qian, C. Fan, Z. Li, H. Zhou, and W. Ding, "Implementation of artificial intelligence in investment decision-making in the Chinese A-share market," Journal of Economic Theory and Business Management, vol. 1, no. 2, pp. 36-42, 2024.

26. W. Jiang, K. Qian, C. Fan, W. Ding, and Z. Li, "Applications of generative AI-based financial robot advisors as investment consultants," Applied and Computational Engineering, vol. 67, pp. 28-33, 2024.

27. Z. Li, C. Fan, W. Ding, and K. Qian, "Robot navigation and map construction based on SLAM technology," 2024. doi: 10.53469/wjimt.2024.07(03).02

28. W. Ding, H. Zhou, H. Tan, Z. Li, and C. Fan, "Automated compatibility testing method for distributed software systems in cloud computing," 2024.

29. A. Kang, Z. Li, and S. Meng, "AI-enhanced risk identification and intelligence sharing framework for anti-money laundering in cross-border income swap transactions," Journal of Advanced Computing Systems, vol. 3, no. 5, pp. 34-47, 2023. doi: 10.69987/jacs.2023.30504

30. X. Wang, Z. Chu, and Z. Li, "Optimization research on single image dehazing algorithm based on improved dark channel prior," Artificial Intelligence and Machine Learning Review, vol. 4, no. 4, pp. 57-74, 2023.

31. W. Ding, H. Tan, H. Zhou, Z. Li, and C. Fan, "Immediate traffic flow monitoring and management based on multimodal data in cloud computing," Journal of Transportation Systems, vol. 18, no. 3, pp. 102-118, 2024.