AI-Enhanced Loan Default Prediction Models
Abstract
Financial institutions face increasing challenges with respect to loan default prediction because their existing simple models have become outdated. Past due accounts have been left on bank balance sheets longer, and as a result, older default models are not accurately predicting charge-offs. Keenly interested in new and advanced techniques for assessing loan default risk, banks are turning to artificial intelligence, which enhances default prediction by taking into account many more factors than can be analyzed or weighted by a loan officer. Loans are the largest asset at most banks, and the ability to predict whether or not those loans will be paid is critical to sustaining institutional finances. Similarly, the prediction of which loans will default greatly affects the nation’s capital markets, with implications domestically as well as on the global economies. U.S. banks presently use logistic statistical models, typically either a stand-alone or blended credit score model, whose primary output is a score, i.e., probability, predicting the likelihood of loan default. While useful, it is not effective to approach loan default prediction strictly from a quantitative perspective. Qualitative factors, unique to individual borrower behavior, in conjunction with abundant large-scale data—such as established industry performance data—are important. The addition of borrower account activity to improve the discriminatory ability of blended credit score logistic models would be invaluable. In the domain of consumer lending, numerous models offer consumer-behavioral predictive factors with varying degrees of reliability. In small business lending, however, generating alternative data that a consumer or commercial bank might use to assess default risk is very difficult. Instead, the chosen methodology for this research focuses on unique account activities identified as important in previous unsuccessful small business financial investigations. In comparing recent studies exploring conventional statistical versus alternate AI approaches for loan risk prediction, no research is known to directly compare a blended credit score logistic model to its AI counterpart. Although data mining is often compared with conventional models in the literature, decision trees implemented by AI techniques are very rarely seen. Instead, AI-enhanced linear logistic models, neural networks, or genetic algorithms are conventionally studied. Moreover, easily interpreted decision tree methodology is rare in research, specifically for the domain of loan default prediction.
Downloads
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
License Terms
Ownership and Licensing:
Authors of this research paper submitted to the journal owned and operated by The Science Brigade Group retain the copyright of their work while granting the journal certain rights. Authors maintain ownership of the copyright and have granted the journal a right of first publication. Simultaneously, authors agreed to license their research papers under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) License.
License Permissions:
Under the CC BY-NC-SA 4.0 License, others are permitted to share and adapt the work, as long as proper attribution is given to the authors and acknowledgement is made of the initial publication in the Journal. This license allows for the broad dissemination and utilization of research papers.
Additional Distribution Arrangements:
Authors are free to enter into separate contractual arrangements for the non-exclusive distribution of the journal's published version of the work. This may include posting the work to institutional repositories, publishing it in journals or books, or other forms of dissemination. In such cases, authors are requested to acknowledge the initial publication of the work in this Journal.
Online Posting:
Authors are encouraged to share their work online, including in institutional repositories, disciplinary repositories, or on their personal websites. This permission applies both prior to and during the submission process to the Journal. Online sharing enhances the visibility and accessibility of the research papers.
Responsibility and Liability:
Authors are responsible for ensuring that their research papers do not infringe upon the copyright, privacy, or other rights of any third party. The Science Brigade Publishers disclaim any liability or responsibility for any copyright infringement or violation of third-party rights in the research papers.
