PREDICTING BANK LOAN APPROVAL USING LOGISTIC REGRESSION AND FEATURE SELECTION METHOD

Dadang Amir Hamzah, fika lestauli sigalingging

Abstract


Examining bank loan application is a long process that requires a detail check in every stages. This process is important in the banking industry, as it directly impacts the bank's risk management and profitability. However, due to a long process in making decision the customer wait a long time to get the decision which results to the customers dissatisfaction. Therefore, to improve the examination process and provide a quick decision result, the more effective tools is required. Logistic regression is a machine learning method that able to predict the binary output based on the probability value. This method takes the value from the multiple regression method and convert it into probability value using the activation function called sigmoid function. This paper applies the logistic regression method to predict bank loan approvals based on several features considered as independent variables. This research uses the secondary data taken from www.kaggle.com. The model performance is measured using the confusion matrix that consist of accuracy, precision, recall, and F1 score. This research construct three models based on data type. The first model is constructed using numerical data only, the second model is constructed using categorical data only, and the third model is constructed by combining numerical data type and categorical data type. It is determined that the first model return 87.8% accuracy, 95.94% precision, 87.57% recall, and 91.56% F1 score. The second model return accuracy 69%, precision 96.81%, recall 69.87%, and F1 score 81.17%. Moreover, the return 86.6% accuracy, 96.81% precision, 85.64% recall, and 90.88% F1 score. Based on these results, it is concluded that the best method in processing loan bank application data is to use the third model that is the model that includes both categorical and numerical data type.

 


Full Text:

PDF

References


Abdou, H. A., & Pointon, J. (2011). CREDIT SCORING, STATISTICAL TECHNIQUES AND EVALUATION CRITERIA: A REVIEW OF THE LITERATURE. Intelligent Systems in Accounting, Finance and Management, 18(2–3). https://doi.org/10.1002/isaf.325

Altman, E. I. (1968). Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy. The Journal of Finance, 23(4). https://doi.org/10.2307/2978933

Amir Hamzah, D. (2021). Predicting travel insurance policy claim using logistic regression. Applied Quantitative Analysis, 1(1). https://doi.org/10.31098/quant.613

Arowolo, M. O., Adeniyi, O. F., & ... (2022). A Prediction Model for Bank Loans Using Agglomerative Hierarchical Clustering with Classification Approach. Covenant …, 10(2).

Bindal, A., & Chaurasia, S. (2018). Predictive risk analysis for loan repayment of credit card clients. 2018 3rd IEEE International Conference on Recent Trends in Electronics, Information and Communication Technology, RTEICT 2018 - Proceedings. https://doi.org/10.1109/RTEICT42901.2018.9012366

Calibo, D. I., & Ballera, M. A. (2019). Variable selection for credit risk scoring on loan performance using regression analysis. 2019 IEEE 4th International Conference on Computer and Communication Systems, ICCCS 2019. https://doi.org/10.1109/CCOMS.2019.8821664

Ekadjaja, M. (2018). FACTORS DETERMINING BANK LOAN APPROVAL AS SOURCE OF FINANCING FOR MICRO, SMALL, AND MEDIUM ENTERPRISES (MSME) IN JAKARTA. Jurnal Muara Ilmu Ekonomi Dan Bisnis, 2(1). https://doi.org/10.24912/jmieb.v2i1.1563

J. R. Quinlan. (1987). Credit Approval [Dataset]. In UCI Machine Learning Repository. UCI Machine Learning Repository.

Kannan, M. K. J., Kannan, J., Nithej, A. R., Akhil, P., Gowda, P., & Pareek, P. (2023). ML Based Loan Approval Prediction System A Novel Approach. International Journal of Innovative Research in Computer and Communication Engineering | An ISO, 11(3).

Kumar, R., Jain, V., Sharma, P., Awasthi, S., & Jha, G. (2019). Prediction of loan approval using machine learning. International Journal of Advanced Science and Technology, 28(7). https://doi.org/10.56726/irjmets41180

Ohlson, J. A. (1980). Financial Ratios and the Probabilistic Prediction of Bankruptcy. Journal of Accounting Research, 18(1). https://doi.org/10.2307/2490395

Peng, C. Y. J., Lee, K. L., & Ingersoll, G. M. (2002). An introduction to logistic regression analysis and reporting. Journal of Educational Research, 96(1). https://doi.org/10.1080/00220670209598786

Singh Sandhu, H., Sharma, V., & Jassi, V. (2023). Loan Approval Prediction Using Machine Learning. In Emerging Trends In Engineering And Management. https://doi.org/10.56155/978-81-955020-3-5-01

Tobback, E., Bellotti, T., Moeyersoms, J., Stankova, M., & Martens, D. (2017). Bankruptcy prediction for SMEs using relational data. Decision Support Systems, 102. https://doi.org/10.1016/j.dss.2017.07.004

Xiao, F., Chen, W., Wang, J., & Erten, O. (2022). A Hybrid Logistic Regression: Gene Expression Programming Model and Its Application to Mineral Prospectivity Mapping. Natural Resources Research, 31(4). https://doi.org/10.1007/s11053-021-09918-1

Zhou, Y., Chen, S., Zhong, Y., & Deng, X. (2023). Study on Learning Method of Logistic Regression Classification for Class Imbalance Problem. Learning and Analytics in Intelligent Systems, 31. https://doi.org/10.1007/978-3-031-29016-9_34




DOI: http://dx.doi.org/10.33021/icfbe.v0i0.5691

Refbacks

  • There are currently no refbacks.



Editorial Office:

Faculty of Business President University 
Jalan Ki Hajar Dewantara Mekarmukti
Cikarang Utara, Bekasi, Jawa Barat


Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.