Your browser doesn't support javascript.
loading
P2P Lending Default Prediction Based on AI and Statistical Models.
Ko, Po-Chang; Lin, Ping-Chen; Do, Hoang-Thu; Huang, You-Fu.
Afiliación
  • Ko PC; Department of Intelligent Commerce, National Kaohsiung University of Science and Technology, Kaohsiung 82445, Taiwan.
  • Lin PC; AI Fintech Center, National Kaohsiung University of Science and Technology, Kaohsiung 82445, Taiwan.
  • Do HT; AI Fintech Center, National Kaohsiung University of Science and Technology, Kaohsiung 82445, Taiwan.
  • Huang YF; Department of Finance and Information, National Kaohsiung University of Science and Technology, Kaohsiung 82445, Taiwan.
Entropy (Basel) ; 24(6)2022 Jun 08.
Article en En | MEDLINE | ID: mdl-35741522
Peer-to-peer lending (P2P lending) has proliferated in recent years thanks to Fintech and big data advancements. However, P2P lending platforms are not tightly governed by relevant laws yet, as their development speed has far exceeded that of regulations. Therefore, P2P lending operations are still subject to risks. This paper proposes prediction models to mitigate the risks of default and asymmetric information on P2P lending platforms. Specifically, we designed sophisticated procedures to pre-process mass data extracted from Lending Club in 2018 Q3-2019 Q2. After that, three statistical models, namely, Logistic Regression, Bayesian Classifier, and Linear Discriminant Analysis (LDA), and five AI models, namely, Decision Tree, Random Forest, LightGBM, Artificial Neural Network (ANN), and Convolutional Neural Network (CNN), were utilized for data analysis. The loan statuses of Lending Club's customers were rationally classified. To evaluate the models, we adopted the confusion matrix series of metrics, AUC-ROC curve, Kolmogorov-Smirnov chart (KS), and Student's t-test. Empirical studies show that LightGBM produces the best performance and is 2.91% more accurate than the other models, resulting in a revenue improvement of nearly USD 24 million for Lending Club. Student's t-test proves that the differences between models are statistically significant.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Entropy (Basel) Año: 2022 Tipo del documento: Article País de afiliación: Taiwán Pais de publicación: Suiza

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Entropy (Basel) Año: 2022 Tipo del documento: Article País de afiliación: Taiwán Pais de publicación: Suiza