Academic research and the financial industry have recently shown great interest in Machine Learning algorithms capable of solving complex learning tasks, although in the field of firms' default prediction the lack of interpretability has prevented an extensive adoption of the black-box type of models. In order to overcome this drawback and maintain the high performances of black-boxes, this paper has chosen a model-agnostic approach. Accumulated Local Effects and Shapley values are used to shape the predictors' impact on the likelihood of default and rank them according to their contribution to the model outcome. Prediction is achieved by two Machine Learning algorithms (eXtreme Gradient Boosting and FeedForward Neural Networks) compared with three standard discriminant models. Results show that our analysis of the Italian Small and Medium Enterprises manufacturing industry benefits from the overall highest classification power by the eXtreme Gradient Boosting algorithm still maintaining a rich interpretation framework to support decisions.

Crosato, L., Liberati, C., Repetto, M. (2023). Lost in a black-box? Interpretable machine learning for assessing Italian SMEs default. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 39(16), 829-846 [10.1002/asmb.2803].

Lost in a black-box? Interpretable machine learning for assessing Italian SMEs default

Liberati C.
;
Repetto M.
2023

Abstract

Academic research and the financial industry have recently shown great interest in Machine Learning algorithms capable of solving complex learning tasks, although in the field of firms' default prediction the lack of interpretability has prevented an extensive adoption of the black-box type of models. In order to overcome this drawback and maintain the high performances of black-boxes, this paper has chosen a model-agnostic approach. Accumulated Local Effects and Shapley values are used to shape the predictors' impact on the likelihood of default and rank them according to their contribution to the model outcome. Prediction is achieved by two Machine Learning algorithms (eXtreme Gradient Boosting and FeedForward Neural Networks) compared with three standard discriminant models. Results show that our analysis of the Italian Small and Medium Enterprises manufacturing industry benefits from the overall highest classification power by the eXtreme Gradient Boosting algorithm still maintaining a rich interpretation framework to support decisions.
Articolo in rivista - Articolo scientifico
accumulated local effects; default prediction; interpretability; machine learning; small and medium sized enterprises;
English
7-ago-2023
2023
39
16
829
846
none
Crosato, L., Liberati, C., Repetto, M. (2023). Lost in a black-box? Interpretable machine learning for assessing Italian SMEs default. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 39(16), 829-846 [10.1002/asmb.2803].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/435578
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact