Small and Medium Enterprises (SMEs) contribution to the European Union economy has always been relevant, for both value added and the creation of jobs. That is why the prediction of their survival is considered one of the economic pillars UE keeps under observation. Default prediction models, accounting for SMEs idiosyncratic traits, are based on several types of data, mainly accounting indicators. Balance sheet data, indeed, are considered the standard predictors for classification models in this field, although they do not allow to completely overcome the information opacity that is one of the main barriers preventing these firms from accessing credit. In our work, we explore the possibility of complementing accounting information with data scraped from the firms’ websites. We modeled the data using a nonlinear discriminant analysis and we benchmarked the results with the Logistic Regression. The evidence of our study is promising although the combination of online and offline data shows better results in case of survival firms than for defaulted companies.

Crosato, L., Domenech, J., Liberati, C. (2022). Non-conventional data and default prediction: the challenge of companies’ websites. In CARMA 2022 (pp.253-258). Editorial Universitat Politècnica de València [10.4995/CARMA2022.2022.15103].

Non-conventional data and default prediction: the challenge of companies’ websites

Liberati C.
2022

Abstract

Small and Medium Enterprises (SMEs) contribution to the European Union economy has always been relevant, for both value added and the creation of jobs. That is why the prediction of their survival is considered one of the economic pillars UE keeps under observation. Default prediction models, accounting for SMEs idiosyncratic traits, are based on several types of data, mainly accounting indicators. Balance sheet data, indeed, are considered the standard predictors for classification models in this field, although they do not allow to completely overcome the information opacity that is one of the main barriers preventing these firms from accessing credit. In our work, we explore the possibility of complementing accounting information with data scraped from the firms’ websites. We modeled the data using a nonlinear discriminant analysis and we benchmarked the results with the Logistic Regression. The evidence of our study is promising although the combination of online and offline data shows better results in case of survival firms than for defaulted companies.
slide + paper
Website Data; SMEs; Default Prediction; Kernel Discriminant
English
4th International Conference on Advanced Research Methods and Analytics (CARMA2022)
2022
CARMA 2022
978-84-1396-018-0
2022
253
258
open
Crosato, L., Domenech, J., Liberati, C. (2022). Non-conventional data and default prediction: the challenge of companies’ websites. In CARMA 2022 (pp.253-258). Editorial Universitat Politècnica de València [10.4995/CARMA2022.2022.15103].
File in questo prodotto:
File Dimensione Formato  
Carma_2022_15103-44251-1-PB.pdf

accesso aperto

Descrizione: Intervento a convegno
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 758.04 kB
Formato Adobe PDF
758.04 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/392858
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact