The necessity of determining the probability of default often conflicts with the requirement for employing parsimonious methodologies. The inclusion of a large number of regressors in Logit models or Machine Learning approaches can lead to overfitting, thereby introducing biases that distort the results. This study aims to examine the extent to which an indicator that synthesizes information related to balance sheet metrics can achieve a performance comparable to that obtained through a comprehensive set of indicators. In this paper, we introduce the Synthetic Performance Indicator (ISP), which is derived from specific balance sheet indicators. We demonstrate its effectiveness as a synthetic measure of financial stability in localized settings. Furthermore, we assess its potential to serve as a viable alternative to the broader panel of indicators from which it is constructed. Finally, we provide further evidence of how Machine Learning approaches, despite being effective in-sample, perform poorly out-of-sample.

ISP Index: A Parsimonious Method to Predict Defaults

Roberto Casarin
;
Fausto Corradin;Antonio Peruzzi
2025-01-01

Abstract

The necessity of determining the probability of default often conflicts with the requirement for employing parsimonious methodologies. The inclusion of a large number of regressors in Logit models or Machine Learning approaches can lead to overfitting, thereby introducing biases that distort the results. This study aims to examine the extent to which an indicator that synthesizes information related to balance sheet metrics can achieve a performance comparable to that obtained through a comprehensive set of indicators. In this paper, we introduce the Synthetic Performance Indicator (ISP), which is derived from specific balance sheet indicators. We demonstrate its effectiveness as a synthetic measure of financial stability in localized settings. Furthermore, we assess its potential to serve as a viable alternative to the broader panel of indicators from which it is constructed. Finally, we provide further evidence of how Machine Learning approaches, despite being effective in-sample, perform poorly out-of-sample.
2025
Supervised and Unsupervised Statistical Data Analysis
File in questo prodotto:
File Dimensione Formato  
CLADAG_Casarin_Corradin_Peruzzi.pdf

non disponibili

Tipologia: Documento in Pre-print
Licenza: Copyright dell'editore
Dimensione 124.75 kB
Formato Adobe PDF
124.75 kB Adobe PDF   Visualizza/Apri

I documenti in ARCA sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10278/5102749
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact