
Performance assessment of ensemble learning systems in financial data classification

Giakoumelou A.
2020-01-01

Abstract

Financial data classification plays an important role in the investment and banking industries, where it is used to control default risk, improve cash flow, and select the best customers. Ensemble learning and classification systems, which combine the outputs of different classifiers, are increasingly applied to financial data. The objective of this research is to assess the relative performance of existing state-of-the-art ensemble learning and classification systems in corporate bankruptcy prediction and credit scoring. The ensemble systems considered are AdaBoost, LogitBoost, RUSBoost, subspace, and bagging. Experiments are run on three datasets: one composed of quantitative attributes, one of qualitative data, and one combining both quantitative and qualitative attributes. Under ten-fold cross-validation, the results show that AdaBoost is effective in terms of low classification error, limited complexity, and short processing time. In addition, the ensemble classification systems outperform existing models that were recently validated on the same databases. Ensemble classification systems can therefore be employed to increase the reliability and consistency of financial data classification.
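
The evaluation protocol named in the abstract, an ensemble classifier scored by ten-fold cross-validation, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes discrete AdaBoost over decision stumps, a synthetic two-feature dataset in place of the bankruptcy and credit-scoring data, and hypothetical function names (`fit_stump`, `adaboost_fit`, `adaboost_predict`, `cross_val_error`) chosen only for illustration.

```python
import math
import random


def fit_stump(X, y, w):
    """Return (weighted_error, feature, threshold, polarity) of the best stump."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            for pol in (1, -1):
                # A stump predicts `pol` when the feature exceeds t, else `-pol`.
                err = sum(wi for row, yi, wi in zip(X, y, w)
                          if (pol if row[j] > t else -pol) != yi)
                if best is None or err < best[0]:
                    best = (err, j, t, pol)
    return best


def adaboost_fit(X, y, rounds=10):
    """Discrete AdaBoost; labels are assumed to be -1 or +1."""
    n = len(X)
    w = [1.0 / n] * n
    model = []
    for _ in range(rounds):
        err, j, t, pol = fit_stump(X, y, w)
        err = min(max(err, 1e-10), 1 - 1e-10)  # keep the logarithm finite
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, j, t, pol))
        # Up-weight misclassified samples, down-weight the rest, renormalize.
        w = [wi * math.exp(-alpha * yi * (pol if row[j] > t else -pol))
             for row, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return model


def adaboost_predict(model, row):
    """Sign of the alpha-weighted vote of all stumps."""
    score = sum(a * (pol if row[j] > t else -pol) for a, j, t, pol in model)
    return 1 if score >= 0 else -1


def cross_val_error(X, y, k=10, rounds=10, seed=0):
    """Misclassification rate pooled over k held-out folds."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    errors = 0
    for fold in folds:
        held = set(fold)
        train = [i for i in idx if i not in held]
        model = adaboost_fit([X[i] for i in train], [y[i] for i in train], rounds)
        errors += sum(adaboost_predict(model, X[i]) != y[i] for i in fold)
    return errors / len(X)


# Synthetic stand-in data (an assumption, not one of the paper's datasets).
rng = random.Random(42)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(100)]
y = [1 if a + b > 0 else -1 for a, b in X]
cv_error = cross_val_error(X, y)
print(f"10-fold CV error: {cv_error:.3f}")
```

The sketch pools errors across folds so that every sample is scored exactly once by a model that never saw it during training. Comparing several ensemble methods under this protocol would mean reusing the same fold split for each method.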

Files in this item:
File: Research Article 3 Intelligent Systems in Accounting, Finance and Management.pdf (not available)
Type: Post-print document
License: Closed access (personal)
Size: 988.21 kB
Format: Adobe PDF

Documents in ARCA are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10278/3755092
Citations
  • PMC: not available
  • Scopus: 26
  • ISI (Web of Science): 20