An accurate tax fraud classifier with feature selection based on complex network node centrality measure

Fiscal evasion represents a very serious issue in many developing countries. In this context, tax fraud detection constitutes a challenging problem, since fraudsters change frequently their behaviors to circumvent existing laws and devise new kinds of frauds. Detecting such changes proves to be challenging, since traditional classifiers fail to select features that exhibit frequent variations. In this paper we provide two contributions that try to tackle effectively the tax fraud detection problem: first, we introduce a novel feature selection algorithm, based on complex network techniques, that is able to capture key fraud indicators - over time, this kind of indicators turn out to be more stable than new fraud indicators. Secondly, we propose a classifier that leverages the aforementioned algorithm to accurately detect tax frauds. In order to prove the validity of our contributions we provide an experimental evaluation, where we use real-world datasets, obtained from the State Treasury Office of Ceará (SEFAZ-CE), Brazil, to show how our method is able to outperform, in terms of F1 scores achieved, the state-of-the-art available in the literature.

An accurate tax fraud classifier with feature selection based on complex network node centrality measure

Matos T.;De Macedo J. A. F.;Monteiro J. M.;Lettich F.

2017-01-01

Abstract

Fiscal evasion represents a very serious issue in many developing countries. In this context, tax fraud detection constitutes a challenging problem, since fraudsters change frequently their behaviors to circumvent existing laws and devise new kinds of frauds. Detecting such changes proves to be challenging, since traditional classifiers fail to select features that exhibit frequent variations. In this paper we provide two contributions that try to tackle effectively the tax fraud detection problem: first, we introduce a novel feature selection algorithm, based on complex network techniques, that is able to capture key fraud indicators - over time, this kind of indicators turn out to be more stable than new fraud indicators. Secondly, we propose a classifier that leverages the aforementioned algorithm to accurately detect tax frauds. In order to prove the validity of our contributions we provide an experimental evaluation, where we use real-world datasets, obtained from the State Treasury Office of Ceará (SEFAZ-CE), Brazil, to show how our method is able to outperform, in terms of F1 scores achieved, the state-of-the-art available in the literature.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2017
			
	Titolo del volume
	
				ICEIS 2017 - Proceedings of the 19th International Conference on Enterprise Information Systems
			
	DOI
	
				https://dx.doi.org/10.5220/0006335501450151
			
	Appare nelle tipologie:
	
				4.1 Articolo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
63355.pdf non disponibili Tipologia: Versione dell'editore Licenza: Accesso chiuso-personale Dimensione 241.25 kB Formato Adobe PDF Visualizza/Apri	241.25 kB	Adobe PDF	Visualizza/Apri

I documenti in ARCA sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10278/3730556

Citazioni

ND

0

1

social impact