Comparison between suitable priors for additive Bayesian networks
Pittavino M.
2019-01-01
Abstract
Additive Bayesian networks (ABN) are a class of graphical models that extend the usual Bayesian generalised linear model to multiple dependent variables through the factorisation of the joint probability distribution of the underlying variables. When fitting an ABN model, the choice of prior for the parameters is of crucial importance. If an inadequate prior, such as an insufficiently informative one, is used, data separation and data sparsity may cause problems in the model selection process. In this work we present a simulation study comparing two weakly informative priors with a strongly informative one. The first weakly informative prior is a zero-mean Gaussian with a large variance, currently implemented in the R package abn. The candidate weakly informative prior is a Student's t-distribution, specifically designed for logistic regression. Finally, the strongly informative prior is Gaussian with mean equal to the true parameter value and a small variance. We compare the impact of these priors on the accuracy of the learned additive Bayesian network as a function of different parameters. We also design a simulation study to illustrate Lindley's paradox arising from the prior choice. We conclude by highlighting the good performance of the informative Student's t-prior and the limited impact of Lindley's paradox. Finally, suggestions for further developments are provided.
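To make the contrast between the three priors concrete, the following minimal Python sketch evaluates their log-densities at a large coefficient value, as can arise under data separation in logistic regression. All hyperparameters (the Gaussian standard deviations, the t scale and degrees of freedom, and the "true" coefficient) are illustrative assumptions, not values taken from the paper or from the abn package defaults.

```python
import math

def gauss_logpdf(x, mu, sd):
    """Log-density of a Normal(mu, sd^2) prior at x."""
    return -0.5 * math.log(2 * math.pi * sd**2) - (x - mu) ** 2 / (2 * sd**2)

def t1_logpdf(x, scale):
    """Log-density of a Student's t prior with 1 d.f. (Cauchy), location 0."""
    return -math.log(math.pi * scale * (1 + (x / scale) ** 2))

beta = 5.0        # a large coefficient, as produced under data separation
true_beta = 1.2   # hypothetical "true" value used by the strong prior

weak_gauss = gauss_logpdf(beta, 0.0, 10.0)         # zero mean, large variance
weak_t = t1_logpdf(beta, 2.5)                      # heavy-tailed t prior
strong_gauss = gauss_logpdf(beta, true_beta, 0.5)  # centred on the truth

print(round(weak_gauss, 3), round(weak_t, 3), round(strong_gauss, 3))
```

The point of the sketch: both weakly informative priors assign non-negligible density to an extreme coefficient (the heavy-tailed t still shrinks it more gently than a misplaced strong prior would), whereas the strongly informative Gaussian centred away from the extreme value penalises it severely; this is the mechanism through which the prior choice interacts with separated or sparse data during model selection.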