Experimental evidence of effective human-AI collaboration in medical decision-making

Artificial Intelligence (AI) systems are valuable support for decision-making, with many applications in the medical domain as well. The interaction between MDs and AI enjoys renewed interest following the increased possibilities of deep learning devices. However, we still have limited evidence-based knowledge of the context, design, and psychological mechanisms that craft an optimal human-AI collaboration. In this multicentric study, 21 endoscopists reviewed 504 videos of lesions prospectively acquired from real colonoscopies. They were asked to provide an optical diagnosis with and without the assistance of an AI support system. Endoscopists were influenced by AI (OR = 3.05), but not erratically: they followed the AI advice more when it was correct (OR = 3.48) than when it was incorrect (OR = 1.85). Endoscopists achieved this outcome through a weighted integration of their own and the AI's opinions, considering the case-by-case estimates of the two reliabilities. This Bayesian-like rational behavior allowed the human-AI hybrid team to outperform both agents taken alone. We discuss the features of the human-AI interaction that determined this favorable outcome.
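The "weighted integration" of two opinions according to their case-by-case reliabilities can be illustrated with a minimal sketch. It is not the authors' statistical model: it assumes a binary diagnosis, that each agent reports a probability for the same class, that the two opinions are conditionally independent, and that both start from the same prior. The function names `logit` and `combine_opinions` are hypothetical and chosen only for this illustration.

```python
import math


def logit(p: float) -> float:
    """Log-odds of a probability strictly between 0 and 1."""
    return math.log(p / (1.0 - p))


def combine_opinions(p_human: float, p_ai: float, prior: float = 0.5) -> float:
    """Combine two probability estimates for the same binary diagnosis.

    Assumption (illustrative, not from the paper): the two opinions are
    conditionally independent and share the same prior, so their evidence
    adds up in log-odds space (a naive-Bayes style combination).
    """
    log_odds = logit(p_human) + logit(p_ai) - logit(prior)
    return 1.0 / (1.0 + math.exp(-log_odds))


# An unsure endoscopist (0.6) paired with a confident AI (0.9):
# the combined estimate leans strongly toward the shared diagnosis.
agree = combine_opinions(0.6, 0.9)

# A confident endoscopist (0.9) paired with a weakly disagreeing AI (0.2):
# the combined estimate stays on the endoscopist's side, but less firmly.
disagree = combine_opinions(0.9, 0.2)
```

In this sketch an opinion held with more confidence moves the combined estimate more, which mirrors the qualitative pattern reported above: advice is followed more when it is more likely to be reliable, rather than uniformly.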

Reverberi, Carlo;Rigon, Tommaso;Solari, Aldo;Hassan, Cesare;Cherubini, Paolo;Cherubini, Andrea

2022-01-01


Publication year: 2022
Journal: Scientific Reports
Volume: 12
DOI: https://dx.doi.org/10.1038/s41598-022-18751-2
Appears in categories: 2.1 Journal article

Files in this item:

File	Size	Format
s41598-022-18751-2.pdf (open access; type: publisher's version; license: Creative Commons)	1.29 MB	Adobe PDF

Documents in ARCA are protected by copyright, and all rights are reserved unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10278/5044570
