Experimental evidence of effective human-AI collaboration in medical decision-making

Rigon, Tommaso; Solari, Aldo
2022-01-01

Abstract

Artificial Intelligence (AI) systems are a valuable support for decision-making, with many applications in the medical domain as well. The interaction between MDs and AI has attracted renewed interest following the increased capabilities of deep learning devices. However, we still have limited evidence-based knowledge of the context, design, and psychological mechanisms that shape an optimal human-AI collaboration. In this multicentric study, 21 endoscopists reviewed 504 videos of lesions prospectively acquired from real colonoscopies. They were asked to provide an optical diagnosis with and without the assistance of an AI support system. Endoscopists were influenced by AI (odds ratio, OR = 3.05), but not erratically: they followed the AI advice more when it was correct (OR = 3.48) than when it was incorrect (OR = 1.85). Endoscopists achieved this outcome through a weighted integration of their own and the AI's opinions, considering case-by-case estimates of the two reliabilities. This Bayesian-like rational behavior allowed the human-AI hybrid team to outperform both agents taken alone. We discuss the features of the human-AI interaction that determined this favorable outcome.
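The "weighted integration" of two opinions by their estimated reliabilities can be illustrated with a minimal sketch. This is not the authors' model or analysis code. It assumes a binary diagnosis, treats each agent's reliability as a symmetric accuracy (the probability of being right whichever the true class), and assumes the two opinions are conditionally independent. The function name `combine_opinions` and all parameter names are hypothetical.

```python
import math


def logit(p: float) -> float:
    """Log-odds of a probability strictly between 0 and 1."""
    return math.log(p / (1.0 - p))


def combine_opinions(
    human_says_positive: bool,
    human_reliability: float,
    ai_says_positive: bool,
    ai_reliability: float,
    prior: float = 0.5,
) -> float:
    """Posterior probability that the case is positive.

    Illustrative assumptions (not taken from the study):
    - each reliability is a symmetric accuracy in (0, 1);
    - the two opinions are independent given the true class.
    Under these assumptions each opinion shifts the log-odds by
    +logit(reliability) if it says "positive" and by
    -logit(reliability) if it says "negative".
    """
    log_odds = logit(prior)
    log_odds += logit(human_reliability) * (1 if human_says_positive else -1)
    log_odds += logit(ai_reliability) * (1 if ai_says_positive else -1)
    return 1.0 / (1.0 + math.exp(-log_odds))


if __name__ == "__main__":
    # Agreement between the two agents strengthens the shared conclusion.
    print(round(combine_opinions(True, 0.8, True, 0.9), 3))
    # On disagreement, the more reliable opinion dominates.
    print(round(combine_opinions(True, 0.8, False, 0.9), 3))
```

In this sketch, an agent whose reliability is close to 0.5 contributes almost nothing to the combined estimate, while a highly reliable agent can overturn the other's opinion. That matches, qualitatively, the pattern the abstract reports: advice is followed more when it is more likely to be correct.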
2022
12

Use this identifier to cite or link to this document: https://hdl.handle.net/10278/5044570