Artificial Intelligence (Al) systems are precious support for decision-making, with many applications also in the medical domain. The interaction between mos and Al enjoys a renewed interest following the increased possibilities of deep learning devices. However, we still have limited evidence-based knowledge of the context, design, and psychological mechanisms that craft an optimal human-Al collaboration. In this multicentric study, 21 endoscopists reviewed 504 videos of lesions prospectively acquired from real colonoscopies. They were asked to provide an optical diagnosis with and without the assistance of an Al support system. Endoscopists were influenced by Al (or = 3.05), but not erratically: they followed the Al advice more when it was correct (or = 3.48) than incorrect (or = 1.85). Endoscopists achieved this outcome through a weighted integration of their and the Al opinions, considering the case-by-case estimations of the two reliabilities. This Bayesian-like rational behavior allowed the human-Al hybrid team to outperform both agents taken alone. We discuss the features of the human-Al interaction that determined this favorable outcome.
Experimental evidence of effective human-AI collaboration in medical decision-making
Rigon, Tommaso;Solari, Aldo;
2022-01-01
Abstract
Artificial Intelligence (Al) systems are precious support for decision-making, with many applications also in the medical domain. The interaction between mos and Al enjoys a renewed interest following the increased possibilities of deep learning devices. However, we still have limited evidence-based knowledge of the context, design, and psychological mechanisms that craft an optimal human-Al collaboration. In this multicentric study, 21 endoscopists reviewed 504 videos of lesions prospectively acquired from real colonoscopies. They were asked to provide an optical diagnosis with and without the assistance of an Al support system. Endoscopists were influenced by Al (or = 3.05), but not erratically: they followed the Al advice more when it was correct (or = 3.48) than incorrect (or = 1.85). Endoscopists achieved this outcome through a weighted integration of their and the Al opinions, considering the case-by-case estimations of the two reliabilities. This Bayesian-like rational behavior allowed the human-Al hybrid team to outperform both agents taken alone. We discuss the features of the human-Al interaction that determined this favorable outcome.File | Dimensione | Formato | |
---|---|---|---|
s41598-022-18751-2.pdf
accesso aperto
Tipologia:
Versione dell'editore
Licenza:
Creative commons
Dimensione
1.29 MB
Formato
Adobe PDF
|
1.29 MB | Adobe PDF | Visualizza/Apri |
I documenti in ARCA sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.