In this paper we will discuss the concept of the minimum level of accuracy an Artificial Intelligence system must exhibit in medical settings to be fit to its intended use and improve the daily practice of its intended users, the medical doctors. We will consider simple binary classification tasks in both diagnostic and prognostic ambit (like to discriminate between normal/abnormal case, and improvement/no improvement prospects). We will make the point that the common ways to determine this minimum acceptable accuracy are fraught with many conceptual and practical troubles. We will report about a small user study conducted to elicit the discriminative requirements from a sample of medical doctors, stratified both in general practitioners and specialists. Finally, we will present a simple nomogram by which to determine the minimum accuracy of a technological aid, once the human average performance and the desired level of accuracy are known. The nomogram is to be intended as a provocative simple tool to recognize that the technological tool is less important than a sound protocol in which to use it, responsibly and paying due attention to the role of the human decision makers.
Cabitza, F., Campagner, A., Zotti, F., Ravizza, A., Sternini, F. (2020). All you need is higher accuracy? On the quest for minimum acceptable accuracy for medical artificial intelligence. In Proceedings of the 12th IADIS International Conference e-Health 2020, EH 2020 - Part of the 14th Multi Conference on Computer Science and Information Systems, MCCSIS 2020 (pp. 159-166). IADIS.
All you need is higher accuracy? On the quest for minimum acceptable accuracy for medical artificial intelligence
Cabitza F.;Campagner A.;Ravizza A.;
2020
Abstract
In this paper we will discuss the concept of the minimum level of accuracy an Artificial Intelligence system must exhibit in medical settings to be fit to its intended use and improve the daily practice of its intended users, the medical doctors. We will consider simple binary classification tasks in both diagnostic and prognostic ambit (like to discriminate between normal/abnormal case, and improvement/no improvement prospects). We will make the point that the common ways to determine this minimum acceptable accuracy are fraught with many conceptual and practical troubles. We will report about a small user study conducted to elicit the discriminative requirements from a sample of medical doctors, stratified both in general practitioners and specialists. Finally, we will present a simple nomogram by which to determine the minimum accuracy of a technological aid, once the human average performance and the desired level of accuracy are known. The nomogram is to be intended as a provocative simple tool to recognize that the technological tool is less important than a sound protocol in which to use it, responsibly and paying due attention to the role of the human decision makers.File | Dimensione | Formato | |
---|---|---|---|
Cabitza-2020-MCCSIS-preprint.pdf
Solo gestori archivio
Tipologia di allegato:
Submitted Version (Pre-print)
Licenza:
Tutti i diritti riservati
Dimensione
754.88 kB
Formato
Adobe PDF
|
754.88 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.