Background: Artificial Intelligence (AI), in particular, machine learning (ML) has shown promising results in coronary artery disease (CAD) or cardiovascular disease (CVD) risk prediction. Bias in ML systems is of great interest due to its over-performance and poor clinical delivery. The main objective is to understand the nature of risk-of-bias (RoB) in ML and non-ML studies for CVD risk prediction. Methods: PRISMA model was used to shortlisting 117 studies, which were analyzed to understand the RoB in ML and non-ML using 46 and 32 attributes, respectively. The mean score for each study was computed and then ranked into three ML and non-ML bias categories, namely low-bias (LB), moderate-bias (MB), and high-bias (HB), derived using two cutoffs. Further, bias computation was validated using the analytical slope method. Results: Five types of the gold standard were identified in the ML design for CAD/CVD risk prediction. The low-moderate and moderate-high bias cutoffs for 24 ML studies (5, 10, and 9 studies for each LB, MB, and HB) and 14 non-ML (3, 4, and 7 studies for each LB, MB, and HB) were in the range of 1.5 to 1.95. BiasML < Biasnon-ML by ∼43%. A set of recommendations were proposed for lowering RoB. Conclusion: ML showed a lower bias compared to non-ML. For a robust ML-based CAD/CVD prediction design, it is vital to have (i) stronger outcomes like death or CAC score or coronary artery stenosis; (ii) ensuring scientific/clinical validation; (iii) adaptation of multiethnic groups while practicing unseen AI; (iv) amalgamation of conventional, laboratory, image-based and medication-based biomarkers.

Understanding the bias in machine learning systems for cardiovascular disease risk assessment: The first of its kind review

Faa G.;Saba L.;
2022-01-01

Abstract

Background: Artificial Intelligence (AI), in particular, machine learning (ML) has shown promising results in coronary artery disease (CAD) or cardiovascular disease (CVD) risk prediction. Bias in ML systems is of great interest due to its over-performance and poor clinical delivery. The main objective is to understand the nature of risk-of-bias (RoB) in ML and non-ML studies for CVD risk prediction. Methods: PRISMA model was used to shortlisting 117 studies, which were analyzed to understand the RoB in ML and non-ML using 46 and 32 attributes, respectively. The mean score for each study was computed and then ranked into three ML and non-ML bias categories, namely low-bias (LB), moderate-bias (MB), and high-bias (HB), derived using two cutoffs. Further, bias computation was validated using the analytical slope method. Results: Five types of the gold standard were identified in the ML design for CAD/CVD risk prediction. The low-moderate and moderate-high bias cutoffs for 24 ML studies (5, 10, and 9 studies for each LB, MB, and HB) and 14 non-ML (3, 4, and 7 studies for each LB, MB, and HB) were in the range of 1.5 to 1.95. BiasML < Biasnon-ML by ∼43%. A set of recommendations were proposed for lowering RoB. Conclusion: ML showed a lower bias compared to non-ML. For a robust ML-based CAD/CVD prediction design, it is vital to have (i) stronger outcomes like death or CAC score or coronary artery stenosis; (ii) ensuring scientific/clinical validation; (iii) adaptation of multiethnic groups while practicing unseen AI; (iv) amalgamation of conventional, laboratory, image-based and medication-based biomarkers.
2022
And unseen data
Artificial intelligence
Bias
Carotid ultrasound
Coronary artery disease
Gold standard
Risk prediction
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/365963
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 33
  • ???jsp.display-item.citation.isi??? 31
social impact