Key concepts, common pitfalls, and best practices in artificial intelligence and machine learning: focus on radiomics
PDF
Cite
Share
Request
Artificial Intelligence and Informatics - Review
P: 450-462
September 2022

Key concepts, common pitfalls, and best practices in artificial intelligence and machine learning: focus on radiomics

Diagn Interv Radiol 2022;28(5):450-462
1. Department of Radiology, Başakşehir Cam and Sakura City Hospital, İstanbul, Turkey
No information available.
No information available
Received Date: 10.01.2022
Accepted Date: 18.03.2022
PDF
Cite
Share
Request

ABSTRACT

Artificial intelligence (AI) and machine learning (ML) are increasingly used in radiology research to deal with large and complex imaging data sets. Nowadays, ML tools have become easily accessible to anyone. Such a low threshold to accessibility might lead to inappropriate usage and misinterpretation, without a clear intention. Therefore, ensuring methodological rigor is of paramount importance. Getting closer to the real-world clinical implementation of AI, a basic understanding of the main concepts should be a must for every radiology professional. In this respect, simplified explanations of the key concepts along with pitfalls and recommendations would be helpful for general radiology community to develop and improve their AI mindset. In this work, twenty-two key issues are reviewed within three categories: pre-modeling, modeling, and post-modeling. Firstly, the concept is shortly defined for each issue. Then, related common pitfalls and best practices are provided. Specifically, the issues included in this paper were validity of scientific question, unrepresentative samples, sample size, missing data, quality of reference standard, batch effect, reliability of features, feature scaling, multi-collinearity, class imbalance, data and target leakage, high-dimensional data, optimization, overfitting, generalization, performance metrics, clinical utility, comparison with conventional statistical and clinical methods, interpretability and explainability, randomness, transparent reporting, and sharing data.