Reply: evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System (BI-RADS) Atlas 5th edition
PDF
Cite
Share
Request
Artificıal Intelligence And Informatics - Letter To The Editor
VOLUME: ISSUE:
P: -

Reply: evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System (BI-RADS) Atlas 5th edition

1. Kırıkkale Yüksek İhtisas Hospital, Clinic of Radiology, Kırıkkale, Türkiye
2. Mamak State Hospital, Clinic of Radiology, Ankara, Türkiye
3. Ankara 29 Mayıs State Hospital, Clinic of Radiology, Ankara, Türkiye
4. Ankara Yıldırım Beyazıt University, Ankara Bilkent City Hospital, Department of Radiology, Ankara, Türkiye
No information available.
No information available
Received Date: 24.03.2025
Accepted Date: 20.04.2025
Online Date: 20.05.2025
PDF
Cite
Share
Request

Dear Editor,

We sincerely thank the author for their insightful comments1 and valuable suggestions regarding our manuscript titled “Evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System Atlas 5th edition.2 We appreciate the author’s interest and the constructive proposal to incorporate retrieval-augmented generation (RAG) methodologies.1

We fully agree that employing RAG could enhance the accuracy, contextual relevance, and reliability of responses generated by large language models (LLMs), particularly when addressing complex clinical scenarios such as those encountered in breast radiology.2 As noted, RAG effectively mitigates limitations inherent in static models, including knowledge gaps and the risk of hallucinations, by dynamically retrieving relevant external information.3, 4

The examples provided by the author, including the promising results reported by Tozuka et al. in lung cancer tumor, node, metastasis staging using Google’s NotebookLM with RAG, clearly demonstrate the considerable potential of this approach in radiological contexts.5

Given these compelling points, we agree that incorporating RAG methodologies in future research would be highly valuable. Our current study did not include RAG, as its initial scope was limited to evaluating the inherent capabilities of standalone LLMs compared with radiologists based solely on the models’ existing training data. Nonetheless, we acknowledge that future investigations involving retrieval-based augmentation could yield further insights into enhancing LLM performance in clinical radiology decision-making and guideline adherence.6

We appreciate the constructive input and believe that combining LLMs with RAG techniques in future work could substantially advance radiology education and clinical practice.

Thank you again for your thoughtful recommendations and contribution to this important discussion.

Conflict of interest disclosure

The authors declared no conflicts of interest.

References

1
Kaba E. Retrieval-augmented generation for answering breast imaging reporting and data system (BI-RADS)-related questions with large language models.Diagn Interv Radiol.
2
Güneş YC, Cesur T, Çamur E, Günbey Karabekmez L. Evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System (BI-RADS) Atlas 5th edition. Diagn Interv Radiol. 2025;31(2):111-129.
3
Zakka C, Shad R, Chaurasia A, et al. Almanac - retrieval-augmented language models for clinical medicine.NEJM AI. 2024;1(2):10.
4
Steybe D, Poxleitner P, Aljohani S, et al. Evaluation of a context-aware chatbot using retrieval-augmented generation for answering clinical questions on medication-related osteonecrosis of the jaw.J Craniomaxillofac Surg. 2025;53(4):355-360.
5
Tozuka R, Johno H, Amakawa A, et al. Application of NotebookLM, a large language model with retrieval-augmented generation, for lung cancer staging.Jpn J Radiol. 2025;43(4):706-712.
6
Liu S, Wright AP, Patterson BL, et al. Using AI-generated suggestions from ChatGPT to optimize clinical decision support.J Am Med Inform Assoc. 2023;30(7):1237-1245.