New Study Examines Short-Term Consistency of Large Language Models in Radiology
Diagnostic Imaging
NOVEMBER 22, 2024
While GPT-4 demonstrated higher overall accuracy than other large language models in answering multiple-choice questions from the ACR Diagnostic Radiology In-Training Exam, researchers noted an eight percent decrease in GPT-4’s accuracy from the first month to the third month of the study.