Progression of an Artificial Intelligence Chatbot (ChatGPT) for Pediatric Cardiology Educational Knowledge Assessment,Pediatric Cardiology

当前位置： X-MOL 学术 › Pediatr. Cardiol. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Progression of an Artificial Intelligence Chatbot (ChatGPT) for Pediatric Cardiology Educational Knowledge Assessment
Pediatric Cardiology ( IF 1.5 ) Pub Date : 2024-01-03 , DOI: 10.1007/s00246-023-03385-6
Michael N Gritti _{1,

2} , Hussain AlTurki _{2,

3} , Pedrom Farid _{1,

4} , Conall T Morgan _{1,

2}

Affiliation

Artificial intelligence chatbots, like ChatGPT, have become powerful tools that are disrupting how humans interact with technology. The potential uses within medicine are vast. In medical education, these chatbots have shown improvements, in a short time span, in generalized medical examinations. We evaluated the overall performance and improvement between ChatGPT 3.5 and 4.0 in a test of pediatric cardiology knowledge. ChatGPT 3.5 and ChatGPT 4.0 were used to answer text-based multiple-choice questions derived from a Pediatric Cardiology Board Review textbook. Each chatbot was given an 88 question test, subcategorized into 11 topics. We excluded questions with modalities other than text (sound clips or images). Statistical analysis was done using an unpaired two-tailed t-test. Of the same 88 questions, ChatGPT 4.0 answered 66% of the questions correctly (n = 58/88) which was significantly greater (p < 0.0001) than ChatGPT 3.5, which only answered 38% (33/88). The ChatGPT 4.0 version also did better on each subspeciality topic as compared to ChatGPT 3.5. While acknowledging that ChatGPT does not yet offer subspecialty level knowledge in pediatric cardiology, the performance in pediatric cardiology educational assessments showed a considerable improvement in a short period of time between ChatGPT 3.5 and 4.0.

中文翻译：

用于儿科心脏病学教育知识评估的人工智能聊天机器人 (ChatGPT) 的进展

ChatGPT 等人工智能聊天机器人已成为颠覆人类与技术交互方式的强大工具。在医学领域的潜在用途是巨大的。在医学教育中，这些聊天机器人在短时间内在综合医学检查中表现出了进步。我们在儿科心脏病学知识测试中评估了 ChatGPT 3.5 和 4.0 之间的整体表现和改进。 ChatGPT 3.5 和 ChatGPT 4.0 用于回答源自儿科心脏病委员会审查教科书的基于文本的多项选择题。每个聊天机器人都接受了 88 个问题测试，分为 11 个主题。我们排除了除文本（声音剪辑或图像）之外的其他形式的问题。使用不配对的双尾t检验进行统计分析。在同样的 88 个问题中，ChatGPT 4.0 正确回答了 66% 的问题 ( n = 58/88)，明显高于 ChatGPT 3.5 ( p < 0.0001)，后者仅回答了 38% (33/88)。与 ChatGPT 3.5 相比，ChatGPT 4.0 版本在每个子专业主题上也做得更好。虽然承认 ChatGPT 尚未提供儿科心脏病学的亚专业水平知识，但儿科心脏病学教育评估的表现显示在 ChatGPT 3.5 和 4.0 之间的短时间内有相当大的进步。

更新日期：2024-01-03

点击分享查看原文

点击收藏

阅读更多本刊新发论文本刊介绍/投稿指南