Evaluating ChatGPT's Accuracy in Responding to Patient Education Questions on Acute Kidney Injury and Continuous Renal Replacement Therapy.,Blood Purification

当前位置： X-MOL 学术 › Blood Purif. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Evaluating ChatGPT's Accuracy in Responding to Patient Education Questions on Acute Kidney Injury and Continuous Renal Replacement Therapy.
Blood Purification ( IF 2.2 ) Pub Date : 2024-04-26 , DOI: 10.1159/000539065
Mohammad Salman Sheikh ₁ , Charat Thongprayoon ₁ , Supawadee Suppadungsuk _{1,

2} , Jing Miao ₁ , Fawad Qureshi ₁ , Kianoush Kashani _{1,

3} , Wisit Cheungpasitporn ₁

Affiliation

INTRODUCTION Acute kidney injury (AKI) and continuous renal replacement therapy (CRRT) are critical areas in nephrology. The effectiveness of ChatGPT in simpler, patient education-oriented questions has not been thoroughly assessed. This study evaluates the proficiency of ChatGPT 4.0 in responding to such questions, subjected to various linguistic alterations. METHODS Eighty-nine questions were sourced from the Mayo Clinic Handbook for educating patients on AKI and CRRT. These questions were categorized as original, paraphrased with different interrogative adverbs, paraphrased resulting in incomplete sentences, and paraphrased containing misspelled words. Two nephrologists verified the questions for medical accuracy. A χ2 test was conducted to ascertain notable discrepancies in ChatGPT 4.0's performance across these formats. RESULTS ChatGPT provided notable accuracy in handling a variety of question formats for patient education in AKI and CRRT. Across all question types, ChatGPT demonstrated an accuracy of 97% for both original and adverb-altered questions and 98% for questions with incomplete sentences or misspellings. Specifically for AKI-related questions, the accuracy was consistently maintained at 97% for all versions. In the subset of CRRT-related questions, the tool achieved a 96% accuracy for original and adverb-altered questions, and this increased to 98% for questions with incomplete sentences or misspellings. The statistical analysis revealed no significant difference in performance across these varied question types (p value: 1.00 for AKI and 1.00 for CRRT), and there was no notable disparity between the artificial intelligence (AI)'s responses to AKI and CRRT questions (p value: 0.71). CONCLUSION ChatGPT 4.0 demonstrates consistent and high accuracy in interpreting and responding to queries related to AKI and CRRT, irrespective of linguistic modifications. These findings suggest that ChatGPT 4.0 has the potential to be a reliable support tool in the delivery of patient education, by accurately providing information across a range of question formats. Further research is needed to explore the direct impact of AI-generated responses on patient understanding and education outcomes.

中文翻译：

评估 ChatGPT 在回答有关急性肾损伤和持续肾脏替代治疗的患者教育问题方面的准确性。

简介急性肾损伤（AKI）和连续肾脏替代治疗（CRRT）是肾脏病学的关键领域。 ChatGPT 在更简单、以患者教育为导向的问题中的有效性尚未得到彻底评估。本研究评估了 ChatGPT 4.0 在进行各种语言更改后回答此类问题的熟练程度。方法 89 个问题来自 Mayo Clinic 手册，用于对患者进行 AKI 和 CRRT 教育。这些问题被分类为原始问题、用不同疑问副词进行释义、释义导致句子不完整以及释义包含拼写错误的单词。两位肾病专家验证了这些问题的医学准确性。进行 χ2 测试以确定 ChatGPT 4.0 在这些格式中的性能是否存在显着差异。结果 ChatGPT 在处理 AKI 和 CRRT 患者教育的各种问题格式方面提供了显着的准确性。在所有问题类型中，ChatGPT 对于原始问题和副词更改问题的准确率均为 97%，对于句子不完整或拼写错误的问题的准确率为 98%。特别是对于 AKI 相关问题，所有版本的准确率始终保持在 97%。在 CRRT 相关问题的子集中，该工具对原始问题和副词更改问题的准确率达到 96%，对于句子不完整或拼写错误的问题，准确率提高到 98%。统计分析显示，这些不同问题类型的表现没有显着差异（p 值：AKI 为 1.00，CRRT 为 1.00），并且人工智能 (AI) 对 AKI 和 CRRT 问题的回答之间没有显着差异（p 值：AKI 为 1.00，CRRT 为 1.00）。值：0.71）。结论 ChatGPT 4。0 在解释和响应与 AKI 和 CRRT 相关的查询时表现出一致且高度的准确性，无论语言如何修改。这些发现表明，ChatGPT 4.0 通过准确提供各种问题格式的信息，有可能成为患者教育的可靠支持工具。需要进一步的研究来探索人工智能生成的反应对患者理解和教育结果的直接影响。

更新日期：2024-04-26

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11