详情页 - 云南省第一人民医院机构库

当前位置：首页 > 详情页

Clinical application potential of large language model: a study based on thyroid nodules

84| 认领 | 导出 | 链接全文 |

文献详情

资源类型：

WOS体系：

Pubmed体系：

收录情况： ◇ SCIE

作者：

机构： [1]Shanghai Jiao Tong Univ, Sch Med, Ruijin Hosp, Dept Ultrasound, Shanghai, Peoples R China [2]Shanghai Jiao Tong Univ, Coll Hlth Sci & Technol, Sch Med, Shanghai, Peoples R China [3]Kongjiang Hosp, Dept Endocrinol, Shanghai, Peoples R China [4]Xianning 1 Peoples Hosp, Dept Thyroid & Breast Surg, Xianning, Peoples R China [5]Handan Hangang Hosp, Dept Thyroid, Handan, Hebei, Peoples R China [6]Zhengzhou Univ, Dept Thyroid Surg, Affiliated Hosp 1, Zhengzhou, Peoples R China [7]LiuYang Peoples Hosp, Thyroid & Breast Surg, Changsha, Peoples R China [8]Fourth Mil Med Univ, Xijing Hosp, Dept Thyroid Breast & Vasc Surg, Xian, Shanxi, Peoples R China [9]Shanxi Prov Canc Hosp, Dept Head & Neck Surg, Taiyuan, Peoples R China [10]Hosp Chengdu Univ Tradit Chinese Med, Dept Endocrinol, Chengdu, Peoples R China [11]Mazhanghuiwen Hosp, Dept Surg, Zhanjiang, Guangdong, Peoples R China [12]Lianshui Peoples Hosp, Endocrine Dept, Huaian, Jiangsu, Peoples R China [13]Kunming Univ Sci & Technol, Anning Peoples Hosp 1, Dept Ultrasound, Anning, Yunnan, Peoples R China [14]Shanghai Jiao Tong Univ, Sch Med, Inst Med Sci, Dept Biostat, Shanghai, Peoples R China

出处：

DOI：

ISSN：

关键词： Artificial intelligence LLM ChatGPT New Bing Chat

摘要：

Background Limited data indicated the performance of large language model (LLM) taking on the role of doctors. We aimed to investigate the potential for ChatGPT-3.5 and New Bing Chat acting as doctors using thyroid nodules as an example. Methods A total of 145 patients with thyroid nodules were included for generating questions. Each question was entered into chatbot of ChatGPT-3.5 and New Bing Chat five times and five responses were acquired respectively. These responses were compared with answers given by five junior doctors. Responses from five senior doctors were regarded as gold standard. Accuracy and reproducibility of responses from ChatGPT-3.5 and New Bing Chat were evaluated. Results The accuracy of ChatGPT-3.5 and New Bing Chat in answering Q2, Q3, Q5 were lower than that of junior doctors (all P < 0.05), while both LLMs were comparable to junior doctors when answering Q4 and Q6. In terms of "high reproducibility and accuracy", ChatGPT-3.5 outperformed New Bing Chat in Q1 and Q5 (P < 0.001 and P = 0.008, respectively), but showed no significant difference in Q2, Q3, Q4, and Q6 (P > 0.05 for all). New Bing Chat generated higher accuracy than ChatGPT-3.5 (72.41% vs 58.62%) (P = 0.003) in decision making of thyroid nodules, and both were less accurate than junior doctors (89.66%, P < 0.001 for both). Conclusions The exploration of ChatGPT-3.5 and New Bing Chat in the diagnosis and management of thyroid nodules illustrates that LLMs currently demonstrate the potential for medical applications, but do not yet reach the clinical decision-making capacity of doctors.

基金：

语种：

被引次数：

WOS：

PubmedID：

中科院(CAS)分区：

出版当年[2025]版：

无

最新[2025]版：

大类 | 3 区医学

小类 | 3 区内分泌学与代谢

JCR分区：

出版当年[2024]版：

无

最新[2023]版：

Q2 ENDOCRINOLOGY & METABOLISM

影响因子： 3 最新[2023版] 3.1 最新五年平均 0 出版当年[2024版] 0 出版当年五年平均 3 出版前一年[2023版]

第一作者：

第一作者机构： [1]Shanghai Jiao Tong Univ, Sch Med, Ruijin Hosp, Dept Ultrasound, Shanghai, Peoples R China [2]Shanghai Jiao Tong Univ, Coll Hlth Sci & Technol, Sch Med, Shanghai, Peoples R China

通讯作者：

通讯机构： [1]Shanghai Jiao Tong Univ, Sch Med, Ruijin Hosp, Dept Ultrasound, Shanghai, Peoples R China [2]Shanghai Jiao Tong Univ, Coll Hlth Sci & Technol, Sch Med, Shanghai, Peoples R China

推荐引用方式(GB/T 7714)：

APA：

MLA：

Clinical application potential of large language model: a study based on thyroid nodules

文献详情

相关文献