高级检索
当前位置: 首页 > 详情页

Clinical application potential of large language model: a study based on thyroid nodules

文献详情

资源类型:
WOS体系:
Pubmed体系:

收录情况: ◇ SCIE

机构: [1]Shanghai Jiao Tong Univ, Sch Med, Ruijin Hosp, Dept Ultrasound, Shanghai, Peoples R China [2]Shanghai Jiao Tong Univ, Coll Hlth Sci & Technol, Sch Med, Shanghai, Peoples R China [3]Kongjiang Hosp, Dept Endocrinol, Shanghai, Peoples R China [4]Xianning 1 Peoples Hosp, Dept Thyroid & Breast Surg, Xianning, Peoples R China [5]Handan Hangang Hosp, Dept Thyroid, Handan, Hebei, Peoples R China [6]Zhengzhou Univ, Dept Thyroid Surg, Affiliated Hosp 1, Zhengzhou, Peoples R China [7]LiuYang Peoples Hosp, Thyroid & Breast Surg, Changsha, Peoples R China [8]Fourth Mil Med Univ, Xijing Hosp, Dept Thyroid Breast & Vasc Surg, Xian, Shanxi, Peoples R China [9]Shanxi Prov Canc Hosp, Dept Head & Neck Surg, Taiyuan, Peoples R China [10]Hosp Chengdu Univ Tradit Chinese Med, Dept Endocrinol, Chengdu, Peoples R China [11]Mazhanghuiwen Hosp, Dept Surg, Zhanjiang, Guangdong, Peoples R China [12]Lianshui Peoples Hosp, Endocrine Dept, Huaian, Jiangsu, Peoples R China [13]Kunming Univ Sci & Technol, Anning Peoples Hosp 1, Dept Ultrasound, Anning, Yunnan, Peoples R China [14]Shanghai Jiao Tong Univ, Sch Med, Inst Med Sci, Dept Biostat, Shanghai, Peoples R China
出处:
ISSN:

关键词: Artificial intelligence LLM ChatGPT New Bing Chat

摘要:
Background Limited data indicated the performance of large language model (LLM) taking on the role of doctors. We aimed to investigate the potential for ChatGPT-3.5 and New Bing Chat acting as doctors using thyroid nodules as an example. Methods A total of 145 patients with thyroid nodules were included for generating questions. Each question was entered into chatbot of ChatGPT-3.5 and New Bing Chat five times and five responses were acquired respectively. These responses were compared with answers given by five junior doctors. Responses from five senior doctors were regarded as gold standard. Accuracy and reproducibility of responses from ChatGPT-3.5 and New Bing Chat were evaluated. Results The accuracy of ChatGPT-3.5 and New Bing Chat in answering Q2, Q3, Q5 were lower than that of junior doctors (all P < 0.05), while both LLMs were comparable to junior doctors when answering Q4 and Q6. In terms of "high reproducibility and accuracy", ChatGPT-3.5 outperformed New Bing Chat in Q1 and Q5 (P < 0.001 and P = 0.008, respectively), but showed no significant difference in Q2, Q3, Q4, and Q6 (P > 0.05 for all). New Bing Chat generated higher accuracy than ChatGPT-3.5 (72.41% vs 58.62%) (P = 0.003) in decision making of thyroid nodules, and both were less accurate than junior doctors (89.66%, P < 0.001 for both). Conclusions The exploration of ChatGPT-3.5 and New Bing Chat in the diagnosis and management of thyroid nodules illustrates that LLMs currently demonstrate the potential for medical applications, but do not yet reach the clinical decision-making capacity of doctors.

基金:
语种:
WOS:
PubmedID:
中科院(CAS)分区:
出版当年[2024]版:
最新[2023]版:
大类 | 3 区 医学
小类 | 3 区 内分泌学与代谢
JCR分区:
出版当年[2023]版:
Q2 ENDOCRINOLOGY & METABOLISM
最新[2023]版:
Q2 ENDOCRINOLOGY & METABOLISM

影响因子: 最新[2023版] 最新五年平均 出版当年[2023版] 出版当年五年平均 出版前一年[2022版]

第一作者:
第一作者机构: [1]Shanghai Jiao Tong Univ, Sch Med, Ruijin Hosp, Dept Ultrasound, Shanghai, Peoples R China [2]Shanghai Jiao Tong Univ, Coll Hlth Sci & Technol, Sch Med, Shanghai, Peoples R China
通讯作者:
通讯机构: [1]Shanghai Jiao Tong Univ, Sch Med, Ruijin Hosp, Dept Ultrasound, Shanghai, Peoples R China [2]Shanghai Jiao Tong Univ, Coll Hlth Sci & Technol, Sch Med, Shanghai, Peoples R China
推荐引用方式(GB/T 7714):
APA:
MLA:

资源点击量:82494 今日访问量:0 总访问量:681 更新日期:2025-01-01 建议使用谷歌、火狐浏览器 常见问题

版权所有©2020 云南省第一人民医院 技术支持:重庆聚合科技有限公司 地址:云南省昆明市西山区金碧路157号