Evaluation of AI Large Models for Diagnosis and Treatment in Real-World Cases: Multicenter Retrospective Study
NCT07378358 · Status: RECRUITING · Type: OBSERVATIONAL · Enrollment: 800
Last updated 2026-01-30
Summary
This multicenter retrospective study aims to evaluate the diagnostic and therapeutic performance of three large language models-ChatGPT, Gemini and Deepseek-using 800 archived inpatient medical records from urology departments across four tertiary hospitals. The study will focus on the accuracy and applicability of these models in disease recognition, preliminary diagnosis and treatment recommendation generation, in order to explore their potential value and limitations in supporting clinical decision-making in real-world settings.
Conditions
- Urologic Diseases
Interventions
- OTHER
-
Large Language Model Assessment (ChatGPT, Gemini, DeepSeek)
De-identified inpatient medical records were retrospectively collected from the urology departments of four tertiary hospitals (200 cases per site, 800 in total). Each case included standardized clinical information such as demographics, chief complaint, history of present illness, past medical history, physical examination, laboratory and imaging findings, discharge diagnosis and treatment plan. To simulate the role of an AI system in a "first-visit physician" scenario, all diagnostic conclusions, differential diagnoses and treatment plans were removed before being input into the models. Three large language models (ChatGPT, Gemini and DeepSeek) were prompted with a standardized instruction: "Based on the above clinical information, provide your preliminary diagnosis, differential diagnoses and treatment recommendations." Each model generated outputs including (i) primary and secondary diagnoses, (ii) differential diagnosis lists with reasoning and (iii) preliminary treatment suggesti
Sponsors & Collaborators
-
First Affiliated Hospital of Fujian Medical University
lead OTHER
Eligibility
- Min Age
- 18 Years
- Sex
- ALL
- Healthy Volunteers
- No
Timeline & Regulatory
- Start
- 2026-01-01
- Primary Completion
- 2026-04-01
- Completion
- 2026-06-01
Countries
- China
Study Locations
More Related Trials
-
Reasoning Enrichment With Feedback From IA in NEphrology Trial
NCT07352475 ·Status: RECRUITING ·Phase: NA
-
Physician Response Evaluation With Contextual Insights vs. Standard Engines - Artificial Intelligence RAG vs LLM Clinical Decision Support
NCT07037940 ·Status: COMPLETED ·Phase: NA
-
The Diagnostic and Triage Capacity of Laypeople-large Language Model Collaboration in China
NCT07250516 ·Status: COMPLETED ·Phase: NA
-
Large Language Model-Generated Messages to Improve Guideline-Directed Medical Therapy in Heart Failure
NCT07337577 ·Status: NOT_YET_RECRUITING ·Phase: NA
-
Interpretation Performance in Chinese and Japanese Medical Consultation Scenarios
NCT06934031 ·Status: COMPLETED
-
AI-LLM Communication Aid in Prostate Cancer Care (AI-CAP)
NCT07082049 ·Status: ENROLLING_BY_INVITATION ·Phase: NA
-
Chest X-Ray Image Diagnosis and Report Generation Dedicated Model Based on Deepseek
NCT06874647 ·Status: COMPLETED
-
LLM-Guided Rehabilitation in Degenerative Knee Disease
NCT07267962 ·Status: ACTIVE_NOT_RECRUITING ·Phase: NA
-
Real-world Effectiveness Evaluation of Clinical Decision Support System Based on Artificial Intelligence (AI-CDSS)
NCT05065931 ·Status: COMPLETED
-
Performances of Large Language Models in Kidney Allograft Diagnostics
NCT07004660 ·Status: COMPLETED
-
The Application of Large Language Model in Emergency Chest Pain Triage
NCT06493175 ·Status: RECRUITING ·Phase: NA
-
Human Doctors or AI: Evaluating Patient Satisfaction in Urinary Stone Disease Consultations
NCT07111845 ·Status: COMPLETED ·Phase: NA
-
The Effects of a Large Language Model on Clinical Questioning Skills
NCT06229379 ·Status: COMPLETED ·Phase: NA
-
LLM-CoManage: Large Language Model-Enabled Co-Management of Hypertension, Diabetes, and Dyslipidemia
NCT07350486 ·Status: NOT_YET_RECRUITING ·Phase: NA
-
Evaluate the Performance of Large Language Models in Ophthalmologic Patient Consultation
NCT06824389 ·Status: COMPLETED ·Phase: NA
-
AI Models in Clinical Pathology Diagnosis: A Multicenter RCT
NCT07408167 ·Status: NOT_YET_RECRUITING ·Phase: NA
-
Clinical Application of Automated Interpretation System for Chest X-Ray Images Based on Multimodal Large Models
NCT07117266 ·Status: COMPLETED ·Phase: NA
-
Point-of-Care AI Assistance and Critical Care Outcomes: A Randomized Trial
NCT07293078 ·Status: NOT_YET_RECRUITING ·Phase: PHASE1/PHASE2
-
Physician Reasoning on Management Cases With Large Language Models
NCT06208423 ·Status: COMPLETED ·Phase: NA
-
The Influence of Patient Use of Artificial Intelligence on Doctor-Patient Interaction and Clinical Outcomes in Endometriosis Consultations
NCT07393568 ·Status: NOT_YET_RECRUITING
-
Generating Fast and Slow for Entree Level Medical Knowledge
NCT06247475 ·Status: NOT_YET_RECRUITING
-
Use of Artificial Intelligence by Urogynecologic Patients
NCT06481436 ·Status: ACTIVE_NOT_RECRUITING ·Phase: NA
-
AI-Assisted Pathologist Performance Improvement: A Multicenter, Prospective, Randomized Controlled Trial
NCT07291362 ·Status: ENROLLING_BY_INVITATION ·Phase: NA
-
The Impact of Chatbot-aid on Promoting Self-management of Men's Health in the Post COVID-19 Era
NCT05765331 ·Status: UNKNOWN ·Phase: NA
-
Evaluation of AI-Generated Clinical Advice by Physicians
NCT06980467 ·Status: NOT_YET_RECRUITING ·Phase: NA