Evaluation of AI Large Models for Diagnosis and Treatment in Real-World Cases: Multicenter Retrospective Study

NCT07378358 · Status: RECRUITING · Type: OBSERVATIONAL · Enrollment: 800

Last updated 2026-01-30

No results posted yet for this study

Summary

This multicenter retrospective study aims to evaluate the diagnostic and therapeutic performance of three large language models-ChatGPT, Gemini and Deepseek-using 800 archived inpatient medical records from urology departments across four tertiary hospitals. The study will focus on the accuracy and applicability of these models in disease recognition, preliminary diagnosis and treatment recommendation generation, in order to explore their potential value and limitations in supporting clinical decision-making in real-world settings.

Conditions

Urologic Diseases

Interventions

OTHER

Large Language Model Assessment (ChatGPT, Gemini, DeepSeek)

De-identified inpatient medical records were retrospectively collected from the urology departments of four tertiary hospitals (200 cases per site, 800 in total). Each case included standardized clinical information such as demographics, chief complaint, history of present illness, past medical history, physical examination, laboratory and imaging findings, discharge diagnosis and treatment plan. To simulate the role of an AI system in a "first-visit physician" scenario, all diagnostic conclusions, differential diagnoses and treatment plans were removed before being input into the models. Three large language models (ChatGPT, Gemini and DeepSeek) were prompted with a standardized instruction: "Based on the above clinical information, provide your preliminary diagnosis, differential diagnoses and treatment recommendations." Each model generated outputs including (i) primary and secondary diagnoses, (ii) differential diagnosis lists with reasoning and (iii) preliminary treatment suggesti

Evaluation of AI Large Models for Diagnosis and Treatment in Real-World Cases: Multicenter Retrospective Study

Summary

Conditions

Interventions

Sponsors & Collaborators

Eligibility

Timeline & Regulatory

Countries

Study Locations

More Related Trials

Summary

Conditions

Interventions

Sponsors & Collaborators

Eligibility

Timeline & Regulatory

Countries

Study Locations

Related Clinical Trials

Improving Patient Understanding of Their Prostate Cancer Diagnosis Using AI

Tracking AI/LLM Literacy and Knowledge in Urology Outpatients (TALK-U)

Multi-Disciplinary Treatment on the Anthropomorphism of Large Language Models

Physician Reasoning on Diagnostic Cases With Large Language Models

Multi-agent LLMs for Decision Support in Cervical Cancer During Pregnancy

More Related Trials