Mitigating Automation Bias in Physician-LLM Diagnostic Reasoning Using Behavioral Nudges
NCT07328815 · Status: RECRUITING · Phase: NA · Type: INTERVENTIONAL · Enrollment: 50
Last updated 2026-03-31
Summary
The goal of this randomized controlled trial is to evaluate whether behavioral nudges can reduce automation bias, the uncritical acceptance of automated output, in physicians using large language models (LLM) like ChatGPT-5.1 for clinical decision-making.
The main question it aims to answer is: Does a dual-mechanism behavioral nudge intervention (baseline accuracy anchoring plus case-specific color-coded confidence signals) reduce physicians' uncritical acceptance of incorrect LLM recommendations?
Researchers will compare physicians who receive LLM recommendations along with a behavioral nudge to those who receive LLM recommendations without the nudge to assess if the nudge reduces automation bias.
Participants will:
* Evaluate six clinical vignettes accompanied by LLM-generated recommendations (half containing deliberate, clinically significant errors).
* Control group: Be able to view LLM recommendations in standard format without the nudge.
* Treatment group: Be able to view ChatGPT's diagnostic accuracy on standard medical datasets as an initial anchor, then receive color-coded confidence signals alongside each recommendation (e.g., red for low confidence).
* Have their responses evaluated by blinded reviewers using an expert-developed assessment rubric to detect uncritical acceptance of erroneous information.
Conditions
- Diagnosis
Interventions
- OTHER
-
Behavioral Nudge Intervention
Participants in the treatment group will receive a behavioral nudge intervention embedded in the LLM recommendations interface that presents two synchronized cognitive cues when the LLM panel is expanded: (1) an anchoring cue displaying ChatGPT's baseline diagnostic accuracy on standard medical datasets at the top of the panel to set realistic expectations before viewing the specific recommendation, and (2) a selective attention cue located immediately below, which shows the LLM recommendation alongside a case-specific and color-coded confidence signal. This signal is categorized as red when the mean ensemble confidence falls below the established baseline accuracy, flagging high-uncertainty cases that demand critical evaluation; orange when confidence meets or exceeds the baseline but remains below 100%, intended to prevent complacency and maintain active clinical scrutiny; and green for a 100% ensemble consensus, though standard cautionary warnings still apply to guard against.
Sponsors & Collaborators
-
Lahore University of Management Sciences
lead OTHER
Principal Investigators
-
Ihsan Ayyub Qazi, PhD · Lahore University of Management Sciences (LUMS)
-
Muhammad Hamad Alizai, PhD · Lahore University of Management Sciences (LUMS)
-
Muhammad Asadullah Khawaja, MBBS · King Edward Medical University
-
Ali Zafar Sheikh, MBBS · Lahore General Hospital
-
Muhammad Junaid Akhtar, MBBS · Children's Hospital, Lahore
Study Design
- Allocation
- RANDOMIZED
- Purpose
- DIAGNOSTIC
- Masking
- SINGLE
- Model
- PARALLEL
Eligibility
- Sex
- ALL
- Healthy Volunteers
- Yes
Timeline & Regulatory
- Start
- 2026-01-17
- Primary Completion
- 2026-07-31
- Completion
- 2026-08-31
Countries
- Pakistan
Study Locations
More Related Trials
-
Comparing Physician and Artificial Intelligence Chatbot Responses to Frequently Asked Questions From Osteoarthritis Patients
NCT07202286 ·Status: COMPLETED
-
Physician Response Evaluation With Contextual Insights vs. Standard Engines - Artificial Intelligence RAG vs LLM Clinical Decision Support
NCT07037940 ·Status: COMPLETED ·Phase: NA
-
The Diagnostic and Triage Capacity of Laypeople-large Language Model Collaboration in China
NCT07250516 ·Status: COMPLETED ·Phase: NA
-
AI-Assisted Skin Assessment for Pressure Injury Prevention in Critical Care Nurses
NCT07318571 ·Status: RECRUITING ·Phase: NA
-
Effect of Perception-based Interventions on Public Acceptance of Using Large Language Models in Medicine
NCT07304908 ·Status: ACTIVE_NOT_RECRUITING ·Phase: NA
-
Evaluation of AI-Generated Clinical Advice by Physicians
NCT06980467 ·Status: NOT_YET_RECRUITING ·Phase: NA
-
Physician Diagnosis of Neurologic Cases With Large Language Models
NCT07344792 ·Status: COMPLETED ·Phase: NA
-
AI-Assisted Pathologist Performance Improvement: A Multicenter, Prospective, Randomized Controlled Trial
NCT07291362 ·Status: ENROLLING_BY_INVITATION ·Phase: NA
-
Conversations in Health Literacy Using AI Technology for Osteoarthritis Patients
NCT06778486 ·Status: ACTIVE_NOT_RECRUITING ·Phase: NA
-
Large Language Models to Aid Gynecological Oncology Treatment
NCT06865534 ·Status: RECRUITING ·Phase: NA
-
LLM-Generated Coaching Prompts
NCT06880315 ·Status: NOT_YET_RECRUITING ·Phase: NA
-
Attitudes and Perceptions of Corresponding Authors From Top International Medical Journals Regarding the Use of Artificial Intelligence in the Scientific Process
NCT07326501 ·Status: ENROLLING_BY_INVITATION
-
Evaluating AI-Generated Plain Language Summaries on Patient Comprehension of Ophthalmology Notes Among English-Speaking Patients
NCT06859216 ·Status: RECRUITING ·Phase: NA
-
AI-LLM Communication Aid in Prostate Cancer Care (AI-CAP)
NCT07082049 ·Status: ENROLLING_BY_INVITATION ·Phase: NA
-
Effect of Large Language Model in Assisting Discharge Summary Notes Writing for Hospitalized Patients
NCT06263855 ·Status: WITHDRAWN ·Phase: NA
-
Large Language Models To Improve the Quality of Care of Cardiology Patients
NCT06935253 ·Status: RECRUITING ·Phase: NA
-
Multi-Disciplinary Treatment on the Anthropomorphism of Large Language Models
NCT06627985 ·Status: NOT_YET_RECRUITING
-
NLP-Based Feedback to Improve Risk Comms and Informed Shared Decision Making
NCT05923684 ·Status: RECRUITING ·Phase: NA
-
Diagnostic Reasoning With Customized GPT-4 Model
NCT06911645 ·Status: COMPLETED ·Phase: NA
-
Enhancing Medical Researchers' Self-learning With an Intelligent Language Model
NCT06015178 ·Status: UNKNOWN ·Phase: NA
-
Effectiveness of an Interactive Consulting System to Enhance Patients' Decision-making: A Randomized Controlled Trial
NCT04262596 ·Status: COMPLETED ·Phase: NA
-
A Pilot Randomized Controlled Trial of Chatbot Use in Older Adults for Lifestyle Change
NCT06901037 ·Status: RECRUITING ·Phase: NA
-
Large Linguistic Model for Clinical Reaoning of Physical Therapy Students
NCT06809634 ·Status: RECRUITING ·Phase: PHASE2
-
AI-Driven Smart Learning Platform for University Students
NCT06848803 ·Status: COMPLETED ·Phase: NA
-
Evaluate the Performance of Large Language Models in Ophthalmologic Patient Consultation
NCT06824389 ·Status: COMPLETED ·Phase: NA