The goal of this randomized controlled trial is to evaluate whether behavioral nudges can reduce automation bias, the uncritical acceptance of automated output, in physicians using large language models (LLMs) such as ChatGPT-5.1 for clinical decision-making. The main question it aims to answer is: Does a dual-mechanism behavioral nudge intervention (baseline accuracy anchoring plus case-specific color-coded confidence signals) reduce physicians' uncritical acceptance of incorrect LLM recommendations? Researchers will compare physicians who receive LLM recommendations with a behavioral nudge to those who receive LLM recommendations without the nudge, to assess whether the nudge reduces automation bias.

Participants will:

* Evaluate six clinical vignettes accompanied by LLM-generated recommendations, half of which contain deliberate, clinically significant errors.
* Control group: view LLM recommendations in a standard format, without the nudge.
* Treatment group: view ChatGPT's diagnostic accuracy on standard medical datasets as an initial anchor, then receive a color-coded confidence signal alongside each recommendation (e.g., red for low confidence).
* Have their responses evaluated by blinded reviewers using an expert-developed assessment rubric to detect uncritical acceptance of erroneous information.
Diagnostic reasoning accuracy score
Timeframe: Assessed at a single time point for each case, during the scheduled diagnostic reasoning evaluation session, which takes place 0 to 5 days after participant enrollment.