Evaluating ChatGPT-5 for Detecting Potential Drug-Drug Interactions in Intensive Care: A Comparative Analysis with a Clinical Decision Support System Background: Polypharmacy is a frequent challenge in intensive care units (ICUs), where critically ill patients are exposed to multiple concurrent medications. This situation significantly increases the risk of potential drug-drug interactions (pDDIs), which may contribute to adverse drug events, prolonged ICU stays, and higher morbidity and mortality rates. Ensuring timely and accurate detection of pDDIs is therefore a cornerstone of patient safety in critical care settings. Traditional rule-based clinical decision support systems (CDSSs), such as the UpToDate Drug Interaction Checker, provide standardized alerts but may have limitations in contextual interpretation and adaptability. Recently, large language models (LLMs), such as ChatGPT-4.0, have emerged as advanced tools with natural language processing capabilities, potentially offering a novel approach to medication safety. Objective: This study aims to compare the performance of ChatGPT-4.0 with the UpToDate Drug Interaction Checker in identifying, classifying, and interpreting potential drug-drug interactions within real ICU patient medication orders. Methods: A retrospective dataset of ICU patient orders will be systematically analyzed using both ChatGPT-4.0 and the UpToDate Drug Interaction Checker. Each potential interaction will be assessed for sensitivity, specificity, accuracy, and clinical relevance. Discrepancies between the two systems will be documented and evaluated by independent critical care experts. Statistical analysis will be performed to compare detection rates and the qualitative depth of interaction explanations provided by each tool. Expected Outcomes: The study is expected to determine whether ChatGPT-4.0, as an AI-based system, can enhance the detection of clinically meaningful drug-drug interactions compared to traditional CDSS. The results may inform future integration of generative AI into ICU clinical workflows and contribute to safer pharmacotherapy practices in critical care. Conclusion: By directly comparing a state-of-the-art LLM with a widely used rule-based system, this study seeks to highlight the strengths, weaknesses, and potential clinical implications of generative AI in the domain of drug safety.
Age range
18 Years
Sex
ALL
See this in plain English?
AI-rewrites the medical criteria so a patient or caregiver can understand them. Always confirm with the trial site.
Bring these to your next appointment. They're a starting point for a shared conversation — not a sign you qualify or a recommendation to enrol.
Generated to help you prepare — always confirm anything about your own eligibility and care with the study team and your doctor.
The trial coordinator is the person who runs the study day to day. These cover the practical side — logistics, costs, and what taking part would actually mean for your life. The study team confirms whether you meet the criteria; these are questions to ask, not a sign you qualify.
A starting point for the conversation — always confirm anything about your own eligibility, costs, and care with the study team and your doctor.
Sensitivity of pDDI Detection
Timeframe: September 1 2025 to october 1 2025
Accuracy of Drug-Drug Interaction Detection of chatgpt
Timeframe: from seprember 1 2025 to october 1 2025