,

Clinical Decision Support: Two Faces of AI from Sepsis to Early Warning

AI's most critical promise in the hospital is catching deterioration early. In sepsis, a few hours can be the difference between life and death. But the same technology, when poorly deployed, can produce alarm fatigue and distrust. This article places two opposing examples — a success and a failure — side by side for an evidence-based view of clinical decision support.

By Cem Akaltun, MD · · ~9 min read Clinical AI & LLMs

A clinical decision support system (CDSS) is software that assists physician decision-making with data and alerts. It spans a wide spectrum, from a simple drug-interaction warning to a complex machine learning model that continuously scans the electronic health record (EHR) and predicts that a patient is deteriorating. In the era of artificial intelligence (AI), the most-discussed application of these systems is sepsis early warning — and this is where AI in medicine holds both its greatest promise and its most instructive failure at the same time.

Why Sepsis?

Sepsis — a dysregulated host response to infection that leads to organ injury and can be rapidly fatal — is hard to diagnose because its early signs (fever, increased heart rate, low blood pressure) are subtle and nonspecific. The cornerstones of treatment — timely appropriate antibiotics and fluids — reduce mortality the earlier they are delivered. A system that warns "hours in advance" therefore, in theory, saves lives. The problem is translating theory into real patient outcomes.

A Success Story: TREWS and a Real Drop in Mortality

TREWS (Targeted Real-time Early Warning System), developed through a collaboration between Johns Hopkins and Bayesian Health, produced some of the strongest prospective evidence in this field. The multicenter study published in Nature Medicine in 2022 evaluated hundreds of thousands of patients in real time across five hospitals.

The critical finding was this: in sepsis patients whose alert was confirmed by a physician within 3 hours, in-hospital mortality fell by approximately 3.3 percentage points in absolute terms and about 18.7% in relative terms compared with those whose alert was not confirmed in time; organ failure and length of stay also decreased. Two things make this result important. First, this was a prospective, large-scale evaluation. Second, the benefit was conditional on the clinician responding to the system. In other words, the value came not from the algorithm's "correct prediction" alone, but from the correct prediction being converted to the correct clinical action at the right time.

Algorithm + workflow = outcome

TREWS's success had as much to do with how it was deployed as with the accuracy of the model: an alert clinicians trusted, that did not break their workflow, and that pointed toward action. A model with the same accuracy could, with a bad interface and excessive alarms, do the opposite.

A Cautionary Tale: External Validation of the Epic Sepsis Model

The other side of the coin is the independent evaluation of the proprietary Epic Sepsis Model (ESM), used in hundreds of U.S. hospitals. University of Michigan researchers tested the model on their own patient data and published the results in JAMA Internal Medicine in 2021. The findings were strikingly unfavorable: the model performed far worse than the manufacturer had claimed.

On external validation, the ESM had a sensitivity of only 33%, a positive predictive value of 12%, and an area under the curve (AUC) of 0.63 — that is, it missed two-thirds of sepsis cases while the vast majority of its alarms were false positives. Clinically, this means harm in two directions: an unreliable safety net and alarm fatigue that desensitizes clinicians to warnings. The study became a watershed in showing that "manufacturer-reported performance" can never replace independent, local external validation.

DimensionTREWS (Nature Medicine, 2022)Epic Sepsis Model (JAMA Intern Med, 2021)
Type of evaluationProspective, multicenter, outcome-focusedRetrospective external validation
Key finding~18% relative mortality reduction with timely confirmationSensitivity 33%, AUC 0.63
Main lessonAlgorithm + workflow + clinician responseIndependent external validation is essential; alarm fatigue is a real risk

Common Principles from the Two Cases

The two cases yield complementary lessons for clinical decision support systems:

  • Local external validation is indispensable. A model performing well at one hospital does not mean it will perform well at yours; the ESM case shows this in painful terms.
  • Accuracy is necessary but not sufficient. TREWS's benefit depended on correct predictions translating into clinician action. Without a measured outcome metric (mortality), claims of a "good model" remain incomplete.
  • Alarm design is a patient-safety issue. Too-frequent and false-positive alerts lead to the system being ignored entirely.
  • Calibration, less discussed than discrimination, is critical. For a model's reported "probability" to guide clinical action, it must reflect reality.

Beyond Sepsis

The same principles apply to many CDSS applications: acute kidney injury, clinical deterioration (rapid response team activation), fall risk, post-discharge readmission, and medication safety alerts. The role of AI here is to direct the clinician's attention to the right patient at the right time — while leaving the final decision to the clinician.

Conclusion

AI has two faces in clinical decision support: TREWS, which, deployed correctly, measurably reduced mortality; and the Epic Sepsis Model, which, taken on assumption, eroded trust. The difference is not the technology itself but the rigor of independent validation, workflow integration, alarm design, and clinician trust. A CDSS's quality should be measured not by the accuracy of its predictions but by whether it genuinely improves patient outcomes.

References

  1. Adams R, et al. Prospective, multi-site study of patient outcomes after implementation of the TREWS machine learning-based early warning system for sepsis. Nature Medicine 2022. nature.com
  2. Henry KE, et al. Factors driving provider adoption of the TREWS early warning system and its effects on sepsis treatment timing. Nature Medicine 2022. nature.com
  3. Wong A, et al. External Validation of a Widely Implemented Proprietary Sepsis Prediction Model in Hospitalized Patients. JAMA Internal Medicine 2021. jamanetwork.com
  4. Habib AR, et al. The Epic Sepsis Model Falls Short — The Importance of External Validation (editorial). JAMA Internal Medicine 2021. jamanetwork.com
  5. External validation of the Epic sepsis predictive model in 2 county emergency departments. PMC11560849. ncbi.nlm.nih.gov
Disclaimer: This content is for educational and informational purposes only and does not substitute for diagnosis or treatment decisions. Clinical decision support systems are designed to support the final decision of the physician; alerts must always be interpreted in clinical context.