Mobile Device Application Helps Predict Postoperative Complications

Dr. Maria Altieri

As surgeons often make high-stakes, time-sensitive decisions, there is a growing interest in the use of clinically actionable analytics to augment surgical decision-making.

Among the many decision-support tools available, the POTTER (Predictive OpTimal Trees in Emergency Surgery Risk) artificial intelligence-based (AI) calculator has yielded promising results that harbor relatively unique potential for clinical application. POTTER—leveraging a flexible decision-tree-based machine-learning approach—has predictive accuracy that is on par with, or greater than, many other risk calculators. Its algorithm was validated using the ACS National Surgical Quality Improvement Program (NSQIP^®) data from 2014. The Surgical Risk Preoperative Assessment System (SURPAS) model (developed and validated via ACS NSQIP data) and several algorithms reported by Chiew and colleagues have state-of-the-art accuracy for predicting mortality and postoperative intensive care unit (ICU) admission for broad, heterogenous surgical patient populations.^1,2

Figure 1. POTTER Calculator

Although POTTER use has been reported for specific patient populations such as emergency surgical patients and elderly patients, the evidence suggests that this tool is broadly generalizable.^3-5 Perhaps most importantly, POTTER is available in a user-friendly mobile application, making it ready for deployment in clinical settings to assess operative risk or to aid in counseling patients.

POTTER has some opportunities for improvement. The application requires manual data entry, which could be a small hindrance for some users, although it requires manual entry of less than 10 variables (see Figure 1).⁶

In contradistinction, AI predictive analytic platforms using automated electronic health record (EHR) data inputs obviate manual data entry requirements.⁷ Historically, there has been a lack of high-level evidence from prospective studies supporting automated EHR data entry, but recent evidence suggests that this approach is effective and can be deployed as a mobile device application.⁸

In addition, POTTER uses preoperative data and does not incorporate intraoperative data that are potentially informative in predicting postoperative complications. POTTER algorithms were trained primarily on outcomes in the US 2007–2013, which may not generalize well to other countries and may not accurately represent risk in 2023. Like most similar risk calculators, POTTER requires prospective validation and assessments of its effects on decision-making and patient outcomes.

Few studies have robustly tested the effects of AI-enabled decision support on clinical decision-making. The Hypotension Prediction (HYPE) trial⁹ is a notable exception. In a randomized study, the authors deployed an AI algorithm that predicted impending intraoperative hypotension. The HYPE trial showed that anesthesiologists using the algorithm acted earlier, differently, and more frequently, and their patients experienced fewer hypotensive events and less time-weighted hypotension.

Although unproven, it remains plausible that on a larger scale, the HYPE trial algorithm could decrease complications related to intraoperative hypotension (e.g., acute kidney injury), and thus improve patient outcomes.

Likewise, it remains to be determined whether the POTTER app and similar surgical AI decision support systems improve patient outcomes. Lupei, Sun, and colleagues^10,11 previously have reported an AI model degradation from internal or external validation to real-time validation, as well as the impact of AI-enabled tools that are integrated into real-time clinical workflows. Although AI offers greater potential to accurately represent complex, nonlinear pathophysiology compared with basic statistical modeling, recent studies have demonstrated no great superiority of deep learning over regression in classifying illness severity of individual patients using readily available clinical data.^12,13

For cases in which AI offers no predictive performance advantage, it may be preferable to use regression-based algorithms that are more easily interpreted by clinicians and have a longer, stronger record of success in clinical settings.

Our personal, anecdotal experience with POTTER is that it provides an accurate, data-driven prediction of postoperative complications, which can be useful adjuncts to shared decision-making processes and prognostic conversations with patients and caregivers, especially when the prognosis is poor.

The POTTER app is useful as it helps predict postoperative morbidity and mortality following emergency surgery compared to similar elective surgery. After inputting information regarding the patient, the user can select the outcome for which a risk estimate is desired. A series of questions follows, and each new question is based on the answer to the previous question as it forms a decision tree, which finally calculates the risk based on the previous responses. The final result predicts the risk of death for patients undergoing emergency general surgery procedures and 18 postoperative complications.

By augmenting, rather than replacing, the knowledge, intuition, and skills that surgeons offer their patients, clinically actionable predictive analytics can anchor decision-making and prognostication with objectivity and reduce the variability that is inherent to the provider-specific hypothetical, deductive reasoning that is the hallmark of current surgical decision-making practices.

Disclaimers

The authors have no conflicts of interest related to the POTTER application.

The thoughts and opinions expressed in this viewpoint article are solely those of the authors and do not necessarily reflect those of the ACS.

Dr. Maria Altieri is the section chief of gastrointestinal surgery at the Hospital of the University of Pennsylvania (Penn) in Philadelphia and assistant professor of surgery at the Perelman School of Medicine at Penn.

References

Chiew CJ, Liu N, Wong TH, Sim YE, Abdullah HR. Utilizing machine learning methods for preoperative prediction of postsurgical mortality and intensive care unit admission. Ann Surg. 2020;272(6):1133-1139.
Rozeboom PD, Henderson WG, Dyas AR, et al. Development and validation of a multivariable prediction model for postoperative intensive care unit stay in a broad surgical population. JAMA Surg. 2022;157(4):344-352.
Maurer LR, Chetlur P, Zhuo D, et al. Validation of the AI-based Predictive OpTimal Trees in Emergency Surgery Risk (POTTER) calculator in patients 65 years and older. Ann Surg. 2023;277(1):8-15.
El Hechi MW, Maurer LR, Levine J, et al. Validation of the artificial intelligence-based Predictive Optimal Trees in Emergency Surgery Risk (POTTER) calculator in emergency general surgery and emergency laparotomy patients. J Am Coll Surg. 2021;232(6):912-919.
Gebran A, Vapsi A, Maurer LR, et al. POTTER-ICU: An artificial intelligence smartphone-accessible tool to predict the need for intensive care after emergency surgery. Surgery. 2022;172(1):470-475.
Leeds IL, Rosenblum AJ, Wise PE, et al. Eye of the beholder: Risk calculators and barriers to adoption in surgical trainees. Surgery. 2018;164(5):1117-1123.
Bihorac A, Ozrazgat-Baslanti T, Ebadi A, et al. MySurgeryRisk: Development and validation of a machine-learning risk algorithm for major complications and death after surgery. Ann Surg. 2018;269(4):652-662.
Ren Y, Loftus TJ, Datta S, et al. Performance of a machine learning algorithm using electronic health record data to predict postoperative complications and report on a mobile platform. JAMA Netw Open. 2022;5(5):e2211973.
Wijnberge M, Geerts BF, Hol L, et al. Effect of a machine learning-derived early warning system for intraoperative hypotension vs. standard care on depth and duration of intraoperative hypotension during elective noncardiac surgery: The HYPE Randomized Clinical Trial. JAMA. 2020;323(11):1052-1060.
Lupei MI, Li D, Ingraham NE, et al. A 12-hospital prospective evaluation of a clinical decision support prognostic algorithm based on logistic regression as a form of machine learning to facilitate decision making for patients with suspected COVID-19. Plos One. 2022;17(1): e0262193.
Sun J, Peng L, Li TH, et al. Performance of a chest radiograph AI diagnostic tool for COVID-19: A prospective observational study. Radiol-Artif Intell. 2022;4(4):1-10.
Norgeot B, Quer G, Beaulieu-Jones BK, et al. Minimum information about clinical artificial intelligence modeling: The MI-CLAIM checklist. Nat Med. 2020;26(9):1320-1324.
Khera R, Haimovich J, Hurley NC, et al. Use of machine learning models to predict death after acute myocardial infarction. JAMA Cardiol. 2021;6(6):633-641.