Hiring in life sciences? Share your open positions with our professional community. Read more Close

Advertisement

Development and external validation of a machine learning model for predicting drug-induced immune thrombocytopenia in a real-world hospital cohort.

Created on 16 Jul 2025

Authors

Hoang Van Dung, Vu Manh Tan, Nguyen Thi Dieu, Pham Van Linh, Nguyen Van Khai, Tran Thi Ngan, Nguyen Thi Thu Phuong

Published in

BMC medical informatics and decision making. Volume 25. Issue 1. Pages 265. Jul 15, 2025. Epub Jul 15, 2025.

Abstract

Drug-induced immune thrombocytopenia (DITP) is a rare but potentially life-threatening adverse drug reaction, often underrecognized due to its nonspecific presentation and the lack of real-time diagnostic tools. Early identification of at-risk patients is critical to improving medication safety and preventing severe complications.
To develop and externally validate a machine learning model for predicting the risk of DITP using routinely collected hospital data, and to optimize its clinical applicability through threshold adjustment.
We conducted a retrospective cohort study using electronic medical records from Hai Phong International Hospital (2018-2024) for model development and internal validation. An independent cohort from Hai Phong International Hospital - Vinh Bao (2024) served as external validation. Eligible patients received at least one drug previously implicated in DITP and had serial platelet counts. A Light Gradient Boosting Machine (LightGBM) model was trained on demographic, clinical, laboratory, and pharmacological features. Model performance was assessed using area under the ROC curve (AUC), accuracy, recall, and F1-score. Shapley Additive explanations (SHAP) were used to interpret feature contributions. Threshold tuning and decision curve analysis (DCA) supported clinical applicability.
Among 17,546 patients in the training cohort and 1,403 in the external cohort, DITP occurred in 432 (2.46%) and 70 (4.99%) patients, respectively. In internal validation, LightGBM achieved an AUC of 0.860, recall of 0.392, and F1-score of 0.310. External validation confirmed model robustness with an AUC of 0.813 and an F1-score of 0.341 at the optimized threshold (0.09). SHAP analysis identified AST, baseline platelet count, and renal function as key contributors. DCA and clinical impact curves demonstrated potential benefit in supporting real-time risk stratification. Clopidogrel and vancomycin were frequently associated with suspected DITP cases.
This externally validated machine learning model enables early identification of hospitalized patients at risk of DITP using data available in routine care. Its integration into electronic medical records may support clinical decision-making, reduce diagnostic delays, and improve pharmacovigilance practices in hospital settings.

PMID:
40665302
Bibliographic data and abstract were imported from PubMed on 16 Jul 2025.

Read full publication at:
Please sign in to see all details.

Advertisement

Stats

  • Community rating n/a 0 votes
  • Reviewers' rating n/a 0 votes
  • Your rating

1-terrible, 9-excellent. How would you rate this publication? Sign in in to submit your rating.

  • Recommendations n/a n/a positive of 0 vote(s)
  • Views 45
  • Comments 0

Recommended by

  • No recommendations yet.

Post a comment

You need to be signed in to post comments. You can sign in here.

Comments

There are no comments yet.

Advertisement