Hiring in life sciences? Share your open positions with our professional community. Read more Close

Advertisement

Two-Stage Interrupted Time Series Analysis with Machine Learning: Evaluating the Health Effects of the 2018 Wildfire Smoke Event in San Francisco County as a Case Study.

Created on 15 Jul 2025

Authors

Arnab K Dey, Yiqun Ma, Gabriel Carrasco-Escobar, Changwoo Han, François Rerolle, Tarik Benmarhnia

Published in

American journal of epidemiology. Jul 11, 2025. Epub Jul 11, 2025.

Abstract

Randomized controlled trials (RCTs) are considered a key identification strategy for establishing causal relationships between exposures and outcomes. When evaluating the health impacts of extreme weather events, however, RCTs are generally infeasible due to ethical issues, costs, and the lack of a suitable control group. Quasi-experimental designs capitalizing on the timing of natural experiments, such as Interrupted Time Series (ITS), offer a valuable alternative to estimate causal effects when control groups are not available. This paper explores the application of a two-stage ITS framework that compares traditional autoregressive integrated moving average (ARIMA) models and two machine learning algorithms: Neural Network Autoregressive (NNETAR) and Prophet-Extreme Gradient Boosting (XGBoost). As a case study, we assess the impacts of the 2018 wildfire smoke event on respiratory hospitalizations in San Francisco County, California. We split the data into pre- and post-event periods to train and evaluate the models, perform cross-validation for hyperparameter tuning, and predict hospitalizations under the counterfactual scenario. Data and R code are provided for reproducibility. In the case study, the Prophet-XGBoost shows the best model performance and was used to generate the counterfactual trends. We estimate that the 2018 smoke event resulted in a total of 92 (95% empirical confidence interval: 24, 125) excess respiratory hospitalizations (12.5% of the observed hospitalization count during the event period). Our proposed approach offers a powerful tool for assessing the effects of extreme weather events and can be broadly applied to other epidemiological contexts, such as public health policy evaluation.

PMID:
40663094
Bibliographic data and abstract were imported from PubMed on 15 Jul 2025.

Read full publication at:
Please sign in to see all details.

Advertisement

Stats

  • Community rating n/a 0 votes
  • Reviewers' rating n/a 0 votes
  • Your rating

1-terrible, 9-excellent. How would you rate this publication? Sign in in to submit your rating.

  • Recommendations n/a n/a positive of 0 vote(s)
  • Views 43
  • Comments 0

Recommended by

  • No recommendations yet.

Post a comment

You need to be signed in to post comments. You can sign in here.

Comments

There are no comments yet.

Advertisement