Forecasting occupational accidents in Turkey using multivariate ARMAX and NLARX models


Creative Commons License

Kaplanvural S., Tosyalı E., EKMEKÇİ İ.

Scientific Reports, cilt.16, sa.1, 2026 (SCI-Expanded, Scopus) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 16 Sayı: 1
  • Basım Tarihi: 2026
  • Doi Numarası: 10.1038/s41598-026-36210-0
  • Dergi Adı: Scientific Reports
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, BIOSIS, Chemical Abstracts Core, MEDLINE, Directory of Open Access Journals
  • Anahtar Kelimeler: ARMAX, NLARX, Occupational accidents, OHS modelling, Time series forecasting
  • Açık Arşiv Koleksiyonu: AVESİS Açık Erişim Koleksiyonu
  • İstanbul Ticaret Üniversitesi Adresli: Evet

Özet

Occupational accidents remain a critical issue in Turkey, with significant social and economic consequences, and understanding accident trends is essential for developing effective prevention strategies. This study employed both linear AutoRegressive Moving Average with Exogenous Input (ARMAX) and Nonlinear AutoRegressive with Exogenous Input (NLARX) models to forecast future occupational accidents using four accident-related populations (,,,) derived from official insurance records. Due to the lack of consistently reported monthly data from the Social Security Institution (SSI), exogenous variables such as sectoral, economic, or demographic indicators were not incorporated, and the models were therefore identified based solely on the endogenous accident dynamics. The ARMAX identification process yielded a relatively large set of candidate parameters, which were subsequently evaluated using statistical significance criteria; only coefficients with p values below 0.05 and confidence intervals excluding zero were retained for interpretation. Model performance was evaluated using the normalized mean squared error (NMSE), which was computed separately for the training period, the test (out-of-sample forecasting) period, and the full dataset for each model. This multi-level evaluation enabled a consistent comparison of in-sample fitting accuracy, out-of-sample generalization capability, and overall predictive performance across ARMAX and NLARX models. The significance-based analysis revealed distinct linear dynamic structures across the output groups, with the populations characterized by a larger number of moderate-magnitude significant coefficients, whereas exhibited fewer but more dominant linear effects. The ARMAX model produced the lowest NMSE values across the training, test, and full datasets for most populations, demonstrating particularly strong and stable predictive accuracy for and. The NLARX model yielded the best performance for and showed comparable NMSE values to ARMAX for and, although it exhibited higher forecasting errors for, especially in the test period. Overall, the results indicated that while NLARX was capable of capturing nonlinear patterns in specific cases, the ARMAX framework provided a more robust, interpretable, and consistently generalizable representation of the dominant temporal dynamics governing occupational accident trends. These findings highlighted the potential of multivariate time series models to support evidence-based decision-making in occupational safety planning and policy development.