Brief report: Lyme disease forecasting in the U.S. Department of Defense: summary of one- to three-month forecasts, January–October 2024

Image of Cover5. Lyme disease forecasting can provide vital information for anticipating the impact of its incidence on military health and readiness, as well as informing effective public health responses and mitigation efforts.

Lyme disease incidence has been increasing among U.S. Department of Defense (DOD) service members over the past 20 years, threatening the health and readiness of the force.1 Syndromic surveillance of tick bite visits can provide timely information that might predict changes in tick-borne disease incidence and geographic spread.2 Tick and tick-borne illness surveillance programs in the U.S. face many barriers, however, including inconsistent funding and limited capacities.3

Epidemic forecasting models can be used to enhance routine surveillance and inform public health policy.4 Previous research on Lyme disease forecasting has focused on general time series and machine-learning models; however, these models have demonstrated high percentage errors and limited predictive accuracy.5,6 Further research is needed to explore alternative modeling approaches that may improve forecasting performance.

Since 2019, the Armed Forces Health Surveillance Division, Integrated Biosurveillance (AFHSD-IB) Branch, within the Defense Health Agency’s Public Health Directorate, has been conducting respiratory disease forecasting using syndromic surveillance data.7 Since 2024, AFHSD-IB has integrated vector-borne disease forecasting to provide situational awareness and inform public health responses from DOD senior leaders. This report aims to predict the number of future Lyme disease and tick bite encounters among U.S Military Health System (MHS) beneficiaries using outpatient encounter data and multiple forecasting methods.

Methods

Tick bite encounters were used as a proxy for tick exposure, while Lyme disease encounters served as a proxy for Lyme disease diagnoses in the absence of trusted case reporting. Encounter definitions were developed using internal criteria.

A single instance of a Lyme disease encounter was defined using the International Classification of Diseases, 10th Revision (ICD-10), discharge diagnosis code ‘A69.2’ and chief complaint terms “Lyme disease,” “erythema migrans,” or “bulls-eye rash,” or their misspellings, in the chief complaint field; any mention of a history of Lyme disease was excluded. Similarly, tick bite encounters were defined as health records mentioning “tick bite” and associated misspellings (including “tic” or “tik” for tick and “bit” for bite) in the chief complaint field.

Monthly direct-sourced outpatient encounter data for each military treatment facility were obtained from the DOD’s Electronic Surveillance System for the Early Notification of Community-based Epidemics (ESSENCE). Data were then aggregated into 4 U.S. surveillance regions, selected based on their high volume of Lyme disease encounters in 2024: National Capital Region, New England, Tidewater, and West Point. Surveillance regions are collections of neighboring military installations, clinics, and hospitals used for disease surveillance and reporting. (Information about each surveillance region is shown in Supplementary Table 1.) Monthly encounter data from January 2021 through October 2024 were collected and used to generate 1- through 3-month forecasts for each outcome metric and surveillance region for the period January through October 2024. Models were trained each month, using data from January 2021 through the most recent month.

Several time series and machine-learning models were used for forecasting. The seasonal autoregressive integrated moving average (SARIMA) model is based on the widely used Box-Jenkins method for univariate time series forecasting.8 The error, trend, seasonal (ETS) model is another time series model that belongs to a special class of exponential smoothing models known as state-space models.8 The exponentially weighted moving average (EWMA) model is a highly sensitive model that effectively identifies subtle data pattern changes due to its weighting scheme, which is of particular importance when assessing rare outcomes.9 The vector autoregressive (VAR) model is a forecasting algorithm that can be used when 2 or more time series influence each other.10 The neural network (NNET) model is a machine-learning model that can adapt to changing inputs, generating the best possible results without needing to redesign the output criteria.11 Prophet is open-source forecasting model created by Facebook that allows users to easily customize and produce high quality forecasts.12 Finally, a baseline naïve model was created using data from the previous tick-borne season. An ensemble (ENS) model was also calculated as the average of all individual model forecasts.

A log-transformed weighted interval score (WIS) was used to measure accuracy of the model forecasts. WIS was previously established as a scoring method for respiratory disease forecasts,13 with lower values indicating more accurate forecasts. The median absolute percentage error (MAPE), another common metric for forecasting error, was also calculated. All analyses were performed in R software 4.4 (R Foundation for Statistical Computing, Vienna, Austria), including the fable14 and fabletools15 packages.

Results

Our analyses indicated that observed Lyme disease encounters increased from late winter to early spring, peaking during the summer months, except for West Point, which peaked in April. West Point also had the highest peak (34.5 per 100,000 MHS beneficiaries), followed by New England (31.4 in July), National Capital Region (13.9, June), and Tidewater (4.2, June) (Figure 1). Observed tick bite encounters increased from February until spring for each surveillance region. The National Capital Region had the highest peak (50.0 per 100,000 MHS beneficiaries in June), followed by West Point (39.5, April), New England (29.1, April), and Tidewater (18.9, May). Figure 1 also shows 1- through 3-month ahead horizon forecasts for the ENS model.

FIGURE 1. Ensemble Model Observed and Predicted Values, by Surveillance Region and Outcome Metric, 2024 This is a panel of eight line charts designed to evaluate the performance of an ensemble forecasting model. The charts compare the actual, observed monthly encounters for Lyme disease and tick bites against the model's predictions made one, two, and three months in advance, across four U.S. surveillance regions during 2024. The charts demonstrate a clear seasonal pattern for both Lyme disease and tick bites, with encounters peaking in spring and summer. The model's forecasts, particularly the 1-month-ahead predictions, successfully capture this seasonal trend, though accuracy diminishes as the forecast horizon extends further into the future.

For Lyme disease encounter forecasts, the NNET model had the lowest median log-WIS across the 2-month (0.6) and 3-month (0.8) ahead horizons, while the ENS model had the lowest median log-WIS for the 1-month ahead horizon (0.7) (Figure 2). The ETS model had the lowest MAPE for Lyme disease encounters for the 1-month ahead horizon (33%), while the NNET model had the lowest MAPE for the 2-month ahead horizon (37%), and the ENS model had the lowest MAPE for the 3-month ahead horizon (28%) ahead horizon (Supplementary Figure 2). During the months with the highest activity (March–October), MAPE for the ENS model improved to 29% and 27%, respectively, for the 1-month and 3-month ahead horizons, while the MAPE for the NNET model improved to 29% for the 2-month ahead horizon, indicating greater forecasting accuracy during high activity periods when compared to the total surveillance time (Supplementary Table 2).

FIGURE 2. Median Log-Weighted Interval Score by Forecasting Horizon, Outcome Metric and Model, 2024 This panel of eight box plots compares the forecast accuracy of eight different models for predicting Lyme disease and tick bite encounters. The metric used for comparison is the Median Log-Weighted Interval Score, where a lower score indicates a more accurate forecast. The results show that for predicting Lyme disease, the ensemble (ENS) and neural network (NNET) models are generally the most accurate. For predicting tick bites, the error, trend, seasonal (ETS) model consistently performs the best across one, two, and three-month forecast horizons.

For tick bite encounter forecasts, the ETS model had the lowest score (1.0) for all horizons, as well as for the 1- (0.8), 2- (1.0), and 3-month (1.2) ahead horizons, followed by the NNET model (1.2, all horizons; 1.2, 1-month; 1.1, 2-month; and 1.2, 3-month) (Figure 2). The ETS model had the lowest MAPE across all 1-month (25%), 2-month (23%), and 3-month (24%) ahead horizons (Supplementary Figure 2). During the months of highest activity, MAPE for the ETS model improved to 21%, 22%, and 24% respectively for the 1-month, 2-month, and 3-month ahead horizons (Supplementary Table 2).

Discussion

This study produced 2 major findings. First, among all the time series and machine-learning models mentioned in our analysis, 3 models—ENS, ETS, NNET—provided the most accurate forecasts of Lyme disease and tick bite activity in MHS beneficiaries, with increased accuracy during peak activity periods. This finding suggests that these 3 models would be valuable as early warning signals for public health surveillance and preparedness. It is important to note, however, that model accuracy decreased at longer horizons. Second, MAPE estimates were higher than those reported in previous studies using SARIMA models, although accuracy was improved by focusing on periods of higher activity. Individual surveillance regions aligned with the findings of the aforementioned study.6

Limitations of this study must be considered. First, data lags and inconsistent reporting of Lyme disease cases within the MHS prevented validation of encounter data against confirmed case data. Consequently, sensitivity analysis of encounter definition robustness could not be performed, and therefore, results may not be representative of overall disease incidence. The training data set was derived from a 3-year period of encounter data, but additional historical data may be needed to properly train some of the models. Due to limited technology capacity, particularly hardware, this study also did not utilize more computationally intensive machine-learning models, such as long short-term memory and random forest models, potentially limiting the accuracy of the forecasts. In addition, these models are susceptible to over-fitting when there is insufficient training data.

Despite these limitations, this study provided the first quantitative evidence of the use of outpatient encounter syndromic surveillance data for forecasting Lyme disease and tick bite encounters in the MHS population. Lyme disease forecasting can provide vital information for anticipating the impact on military health and readiness, as well as informing effective public health responses and mitigation efforts within the DOD. Further research is required to explore additional models, more robust training data, and other covariates, including incident Lyme disease cases and other key predictors, such as host species, geographic factors, climate, and weather.

SUPPLEMENTARY FIGURE 1. Observed and Predicted Values Not Shown in FIGURE 1, by Surveillance Region, Outcome Metric and Forecasting Model, 2024 This is a panel of eight line charts that provides a detailed comparison of multiple individual forecasting models against observed data for Lyme disease and tick bite encounters in 2024. The purpose is to visualize the performance of each model—such as ETS, NNET, and SARIMA—across four surveillance regions. The charts reveal that while most models capture the general seasonal peak in cases, there is significant variation in their accuracy, with some models overestimating and others underestimating the actual number of encounters at different points in the year.

SUPPLEMENTARY FIGURE 2. Median Absolute Percentage Error by Forecasting Horizon, Outcome Metric and Model, 2024 This panel of eight box plots compares the accuracy of eight different forecasting models using the Median Absolute Percentage Error (MAPE), where a lower value signifies a better forecast. The charts evaluate predictions for both Lyme disease and tick bite encounters at one, two, and three-month horizons. The results indicate that for Lyme disease, the ensemble (ENS) and neural network (NNET) models have the lowest error. For tick bites, the error, trend, seasonal (ETS) model is the most accurate. In contrast, the Naïve model consistently shows the highest percentage error, making it the least reliable predictor.

Authors’ Affiliation

Integrated Biosurveillance Branch, Armed Forces Health Surveillance Division, Public Health Directorate, Defense Health Agency, Silver Spring, MD

References

  1. Melanson VR, Bateman SL, Hering K, et al. A review of force health protection aspects of Lyme disease in the U.S. military. Mil Med. 2025;190(3-4):e561-e568. doi:10.1093/milmed/usae415 
  2. Marx GE, Spillane M, Beck A, et al. Emergency department visits for tick bites: United States, January 2017–December 2019. MMWR Morb Mortal Wkly Rep. 2021;70(17):612-616. doi:10.15585/mmwr.mm7017a2 
  3. Mader EM, Ganser C, Geiger A, et al. A survey of tick surveillance and control practices in the United States. J Med Entomol. 2021;58(4):1503-1512. doi:10.1093/jme/tjaa094 
  4. Chowell G, Skums P. Investigating and forecasting infectious disease dynamics using epidemiological and molecular surveillance data. Phys Life Rev. 2024;51:294-327. doi:10.1016/j.plrev.2024.10.011 
  5. Sun YQ, Z XY, Fan TC, et al. Projecting Lyme disease risk in the United States: a machine learning approach integrating environmental, socioeconomic and vector factors. One Health. 2025;21:101111. doi:10.1016/j.onehlt.2025.101111 
  6. Kapitány-Fövény M, Ferenci T, Sulyok Z, et al. Can Google Trends data improve forecasting of Lyme disease incidence? Zoonoses Public Health. 2019;66(1):101-107. doi:10.1111/zph.12539 
  7. Bova ML, McGee SA, Elliott KR, Ubiera JI. Predicting COVID-19 and respiratory illness: results of the 2022-2023 Armed Forces Health Surveillance Division forecasting challenge. MSMR. 2024;31(5):24-30. Accessed May 8, 2025. https://www.health.mil/news/articles/2024/05/01/msmr-afhsd-ib-2022-respiratory-forecast 
  8. Khan DM, Ali M, Iqbal N, et al. Short-term prediction of COVID-19 using novel hybrid ensemble empirical mode decomposition and error trend seasonal model. Front Public Health. 2022;10:922795. doi:10.3389/fpubh.2022.922795 
  9. Alfasanah Z, Niam MZH, Wardiani S, et al. Monitoring air quality index with EWMA and individual charts using XGBoost and SVR residuals. MethodsX. 2024;14:103107. doi:10.1016/j.mex.2024.103107 
  10. Vector Autoregression (VAR): Comprehensive Guide with Examples in Python. Accessed Aug. 20, 2025. https://www.machinelearningplus.com/timeseries/vector-autoregression-examples-python   
  11. Kumar GS. Understanding and Building Neural Network (NN) Models. 2025. Accessed Aug. 20, 2025. https://builtin.com/machine-learning/nnmodels  
  12. Taylor SJ, Letham B. Prophet: Forecasting at Scale. 2017. Accessed Aug. 20, 2025. https://research.facebook.com/blog/2017/02/prophet-forecasting-at-scale 
  13. Bracher J, Ray EL, Gneiting T, Reich NG. Evaluating epidemic forecasts in an interval format. [Published correction in PLoS Comput Biol. 2022;18(10):e1010592. doi:10.1371/journal.pcbi.1010592.]. PLoS Comput Biol. 2021;17(2):e1008618. doi:10.1371/journal.pcbi.1008618 
  14. O’Hara-Wild M, Hyndman R, Wang E. fable: Forecasting Models for Tidy Time Series, R Package Version 0.4.0. 2024. Accessed Mar. 12, 2025. https://fable.tidyverts.org 
  15. O’Hara-Wild M, Hyndman R, Wang E. fabletools: Core Tools for Packages in the ‘fable’ Framework. R package version 0.5.0. 2024. Accessed Mar. 12, 2025. https://fabletools.tidyverts.org/index.html

You also may be interested in...

Topic
Feb 3, 2026

Medical Surveillance Monthly Report

The Medical Surveillance Monthly Report, a peer-reviewed journal launched in 1995, is the Armed Forces Health Surveillance Division's flagship publication. The MSMR provides monthly evidence-based estimates of the incidence, distribution, impact, and trends of health-related conditions among service members.

Report
Dec 1, 2025

MSMR Vol. 32 No. 12 - December 2025

.PDF | 2.90 MB

The December 2025 issue of MSMR features four full reports: the annual update on cold weather injuries among the active and reserve components; a report on trends in the prevalence of obesity among U.S. active component service members and civilians; the annual update on diagnoses of mental health disorders among active component service members; a ...

Article
Dec 1, 2025

Update: Cold weather injuries among the active and reserve components of the U.S. Armed Forces, July 2020–June 2025

Since 2004, MSMR has published annual updates on the incidence of cold weather injuries affecting U.S. Armed Forces members for the five most recent cold seasons. This 2025 report discusses the occurrence of frostbite, immersion hand and foot injuries, hypothermia, as well as “other specified and unspecified effects of reduced temperature.”

Report
Nov 1, 2025

MSMR Vol. 32 No. 11 - November 2025

.PDF | 3.56 MB

The November 2025 MSMR presents four full reports on sexually transmitted infections (STIs), led by the annual update on STIs among active component service members, 2016–2024; followed by a report on follow up testing for chlamydia or gonorrhea among male Air Force basic trainees, 2017–2023; accompanied by a report on sexual networks of service ...

Report
Oct 1, 2025

MSMR Vol. 32 No. 10 - October 2025

.PDF | 2.90 MB

The October 2025 MSMR presents five full reports: measles, mumps, rubella and varicella within the Military Health System; seasonal influenza hospitalization among service members; BMI, physical fitness, and COVID-19 hospitalization in the Army; adverse pregnancy outcomes in female service members after COVID-19 infection or vaccination; long COVID ...

Refine your search