255376 Using Box-Jenkins modeling techniques to forecast future disease burden and identify aberrations in public health surveillance reports

Monday, October 29, 2012

Larry Garrett, BSN, MPH , HealthInsight, Salt Lake City, UT
Objective: This study examines the potential to predict future disease burden based upon the historical record within specific jurisdictions using Box-Jenkins statistical models. It also examines what influence, if any, jurisdictional size and rate of disease has upon the accuracy of a forecast.

Methods: Box–Jenkins forecast models were constructed by stratifying 30,104 disease reports by year, disease type and jurisdiction. Three holdout samples were used to compare forecast accuracy against the actual values for the same time periods. Accuracy was determined by comparing the projected data against the holdout sample using Mean Absolute Percentage Error calculations. Forecast predication intervals were also used to explore the relationships between a holdout sample actual value and the predication interval associated with each forecast at one, two and three standard deviations.

Results: Forecasts have an absolute accuracy between 77 – 89% with the accuracy being independent of jurisdictional size (r (3) = .593, p > .05) or rate of disease (r (3) = .455, p > .05). Larger jurisdictions are significantly (rho (3) = .9, p < .05) more likely to have forecast values located within the forecast interval associated with one standard deviation.

Conclusions: This study demonstrates that it is possible to predict future disease burden within jurisdictions of varying size using Box-Jenkins forecasting techniques. It shows promise as a tool to monitor disease trends, predict disease investigation resource allocation and potentially identify outbreaks earlier. Each forecast establishes a baseline that may be used to compare observed vs. expected values.

Learning Areas:
Epidemiology
Public health or related research

Learning Objectives:
1. Recognize the need for better disease detection methods. 2. Describe the key components for baseline calculations. 3. Identify methods to control for outlier and/or outbreak related data.

Keywords: Epidemiology, Biostatistics

Presenting author's disclosure statement:

Qualified on the content I am responsible for because: The abstract author is responsible for all of the information contained in this abstract. It is a product of an analysis contained in the author’s dissertation for a PhD in Interdisciplinary Health Sciences from Western Michigan University.
Any relevant financial relationships? No

I agree to comply with the American Public Health Association Conflict of Interest and Commercial Support Guidelines, and to disclose to the participants any off-label or experimental uses of a commercial product or service discussed in my presentation.