Skip to main content

EWMA

Description

Current biosurveillance systems run multiple univariate statistical process control (SPC) charts to detect increases in multiple data streams. The method of using multiple univariate SPC charts is easy to implement and easy to interpret. By examining alarms from each control chart, it is easy to identify which data stream is causing the alarm. However, testing multiple data streams simultaneously can lead to multiple testing problems that inflate the combined false alarm probability. Although methods such as the Bonferroni correction can be applied to address the multiple testing problem by lowering the false alarm probability in each control chart, these approaches can be extremely conservative. Biosurveillance systems often make use of variations of popular univariate SPC charts such as the Shewart Chart, the cumulative sum chart (CUSUM), and the exponentially weighted moving average chart (EWMA). In these control charts an alarm is signaled when the charting statistic exceeds a pre-defined control limit. With the standard SPC charts, the false alarm rate is specified using the in-control average run length (ARL0). If multiple charts are used, the resulting multiple testing problem is often addressed using family-wise error rate (FWER) based methods that are known to be conservative - for error control. A new temporal method is proposed for early event detection in multiple data streams. The proposed method uses p-values instead of the control limits that are commonly used with standard SPC charts. In addition, the proposed method uses false discovery rate (FDR) for error control over the standard ARL0 used with conventional SPC charts. With the use of FDR for error control, the proposed method makes use of more powerful and up-to-date procedures for handling the multiple testing problem than FWER-based methods.

Objective: To propose a computationally simple, fast, and reliable temporal method for early event detection in multiple data streams.

Submitted by elamb on
Description

Absenteeism has great advantages in promoting the early detection of epidemics. School absenteeism surveillance could timely detect the aggregations of absentees in time and space, so as to provide effective early warning and prevention and control of infectious diseases outbreaks in schools. Since April 1, 2012, an integrated syndromic surveillance system (ISSC) has been implemented in rural Hubei Province, China. With school absence data, finding the optimal model and related appropriate parameters for early warning of epidemics is necessary and practical.

Objective

To explore the optimal model and its related parameters via EWMA and CUSUM (C1, C2, C3) models in school absenteeism surveillance for early detection of infectious disease outbreaks in rural China.

Submitted by NSSP_KR_Admin on
Description

San Diego County Public Health has been conducting syndromic surveillance for the past few years. Currently, the system has become largely automated and processes and analyzes data from a variety of disparate sources including hospital emergency departments, 911 call centers, prehospital transports, and over-the-counter drug sales. What has remained constant since the system’s initial conceptualization is the local opinion that the data should be analyzed and interpreted in a variety of ways, in anticipation for the variety of contexts in which events that are of public health interest may unfold. Relatively small increases in volume that are sustained over time will likely be detected by methods designed to detect “small process shifts”, and include the CUSUM and EWMA methods. Larger increases in volume that are not sustained over time will likely be detected by other employed methods (P-Chart in the event of a non-proportional increase in volume, U-Chart in the event of a proportional increase in volume). A retrospective analysis was conducted on historical data from various data sources to determine the frequency of signals and detected events as well as the context within which the alert occurred (i.e., the “shape” of the data). Findings regarding several actual public health events will also be discussed.

 

Objective

This paper describes the frequency, various “shapes” and magnitudes of data anomalies, and varying ways actual public health events may present themselves in syndromic data.

Submitted by elamb on
Description

While several authors have advocated wavelets for biosurveillance, there are few published wavelet method evaluations using real syndromic data. Goldenberg et al. performed an analysis using wavelet predictions as a way of detecting a simulated anthrax outbreak. The commercial RODS application uses averaged wavelet levels to normalize for longterm trends and negative singularities. In line with the implementation in and in contrast to, we introduce two preconditioning steps to account for the strong day-of-week effect and holidays, and then use all levels of the wavelets to predict or alarm.

Objective

Syndromic data are created by processes that operate on different time scales (daily, weekly, or even yearly) and can include events of different durations from a 1-2 day outbreak of foodborne illness to a more gradual, protracted flu season. The duration of an outbreak caused by a new pathogenic strain or a bioterrorist attack is indeterminate. Wavelets are well suited for detecting signals of uncertain duration because they decompose data at multiple time and frequency scales. This study evaluates the use of several wavelet-based algorithms for both time series forecasting and anomaly detection using real-world syndromic data from multiple data sources and geographic locations.

Submitted by elamb on