Skip to main content

Data Analytics

Description

We propose a new method for detecting patterns of disease cases that correspond to emerging outbreaks. Our Anomaly Pattern Detector (APD) first uses a "local anomaly detector" to identify individually anomalous records and then searches over subsets of the data to detect self-similar patterns of anomalies.

Submitted by elamb on
Description

This paper describes a new class of space-time scan statistics designed for rapid detection of emerging disease clusters. We evaluate these methods on the task of prospective disease surveillance, and show that our methods consistently outperform the standard space-time scan statistic approach.

Submitted by elamb on
Description

While several authors have advocated wavelets for biosurveillance, there are few published wavelet method evaluations using real syndromic data. Goldenberg et al. performed an analysis using wavelet predictions as a way of detecting a simulated anthrax outbreak. The commercial RODS application uses averaged wavelet levels to normalize for longterm trends and negative singularities. In line with the implementation in and in contrast to, we introduce two preconditioning steps to account for the strong day-of-week effect and holidays, and then use all levels of the wavelets to predict or alarm.

Objective

Syndromic data are created by processes that operate on different time scales (daily, weekly, or even yearly) and can include events of different durations from a 1-2 day outbreak of foodborne illness to a more gradual, protracted flu season. The duration of an outbreak caused by a new pathogenic strain or a bioterrorist attack is indeterminate. Wavelets are well suited for detecting signals of uncertain duration because they decompose data at multiple time and frequency scales. This study evaluates the use of several wavelet-based algorithms for both time series forecasting and anomaly detection using real-world syndromic data from multiple data sources and geographic locations.

Submitted by elamb on
Description

We developed, implemented and evaluated Meningitis and Encephalitis (M/E) syndrome case definitions based on electronic Emergency Department (ED) chief complaint data; and assessed their ability to detect aberrations that correspond with M/E outbreaks.

Submitted by elamb on
Description

Ideal anomaly detection algorithms shoulddetect both sudden and gradual changes, while keeping the background false positive alert rate at a tolerable level. The algorithms should also be easy to use. Our objective was to develop an anomaly detection algorithm that adapts to the time series being analyzed and reduces false positive signals.

Submitted by elamb on
Description

Accurate and precise estimation of disease rates for a given population during a specified time frame is a major concern for public health practitioners and researchers in biosurveillance. Many diseases follow distinct patterns; incidence and prevalence of many diseases increase approximately exponentially with age, including many cancers, respiratory infections, and gastroenteritis. With increasing demographic information available in biosurveillance systems leading to more complex and comprehensive disease databases, seeking concise and informative summary measures of disease burden over space and time is becoming more critical for public health surveillance. In this paper we present two summary measures of disease burden in the elderly that simultaneously reflect disease dynamics and population characteristics.

 

Objective

To better estimate disease burden in the elderly population we illustrate an approach—the Slope Intercept Modeling for Population Linear Estimation (SIMPLE) method—that summarizes age-specific disease rates in the 65+ population using the observed exponential increase in disease rates with age in this dynamic and rapidly growing population subgroup.

Submitted by elamb on
Description

Our toolkit adds statistical trend analysis, interactive plots, and kernel density estimation to an existing spatio-temporal visualization platform. The goal of these tools is to provide both a quick assessment of the current syndromic levels across a large area and then allow the analyst to view the actual data for a specific region or hospital over a period of time along with an indication as to whether or not a given data point is statistically significant. The sample data used for this toolkit come from over 70 emergency rooms throughout the state of Indiana.

 

Objective

This paper presents a toolkit designed to aid in the assessment of disease outbreak by visualizing spatiotemporal trends and interactively displaying detailed statistical data.

Submitted by elamb on