Skip to main content

Bayesian Models

Description

The multivariate Bayesian scan statistic (MBSS) enables timely detection and characterization of emerging events by integrating multiple data streams. MBSS can model and differentiate between multiple event types: it uses Bayes’ Theorem to compute the posterior probability that each event type Ek has affected each space-time region S. Results are visualized using a ‘posterior probability map’ showing the total probability that each location has been affected. Although the original MBSS method assumes a uniform prior over circular regions, and thus loses power to detect elongated and irregular clusters, our Fast Subset Sums (FSS) method assumes a hierarchical prior, which assigns non-zero prior probabilities to every subset of locations, substantially improving detection power and accuracy for irregular regions.

Objective

We propose a new, computationally efficient Bayesian method for detection and visualization of irregularly shaped clusters. This Generalized Fast Subset Sums (GFSS) method extends our recently proposed MBSS and FSS approaches, and substantially improves timeliness and accuracy of event detection.

Submitted by teresa.hamby@d… on
Description

The ability to rapidly detect any substantial change in disease incidence is of critical importance to facilitate timely public health response and, consequently, to reduce undue morbidity and mortality. Unlike testing methods (1, 2), modeling for spatio-temporal disease surveillance is relatively recent, and this is a very active area of statistical research (3). Models describing the behavior of diseases in space and time allow covariate effects to be estimated and provide better insight into etiology, spread, prediction and control. Most spatio-temporal models have been developed for retrospective analyses of complete data sets (4). However, data in public health registries accumulate over time and sequential analyses of all the data collected so far is a key concept to early detection of disease outbreaks. When the analysis of spatially aggregated data on multiple diseases is of interest, the use of multivariate models accounting for correlations across both diseases and locations may provide a better description of the data and enhance the comprehension of disease dynamics.

Objective

This study deals with the development of statistical methodology for on-line surveillance of small area disease data in the form of counts. As surveillance systems are often focused on more than one disease within a predefined area, we extend the surveillance procedure to the analysis of multiple diseases. The multivariate approach allows for inclusion of correlation across diseases and, consequently, increases the outbreak detection capability of the methodology

Submitted by elamb on
Description

Block 3 of the US Military Electronic Surveillance System for Early Notification of Community-Based Epidemics (ESSENCE) system affords routine access to multiple sources of data. These include administrative clinical encounter records in the Comprehensive Ambulatory Patient Encounter Record (CAPER), records of filled prescription orders in the Pharmacy Data Transaction Service, developed at the Department of Defense (DoD) Pharmacoeconomic Center, Laboratory test orders and results in HL7 format, and others. CAPER records include a free-text Reason for Visit field, analogous to chief complaint text in civilian records, and entered by screening personnel rather than the treating healthcare provider. Other CAPER data fields are related to case severity. DoD ESSENCE treats the multiple, recently available data sources separately, requiring users to integrate algorithm results from the various evidence types themselves. This project used a Bayes Network approach to create an ESSENCE module for analytic integration, combining medical expertise with analysis of 4 years of data using documented outbreaks.

 

Objective

The project objective was to develop and test a decision support module using the multiple data sources available in the U.S. DoD version of ESSENCE.

Submitted by elamb on
Description

Optimal sequential management of disease outbreaks has been shown to dramatically improve the realized outbreak costs when the number of newly infected and recovered individuals is assumed to be known (1,2). This assumption has been relaxed so that infected and recovered individuals are sampled and therefore the rate of information gain about the infectiousness and morbidity of a particular outbreak is proportional to the sampling rate (3). We study the effect of no recovered sampling and signal delay, features common to surveillance systems, on the costs associated with an outbreak.

Objective

Development of general methodology for optimal decisions during disease outbreaks that incorporate uncertainty in both parameters governing the outbreak and the current outbreak state in terms of the number of current infected, immune, and susceptible individuals.

Submitted by elamb on
Description

With the increase in GPS enabled devices, pin-point spatial data is an obvious future growth area for cluster detection research. The FBSSS handles binary labelled point data, but requires Monte Carlo testing to obtain inference [1]. In the Bayesian Poisson SSS [2], Monte Carlo is replaced by use of historic data, manifoldly speeding up processing. Following [2], [3] derived the BBSSS, replacing historic data with expert knowledge on cluster relative risk. This paper compares the spatial accuracy of BBSSS and FBSSS using new measure [4] which, being independent of inference level, permits direct comparison between Bayesian and frequentist methods. To compare the spatial accuracy of a Bayesian Bernoulli spatial scan statistic (BBSSS) and the frequentist Bernoulli spatial scan statistic (FBSSS), using benchmark trials.

Submitted by elamb on
Description

As technology advances, the implementation of statistically and computationally intensive methods to detect unusual clusters of illness becomes increasingly feasible at the state and local level [2]. Bayesian methods allow for the incorporation of prior knowledge directly into the model, which could potentially improve estimation of expected counts and enhance outbreak detection. This method is one of eight being formally evaluated as part of a grant from the Alfred P. Sloan Foundation.

Objective

To adapt a previously described Bayesian model-based surveillance technique for cluster detection [1] to NYC Emergency Department (ED) visits.

Submitted by knowledge_repo… on
Description

This work incorporates model learning into a Bayesian framework for outbreak detection. Our method learns the spatial characteristics of each outbreak type from a small number of labeled training examples, assuming a generative outbreak model with latent center. We show that using the learned models to calculate prior probabilities for a Bayesian scan statistic significantly improves detection performance.

Submitted by elamb on
Description

This presentation introduces the U.S. Department of Homeland Security (DHS) National Bio-Surveillance Integration System (NBIS) and the analytics functionality within the NBIS that integrates and analyzes structured and unstructured data streams across domains to provide inter-agency analysts with an integrated view of threat scenarios. The integration of Human and Animal incidences of Avian Influenza will be used to demonstrate initial capability.

Submitted by elamb on
Description

Many disease-outbreak detection algorithms, such as control chart methods, use frequentist statistical techniques. We describe a Bayesian algorithm that uses data D consisting of current day counts of some event (e.g., emergency department (ED) chief complaints of respiratory disease) that are tallied according to demographic area (e.g., zip codes).

Objective

We introduce a disease-outbreak detection algorithm that performs complete Bayesian Model Averaging (BMA) over all possible spatial distributions of disease, yet runs in polynomial time.

Submitted by elamb on