Skip to main content

Cluster Detection

Description

The Bureau of Communicable Disease (BCD) at the NYC Department of Health and Mental Hygiene performs daily automated analyses using SaTScan to detect spatio-temporal clusters for 37 reportable diseases. Initially, we analyzed one address per patient, prioritizing home address if available. On September 25, 2015, a BCD investigator noticed two legionellosis cases with similar work addresses. A third case was identified in a nearby residential facility, and an investigation was initiated to identify a common exposure source. Four days later, after additional cases living nearby were reported, the SaTScan analysis detected a corresponding cluster.  In response to this signaling delay, we implemented a multiple address (MA) analysis to improve upon single address (SA) analyses by using all location data available on possible exposure sites.

Objective

To improve timeliness and sensitivity of legionellosis cluster detection in New York City (NYC) by using all addresses available for each patient in one analysis.

Submitted by Magou on
Description

The Biosurveillance Ecosystem (BSVE) is a biological and chemical threat surveillance system sponsored by the Defense Threat Reduction Agency (DTRA). BSVE is intended to be user-friendly, multi-agency, cooperative, modular and threat agnostic platform for biosurveillance [2]. In BSVE, a web-based workbench presents the analyst with applications (apps) developed by various DTRAfunded researchers, which are deployed on-demand in the cloud (e.g., Amazon Web Services). These apps aim to address emerging needs and refine capabilities to enable early warning of chemical and biological threats for multiple users across local, state, and federal agencies. Soda Pop is an app developed by Pacific Northwest National Laboratory (PNNL) to meet the current needs of the BSVE for early warning and detection of disease outbreaks. Aimed for use by a diverse set of analysts, the application is agnostic to data source and spatial scale enabling it to be generalizable across many diseases and locations. To achieve this, we placed a particular emphasis on clustering and alerting of disease signals within Soda Pop without strong prior assumptions on the nature of observed diseased counts.

Objective

To introduce Soda Pop, an R/Shiny application designed to be a disease agnostic time-series clustering, alarming, and forecasting tool to assist in disease surveillance “triage, analysis and reporting” workflows within the Biosurveillance Ecosystem (BSVE). In this poster, we highlight the new capabilities that are brought to the BSVE by Soda Pop with an emphasis on the impact of metholodogical decisions.

Submitted by Magou on
Description

Drug overdoses are an increasingly serious problem in the United States and worldwide. The CDC estimates that 47,055 drug overdose deaths occurred in the United States in 2014, 61% of which involved opioids (including heroin, pain relievers such as oxycodone, and synthetics).1 Overdose deaths involving opioids increased 3-fold from 2000 to 2014.1 These statistics motivate public health to identify emerging trends in overdoses, including geographic, demographic, and behavioral patterns (e.g., which combinations of drugs are involved). Early detection can inform prevention and response efforts, as well as quantifying the effects of drug legislation and other policy changes.

The fast subset scan2 detects significant spatial patterns of disease by efficiently maximizing a log-likelihood ratio statistic over subsets of data points, and has recently been extended to multidimensional data (MD-Scan).3 While MD-Scan is a potentially useful tool for drug overdose surveillance, the high dimensionality and sparsity of the data requires a new approach to estimate and represent baselines (expected counts), maintaining both accuracy and efficient computation when searching over subsets. 

Objective

We present the multidimensional tensor scan (MDTS), a new method for identifying emerging patterns in multidimensional spatio-temporal data, and demonstrate the utility of this approach for discovering emerging geographic, demographic, and behavioral trends in fatal drug overdoses. 

Submitted by Magou on
Description

The first travel-associated cases of Zika virus infection in New York City (NYC) were identified in January 2016. Local transmission of Zika virus from imported cases is possible due to presence of Aedes albopictus mosquitos. Timely detection of local Zika virus transmission could inform public health interventions and mitigate additional spread of illness. Daily emergency department (ED) visit surveillance to detect individual cases and spatio-temporal clusters of locally-acquired Zika virus disease was initiated in June 2016. 

Objective

Case and cluster identification of emergency department visits related to local transmission of Zika virus. 

Submitted by Magou on
Description

Since the majority of emerging infectious diseases over the past several decades have been zoonotic, animal health surveillance is now recognized as a key element in predicting public health risks. Surveillance of animal populations can provide important early warnings of emerging threats to human populations from bioterrorism or naturally occurring infectious disease epidemics. This study investigated current animal data collection and surveillance systems, isolated major gaps in state and national surveillance capabilities, and provided recommendations to fill those gaps.

Objective

To identify gaps in current U.S. animal data collection and surveillance systems, describe how surveillance of animal populations can provide important early warnings of emerging threats to human populations from infectious disease epidemics, and explain the benefits of integrating human and animal surveillance data into a common linked system.

 

Submitted by uysz on
Description

The Centers for Disease Control and Prevention (CDC) uses the National Poison Data System (NPDS) to conduct surveillance of calls to United States PCs. PCs provide triage and treatment advice for hazardous exposures through a free national hotline. Information on demographics, health effects, implicated substance(s), medical outcome of the patient, and other variables are collected.

CDC uses automated algorithms to identify anomalies in both pure call volume and specific clinical effect volume, and to identify calls reporting exposure to high priority agents. Pure and clinical effect volume anomalies are identified when an hourly call count exceeds a threshold based on historical data using HLM.1 Clinical toxicologists and epidemiologists at the American Association of Poison Control Centers and CDC apply standardized criteria to determine if the anomaly identifies a potential incident of public health significance (IPHS) and to notify the respective health departments and local PCs as needed. Discussions with NPDS users and analysis of IPHS showed that alerting based on pure call volume yielded excessive false positives. A study using a 5-year NPDS call dataset assessed the positive predictive value (PPV) of the call volume-based approach. This study showed that less than 4% of anomalies were IPHS.2 A low PPV can cause unnecessary waste of staff time and resources analyzing false positive anomalies.

As an alternative to pure call volume-based detection where all calls to each PC are aggregated for anomaly detection, we considered separating calls by toxicologically-relevant exposure categories for more targeted anomaly detection. We hypothesized that this stratified approach would reduce the number of false positives. 

Objective

Our objective was to compare the effectiveness of applying the historical limits method (HLM) to poison center (PC) call volumes with vs without stratifying by exposure type. 

Submitted by Magou on
Description

Identifying, solving, and stopping foodborne outbreaks in the U.S. requires the collaboration and coordination of multiple federal agencies and centers as well as state and local authorities. FDA’s Coordinated Outbreak Response and Evaluation (CORE) Network is responsible for outbreak surveillance, response, and post-response activities related to incidents involving multiple illnesses linked to FDA-regulated food. CORE collaborates with CDC to obtain data on foodborne illnesses and illness clusters and with FDA Centers and field staff to obtain laboratory and inspectional information related to contaminated foods and foodborne illness outbreaks. CORE’s Signals and Surveillance team coordinates isolate tracking activities among several organizations within FDA and CDC and the isolate database was developed for timely information sharing and early signal detection. 

Objective

To create a forum and database for FDA and CDC epidemiologists, laboratorians, and outbreak scientists for tracking recent food and environmental surveillance sampling isolates identified through Reportable Food Registries reports and regulatory inspectional findings, and analyzing them for matches to clinical isolates for early outbreak detection. 

Submitted by Magou on
Description

The Centers for Disease Control and Prevention (CDC) uses the National Poison Data System (NPDS) to conduct surveillance of calls to United States poison centers (PCs) to identify clusters of reports of hazardous exposures and illnesses. NPDS stores basic information from PC calls including call type (information request only or call reporting a possible chemical exposure), exposure agent, demographics, clinical, and other variables.

CDC looks for anomalies in PC data by using automated algorithms to analyze call and clinical effect volume, and by identifying calls reporting exposures to pre-specified high priority agents. Algorithms analyzing call and clinical effect volume identify anomalies when the number of calls exceeds a threshold using the historical limits method (HLM). Clinical toxicologists and epidemiologists at the American Association of Poison Control Centers and CDC apply standardized criteria to determine if the anomaly is a potential incident of public health significance (IPHS) and then notify the respective health departments and PCs as needed. Discussions with surveillance system users and analysis of past IPHS determined that call volume-based surveillance results in a high proportion of false positive anomalies. A study assessing the positive predictive value (PPV) of this approach determined that fewer than four percent of anomalies over a five-year period were IPHS.1 A low PPV can cause an unnecessary waste of staff time and resources. We hypothesized that first stratifying call volume by exposure category would reduce the number of false positives. With the help of medical toxicologists, we created 20 toxicologically-relevant exposure categories to test this hypothesis. 

Objective

Our objective was to determine if the detection performance of current surveillance algorithms to detect call clusters is improved by stratifying by exposure category. 

Submitted by Magou on

Use for the Analytic Solutions for Real-Time Surveillance: Asyndromic Cluster Detection consultancy held June 9-10, 2015 at the University of North Carolina, Chapel Hill.

Problem Summary

Submitted by ctong on