Skip to main content

BioSurveillance

Description

Modern biosurveillance data contains thousands of unique time series defined across various categorical dimensions (zipcode, age groups, hospitals). Many algorithms are overly specific (tracking each time series independently would often miss early signs of outbreaks), or too general (detections at state level may lack specificity reflective of the actual process at hand). Disease outbreaks often impact multiple values (disjunctive sets of zipcodes, hospitals, multiple age groups) along subsets of multiple dimensions of data. It is not uncommon to see outbreaks of different diseases occurring simultaneously (e.g. food poisoning and flu) making it hard to detect and characterize the individual events. We proposed Disjunctive Anomaly Detection (DAD) algorithm to efficiently search across millions of potential clusters defined as conjunctions over dimensions and disjunctions over values along each dimension. An example anomalous cluster detectable by DAD may identify zipcode = {z1 or z2 or z3 or z5} and age_group = {child or senior} to show unusual activity in the aggregate. Such conjunctive-disjunctive language of cluster definitions enables finding realworld outbreaks that are often missed by other state-of-art algorithms like What’s Strange About Recent Events (WSARE) or Large Average Submatrix (LAS). DAD is able to identify multiple interesting clusters simultaneously and better explain complex anomalies in data than those alternatives.

Objective

Disjunctive anomaly detection (DAD) algorithm can efficiently search across multidimensional biosurveillance data to find multiple simultaneously occurring (in time) and overlapping (across different data dimensions) anomalous clusters. We introduce extensions of DAD to handle rich cluster interactions and diverse data distributions

Submitted by ynwang@ufl.edu on
Description

Veterans accessing Veterans Affairs (VA) health care have higher suicide rates and more characteristics associated with suicide risk, including being male, having multiple medical and psychiatric comorbidities, and being an older age, compared with the general U.S. population. The Veterans Crisis Line is a telephone hotline available to Veterans with urgent mental health concerns; however, not all Veterans are aware of this resource. By contrast, telephone triage is a national telephone-based triage system used by the VA to assess and triage all Veterans with acute medical or mental health complaints.

Objective

To characterize Veterans who call telephone triage because of suicidal ideation (SI) or depression and to identify opportunities for suicide prevention efforts among these telephone triage users using a biosurveillance application.

 

Submitted by uysz on
Description

Temporal alerting algorithms commonly used in syndromic surveillance systems are often adjusted for data features such as cyclic behavior but are subject to overfitting or misspecification errors when applied indiscriminately. In a project for the Armed Forces Health Surveillance Center to enable multivariate decision support, we obtained 4.5 years of outpatient, prescription and laboratory test records from all US military treatment facilities. A proof-of-concept project phase produced 16 events with multiple evidence corroboration for comparison of alerting algorithms for detection performance. We used the representative streams from each data source to compare sensitivity of 6 algorithms to injected spikes, and we used all data streams from 16 known events to compare them for detection timeliness.

Objective

For a multi-source decision support application, we sought to match univariate alerting algorithms to surveillance data types to optimize detection performance.

Submitted by uysz on
Description

A decade ago, the primary objective of syndromic surveillance was bioterrorism and outbreak early event detection (EED. Syndromic systems for EED focused on rapid, automated data collection, processing and statistical anomaly detection of indicators of potential bioterrorism or outbreak events. The paradigm presented a clear and testable surveillance objective: the early detection of outbreaks or events of public health concern. Limited success in practice and limited rigorous evaluation, however, led to the conclusion that syndromic surveillance could not reliably or accurately achieve EED objectives. At the federal level, the primary rationale for syndromic surveillance shifted away from bioterrorism EED, and towards allhazards biosurveillance and SA. The shift from EED to SA occurred without a clear evaluation of EED objectives, and without a clear definition of the scope or meaning of SA in practice. Since public health SA has not been clearly defined in terms of operational surveillance objectives, statistical or epidemiological methods, or measurable outcomes and metrics, the use of syndromic surveillance to achieve SA cannot be evaluated.

Objective

Review concept of situation awareness (SA) as it relates to public health surveillance, epidemiology and preparedness. Outline hierarchical levels and organizational criteria for SA. Initiate consensus building process aimed at developing a working definition and measurable outcomes and metrics for SA as they relate to syndromic surveillance practice and evaluation.

Submitted by teresa.hamby@d… on
Description

Global biosurveillance is an extremely important, yet challenging task. One form of global biosurveillance comes from harvesting open source online data (e.g. news, blogs, reports, RSS feeds). The information derived from this data can be used for timely detection and identification of biological threats all over the world. However, the more inclusive the data harvesting procedure is to ensure that all potentially relevant articles are collected, the more data that is irrelevant also gets harvested. This issue can become even more complex when the online data is in a non-native language. Foreign language articles not only create language-specific issues for Natural Language Processing (NLP), but also add a significant amount of translation costs. Previous work shows success in the use of combinatory monolingual classifiers in specific applications, e.g., legal domain. A critical component for a comprehensive, online harvesting biosurveillance system is the capability to identify relevant foreign language articles from irrelevant ones based on the initial article information collected, without the additional cost of full text retrieval and translation.

Objective:

The objective is to develop an ensemble of machine learning algorithms to identify multilingual, online articles that are relevant to biosurveillance. Language morphology varies widely across languages and must be accounted for when designing algorithms. Here, we compare the performance of a word embedding-based approach and a topic modeling approach with machine learning algorithms to determine the best method for Chinese, Arabic, and French languages.

Submitted by elamb on
Description

NBIC is charged with enhancing the capability of the Federal Government to enable early warning and shared situational awareness of acute biological events to support better decisions through rapid identification, characterization, localization, and tracking. A key aspect of this mission is the requirement to integrate and collaborate with federal and, state, local, tribal, and territorial (SLTT) government agencies. NBIC develops and disseminates a variety of products to its stakeholders, including daily reports, ad-hoc reports, analytic collaborations, and leadership briefings upon request. Stakeholders interact with and utilize NBIC’s products in different ways, depending on the mission and jurisdiction involved. Specific collaborations with individual stakeholders are most frequent and evident during major infectious disease events, such as the recent Zika epidemic in the Americas and the associated microcephaly and other neurological disorders PHEIC. Collaborative efforts and known outcomes among varying levels of government are described in detail below in order to highlight NBIC’s integration focus and capabilities in this role.

Objective:

An important part of the National Biosurveillance Integration Center’s (NBIC) mission is collaboration with federal, state, local, tribal, and territorial governments for the purpose of enhancing early warning, shared situational awareness, and related decision support for infectious disease events. Several such collaborations occurred at multiple jurisdictional levels during the recent Zika epidemic in the Americas and the associated microcephaly and other neurological disorders Public Health Event of International Concern (PHEIC). The collaborations and their known outcomes from this major infectious disease event are described below, and NBIC stands ready to support similar efforts for future events.

Submitted by elamb on
Description

NBIC integrates, analyzes, and distributes key information about health and disease events to help ensure the nation’s responses are well-informed, save lives, and minimize economic impact. To meet its mission objectives, NBIC utilizes a variety of data sets, including open source information, to provide comprehensive coverage of biological events occurring across the globe. NBIC Biofeeds is a digital tool designed to improve the efficiency of analyzing large volumes of open source reporting and increase the number of relevant insights gleaned from this dataset. Moreover, the tool provides a mechanism to disseminate tailored, electronic message notifications in near-real time so that NBIC can share specific information of interest to its interagency partners in a timely manner. NBIC is deploying the tool for operational use by the Center and eventual use by federal partners with biosurveillance mission objectives. Core functionality for data collection, curation, and dissemination useful to other federal agencies was implemented, and NBIC is incorporating custom taxonomies for capturing metadata specific to the unique missions of NBIC partners.

Objective:

The National Biosurveillance Integration Center (NBIC) is deploying a scalable, flexible open source data collection, analysis, and dissemination tool to support biosurveillance operations by the U.S. Department of Homeland Security (DHS) and its federal interagency partners.

Submitted by elamb on
Description

The Epi Evident application was designed for clear and comprehensive visualization for monitoring, comparing, and forecasting notifiable diseases simultaneously across chosen countries. Epi Evident addresses the taxing analytical evaluation of how diseases behave differently across countries. This application provides a user-friendly platform with easily interpretable analytics which allows analysts to conduct biosurveillance with minimal user tasks. Developed at the Pacific Northwest National Laboratory (PNNL), Epi Evident utilizes time-series disease case count data from the Biosurveillance Ecosystem (BSVE) application Epi Archive. This diverse data source is filtered through the flexible Epi Evident workflow for forecast model building designed to integrate any entering combination of country and disease. The application aims to quickly inform analysts of anomalies in disease & location specific behavior and aid in evidence based decision making to help control or prevent disease outbreaks.

Objective:

Epi Evident is a web based application built to empower public health analysts by providing a platform that improves monitoring, comparing, and forecasting case counts and period prevalence of notifiable diseases for any scale jurisdiction at regional, country, or global-level. This proof of concept application development addresses improving visualization, access, situational awareness, and prediction of disease behavior.

Submitted by elamb on
Description

Twelve years into the 21st century, after publication of hundreds of articles and establishment of numerous biosurveillance systems worldwide, there is no agreement among the disease surveillance community on most effective technical methods for public health data monitoring. Potential utility of such methods includes timely anomaly detection, threat corroboration and characterization, follow-up analysis such as case linkage and contact tracing, and alternative uses such as providing supplementary information to clinicians and policy makers. Several factors have impeded establishment of analytical conventions. As immediate owners of the surveillance problem, public health practitioners are overwhelmed and understaffed. Goals and resources differ widely among monitoring institutions, and they do not speak with a single voice. Limited funding opportunities have not been sufficient for cross-disciplinary collaboration driven by these practitioners. Most academics with the expertise and luxury of method development cannot access surveillance data. Lack of data access is a formidable obstacle to developers and has caused talented statisticians, data miners, and other analysts to abandon the field. The result is that older research is neglected and repeated, literature is flooded with papers of varying utility, and the decision-maker seeking realistic solutions without detailed technical knowledge faces a difficult task. Regarding conventions, the disease surveillance community can learn from older, more established disciplines, but it also poses some unique challenges. The general problem is that disease surveillance lies on the fringe of disparate fields (biostatistics, statistical process control, data mining, and others), and poses problems that do not adequately fit conventional approaches in these disciplines. In its eighth year, the International Society of Disease Surveillance is well positioned to address the standardization problem because its membership represents the involved stakeholders including progressive programs worldwide as well as resource-limited settings, and also because best practices in disease surveillance is fundamental to its mission. The proposed panel is intended to discuss how an effective, sustainable technical conventions group might be maintained and how it could support stakeholder institutions.

Objective

The panel will present the problem of standardizing analytic methods for public health disease surveillance, enumerate goals and constraints of various stakeholders, and present a straw-man framework for a conventions group.

 

Submitted by Magou on
Description

GAS pharyngitis affects hundreds of millions of individuals globally each year, and over 12 million seek care in the United States annually for sore throat. Clinicians cannot differentiate GAS from other causes of acute pharyngitis based on the oropharynx exam, so consensus guidelines recommend use of clinical scores to classify GAS risk and guide management of adults with acute pharyngitis. When the clinical score is low, consensus guidelines agree patients should neither be tested nor treated for GAS. A prediction model that could identify very-low risk patients prior to an ambulatory visit could reduce low-yield, unnecessary visits for a most common outpatient condition. We recently showed that real-time biosurveillance can further identify patients at low-risk of GAS. With increasing emphasis on patient-centric health care and the well-documented barriers impeding clinicians’ incorporation of prediction models into medical practice, this presents an opportunity to create a patient-centric model for GAS pharyngitis based on history and recent local epidemiology. We refer to this model as the “home score,” because it is designed for use prior to a physical exam.

Objective

1. To derive and validate an accurate clinical prediction model (“home score”) to estimate a patient’s risk of group A streptococcal (GAS) pharyngitis before a health care visit based only on history and real-time local biosurveillance, and to compare its accuracy to traditional clinical prediction models composed of history and physical exam features. 2. To examine the impact of a home score on patient and public health outcomes.

Submitted by rmathes on