Skip to main content

Scholer Matthew

Description

The variability of free text emergency department (ED) data is problematic for biosurveillance, and current methods of identifying search terms for symptoms of interest are inefficient as well as time- and labor-intensive. Our ad hoc approach to term identification for the North Carolina Disease and Epidemiologic Collection Tool (NC DETECT) begins with development of clinical case definitions from which we build automated syndrome queries in standard query language. The queries are used to search free text clinical data from EDs, with the goal of identifying free text terms to match the case definitions. The free text search terms were initially collected from epidemiologists and clinical and technical staff at NC DETECT through informal review of ED data. Over time, we reviewed individual cases missed by our queries and identified additional search terms. We also manually reviewed records to find misspellings, abbreviations and acronyms for known search terms (e.g., dypnea, diff. br. and SHOB for dyspnea), and developed a pre-processor to clean text prior to syndromic classification. The purpose of this project was to develop and test a more standardized approach to search term identification.

 

Objective

This paper describes and applies a new method for identifying biosurveillance search terms using the Semantic Network of the Unified Medical Language System.

Submitted by elamb on
Description

While early event detection systems aim to detect disease outbreaks before traditional means, following up on the many alerts generated by these systems can be time-consuming and a drain on limited resources.

Authorized users at local, regional and state levels in North Carolina rely on the North Carolina Disease Event Tracking and Epidemiologic Collection Tool's (NC DETECT) Java-based Web application to monitor and follow-up on signals based on the CDC’s EARS CUSUM algorithms. The application provides users with access to aggregate syndrome-based reports as well as to patient-specific line listing reports for three data sources: emergency departments, ambulance runs and the statewide poison control center. All NC DETECT Web functionality is developed in a user-centered, iterative process with user feedback guiding enhancements and new development. This feedback, along with the need for improved situational awareness and the desire to improve communication among users drove the development of the Annotation Reports and the Custom Event Report.

 

Objective

We describe the addition of two reports to NC DETECT designed to improve NC public health situational awareness capability.

Submitted by elamb on
Description

Text-based syndrome case definitions published by the Center for Disease Control (CDC)1 form the basis for the syndrome queries used by the North Carolina Disease Event Tracking and Epidemiologic Collection Tool (NC DETECT). Keywords within these case definitions were identified by public health epidemiologists for use as search terms with the goal of capturing symptom complexes from free-text chief complaint and triage note data for the purpose of early event detection and situational awareness. Initial attempts at developing SQL queries incorporating these search terms resulted in the return of many unwanted records due to the inability to control for certain terms imbedded within unrelated free text strings. For example, a query containing the search term “h/a”, a common abbreviation for headache, also returns false positives such as “cough/asthma”, “skin rash/allergic reaction” or “psych/anxiety”.  Simple abbreviations without punctuation, such as “ha”, were even more problematic.  Global wildcards ('%') indicate that zero or more characters of any type may substitute for the wildcard.2 The term “ha” as a synonym for "headache" appears frequently in the data, but searching this term bracketed by global wildcards returns any instance where the two letters appear together (e.g. pharyngitis, hand, hallucinations, toothache). Using global wild cards to search for common symptoms such as headache using simple abbreviations, with or without specialized punctuation, results in the return of many unwanted false positive records. We describe here the advanced application of SQL character set wildcards to address this problem.

Objective

This paper describes a novel approach to the construction of syndrome queries written in Structured Query Language (SQL). Through the advanced application of character set wildcards, we are able to increase the number of valid records identified by our queries while simultaneously decreasing the number of false positives.

Submitted by elamb on
Description

In North Carolina, select hospital emergency departments have been submitting data since 2003 for use in syndromic surveillance. These data are collected, stored, and parsed into syndrome categories by the North Carolina Emergency Department Database. The fever with rash illness syndrome is designed to capture smallpox cases. This syndrome was created as a combination of the separate fever and rash syndromes proposed by the consensus recommendations of the CDC’s Working Group on Syndrome Groups.

 

Objective 

This paper describes the construction of a syndromic surveillance case definition and a test for its ability to capture the appropriate syndromic cases.

Submitted by elamb on
Description

The goal of this paper is to describe a methodology used to create a gold standard set of emergency department (ED) data that can subsequently be used to evaluate the sensitivity and specificity of syndrome definitions.

Submitted by elamb on