Skip to main content

Conway Michael

Description

We are developing a Bayesian surveillance system for realtime surveillance and characterization of outbreaks that incorporates a variety of data elements, including free-text clinical reports. An existing natural language processing (NLP) system called Topaz is being used to extract clinical data from the reports. Moving the NLP system from a research project to a real-time service has presented many challenges.

 

Objective

Adapt an existing NLP system to be a useful component in a system performing real-time surveillance.

Submitted by hparton on
Description

Ontologies representing knowledge from the public health and surveillance domains currently exist. However, they focus on infectious diseases (infectious disease ontology), reportable diseases (PHSkbFretired) and internet surveillance from news text (BioCaster ontology), or are commercial products (OntoReason public health ontology). From the perspective of biosurveillance text mining, these ontologies do not adequately represent the kind of knowledge found in clinical reports. Our project aims to fill this gap by developing a stand-alone ontology for the public health/biosurveillance domain, which (1) provides a starting point for standard development, (2) is straightforward for public health professionals to use for text analysis, and (3) can be easily plugged into existing syndromic surveillance systems.

 

Objective

To develop an application ontology - the extended syndromic surveillance ontology - to support text mining of ER and radiology reports for public health surveillance. The ontology encodes syndromes, diagnoses, symptoms, signs and radiology results relevant to syndromic surveillance (with a special focus on bioterrorism).

Submitted by hparton on
Description

Recently, a growing number of studies have made use of Twitter to track the spread of infectious disease. These investigations show that there are reliable spikes in traffic related to keywords associated with the spread of infectious diseases like Influenza [1], as well as other Syndromes [2]. However, little research has been done using Social Media to monitor chronic conditions like Asthma, which do not spread from sufferer to sufferer. We therefore test the feasibility of using Twitter for Asthma surveillance, using techniques from NLP and machine learning to achieve a deeper understanding of what users Tweet about Asthma, rather than relying only on keyword search.

Objective

We present a Content Analysis project using Natural Language Processing to aid in Twitter-based syndromic surveillance of Asthma

Submitted by rmathes on
Description

Vast amounts of free, real-time, localizable Twitter data offer new possibilities for public health workers to identify trends and attitudes that more traditional surveillance methods may not capture, particu- larly in emerging areas of public health concern where reliable sta- tistical evidence is not readily accessible. Existing applications include tracking public informedness during disease outbreaks. Twitter-based surveillance is particularly suited to new challenges in tobacco control. Hookah and e-cigarettes have surged in popular- ity, yet regulation and public information remain sparse, despite con- troversial health effects. Ubiquitous online marketing of these products and their popularity among new and younger users make Twitter a key resource for tobacco surveillance. 

Objective

We present results of a content analysis of tobacco-related Twitter posts (tweets), focusing on tweets referencing e-cigarettes and hookah. 

Submitted by jababrad@indiana.edu on
Description

Major depressive disorder has a lifetime prevalence of 16.6% in the United States. Social media platforms ā€“ e.g. Twitter, Facebook, Reddit ā€“ are potential resources for better understanding and monitoring population-level mental health status over time. Based on DSM-5 diagnostic criteria, our research aims to develop a natural language processing-based system for monitoring major depressive disorder at the population-level using public social media data.

Objective

We aim to develop an annotation scheme and corpus of depression-related tweets to serve as a test-bed for the development of natural language processing algorithms capable of automatically identifying depression-related symptoms from Twitter feeds.

Submitted by teresa.hamby@dā€¦ on