Machine Learning

Presented December 13, 2018.

For public health surveillance, is machine learning worth the effort? What methods are relevant? Do you need special hardware? This talk was motivated by these and other questions asked by ISDS members. It will focus on providing practical—and slightly opinionated—advice about how to determine whether machine learning could be a useful tool for your problem.

Presenter

Presented November 27, 2018.

Presented November 16, 2018.

The current opioid overdose/addiction crisis in the United States presents a challenge to public health intervention due to a lack of data on current and past incidence. Very little information is known regarding what is happening when/where and in comparison to the past. Marin County, California is addressing the lack of clarity in opioid overdose data by designing a novel cloud-based system to identify opioid overdoses for both surveillance and outreach purposes using county owned Emergency Medical Services (EMS) data.

Much attention has been given recently to the purported ability of social media to provide early warning and/or situational awareness and event characterization during a biological event of national concern. The National Biosurveillance Integration Center's (NBIC) innovation project on Social Media Analysis seeks to demonstrate the viability of extracting relevant, health information from social media data, with the ultimate goal to establish an operational social media system for biological event surveillance. Early work in this project has focused on demonstrating the relevance of social media to the biosurveillance problem through data analysis and algorithm development. Preliminary assessments of a commercial social media product also yielded valuable insights for the system architecture required to support such an operational tool. In addition to continued analysis of data utility (algorithm development) and system architecture, future work will include development of a comprehensive concept of operations (CONOPS) for implementation and use of a social media capability within the NBIC.

Objective

Through ongoing and future projects we will examine the utility of social media data for biosurveillance, including machine learning approaches for algorithm development, as well as the system and organizational architectures required to implement an operational system.

Submitted by knowledge_repo… on Wed, 08/22/2018 - 21:29

Scientists have utilized many chief complaint (CC) classification techniques in biosurveillance including keyword search, weighted keyword search, and naïve Bayes. These techniques may utilize CC-to-syndrome or CC-to-symptom-to-syndrome classification approaches. In the former approach, we classify a CC directly into syndrome categories. In the latter approach, we first classify a CC into symptom categories. Then, we use a syndrome definition, a combination of one or more symptoms, to determine whether or not a chief complaint belongs in a particular syndrome category. One approach to CC-to-symptom-to-syndrome classification uses manually weighted keyword search and Boolean operations to build syndrome classifiers. A limitation to this approach is that it does not address uncertainty in the data and the system is manually parameterized. A CC-tosymptom-to-syndrome approach that is both probabilistic and utilizes machine learning addresses these limitations.

Objective

Design, build and evaluate a symptom-based probabilistic chief complaint classifier for the Real-time Outbreak and Disease Surveillance System.

Referenced File

Syco_A_Probabilistic_Machine_Learning_Method_For_Classifying_Chief_Complaints_Into_Symptom_And_Syndrome_Categories.pdf