Statistical Methods

With the increase in GPS enabled devices, pin-point spatial data is an obvious future growth area for cluster detection research. The FBSSS handles binary labelled point data, but requires Monte Carlo testing to obtain inference [1]. In the Bayesian Poisson SSS [2], Monte Carlo is replaced by use of historic data, manifoldly speeding up processing. Following [2], [3] derived the BBSSS, replacing historic data with expert knowledge on cluster relative risk. This paper compares the spatial accuracy of BBSSS and FBSSS using new measure [4] which, being independent of inference level, permits direct comparison between Bayesian and frequentist methods. To compare the spatial accuracy of a Bayesian Bernoulli spatial scan statistic (BBSSS) and the frequentist Bernoulli spatial scan statistic (FBSSS), using benchmark trials.

Submitted by elamb on Thu, 05/02/2019 - 08:52

Presented December 13, 2018.

For public health surveillance, is machine learning worth the effort? What methods are relevant? Do you need special hardware? This talk was motivated by these and other questions asked by ISDS members. It will focus on providing practical—and slightly opinionated—advice about how to determine whether machine learning could be a useful tool for your problem.

Presenter

There is limited closed-form statistical theory to indicate how well the prospective space-time permutation scan statistic will perform in the detection of localized excess illness activity. Instead, detection methods can be applied to simulated data to gain insight about detection performance. Such results are dependent on the way outbreaks are simulated and the nature of the background data. As an alternative, we explore an empirical approach in which the membership of a large health plan is used to represent a community and detection performance is assessed in samples from the larger group.

Objective

Our goal was to assess the impact of sentinel sample size and criteria for a signal on performance of daily prospective space-time permutation detection by comparing results in varying size random samples from a large health plan to results found in the full membership.

Submitted by Sandra.Gonzale… on Thu, 09/20/2018 - 13:44

Expectation-based scan statistics extend the traditional spatial scan statistic approach by using historical data to infer the expected counts for each spatial location, then detecting regions with higher than expected counts. Here we consider five recently proposed expectation-based statistics: the expectation-based Poisson (EBP), expectation-based Gaussian (EBG), population-based Poisson (PBP), populationbased Gaussian (PBG), and robust Bernoulli-Poisson (RBP) methods. We also consider five different time series analysis methods used to predict the expected counts (including the Holt-Winters method and moving averages optionally adjusted for day of week and seasonality), giving a total of 25 methods to compare. All of these methods are detailed in the full paper.

Objective

We present a systematic empirical comparison of five recently proposed expectation-based scan statistics, in order to determine which methods are most successful for which spatial disease surveillance tasks.

Submitted by Sandra.Gonzale… on Thu, 09/20/2018 - 13:16

Seasonal influenza accounts for a high proportion of outpatient morbidity during the winter months. However, influenza case counts are greatly underestimated due to frequently undiagnosed influenza. Electronic medical record (EMR) systems provide a very large, complex data source for influenza surveillance at both the patient and population level. It is important to identify influenza patients for specimen collection, respiratory isolation for school age children, prescription of an appropriate influenza drug, or to identify patients at risk for complications. At a population level, public health agencies monitor the tempo and spread of influenza season for resource management, as well as maintain situational awareness for avian influenza.

Objective

The objective of this work was to evaluate the utility of classification tree methods for syndromic surveillance case definition development using an EMR system as a data source.

Referenced File

Syndromic_Surveillance_Case_Definition_Development_Using_Recursive_Partitioning_Techniques_For_Highly_Dimensional_Databases.pdf