
Ellipse-Based Clustering Analysis Using a Time Series Algorithm

Description

Many US cities and the Centers for Disease Control and Prevention have deployed biosurveillance systems to monitor regional health status. These systems rely on algorithms that analyze data in the temporal domain (e.g., CUSUM), the spatial domain (e.g., SaTScan), or both. Spatial algorithms often require population information to normalize counts (e.g., emergency department visits) within a geographic region. This paper presents a new algorithm, Ellipse-based Clustering Analysis (ECA), that analyzes data in both the temporal and spatial domains: it applies time series analysis to each zip code to identify abnormal counts, then applies pattern recognition methods to find spatial clusters.
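The temporal step can be illustrated with a minimal sketch. The abstract does not name the time series algorithm ECA uses, so the code below substitutes a simple trailing-baseline z-score as a stand-in; the function name `flag_abnormal`, the parameters `baseline_days` and `threshold`, and the example zip codes and counts are all hypothetical.

```python
from statistics import mean, stdev


def flag_abnormal(counts_by_zip, baseline_days=7, threshold=3.0):
    """Flag zip codes whose latest daily count is unusually high.

    Assumption: a z-score against a trailing baseline stands in for the
    (unspecified) time series algorithm described in the abstract.
    """
    flagged = []
    for zip_code, series in counts_by_zip.items():
        if len(series) < baseline_days + 1:
            continue  # not enough history to form a baseline
        baseline = series[-(baseline_days + 1):-1]  # days before the latest one
        mu = mean(baseline)
        # Floor the standard deviation so very quiet series do not
        # trigger alarms on tiny absolute changes (illustrative choice).
        sd = max(stdev(baseline), 1.0)
        z = (series[-1] - mu) / sd
        if z > threshold:
            flagged.append(zip_code)
    return sorted(flagged)


# Hypothetical daily counts per zip code (latest day last).
example_counts = {
    "15213": [10, 12, 11, 9, 10, 11, 12, 30],
    "15217": [5, 6, 5, 7, 6, 5, 6, 6],
}
print(flag_abnormal(example_counts))
```

Because each zip code is compared only with its own history, a sketch like this needs no population denominator, which is the practical appeal of a per-region temporal test.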

Objective

This paper describes a new clustering algorithm, ECA, which uses a time series algorithm to identify zip codes with abnormal counts and a pattern recognition method to identify elliptical spatial clusters. Ellipses could help detect elongated clusters resulting from wind dispersion of bio-agents. We applied ECA to over-the-counter medicine sales data. The pilot study demonstrated the algorithm's potential for detecting clustered outbreak regions that could be associated with an aerosol release of bio-agents.
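To make the ellipse idea concrete, here is a minimal sketch of one common way to summarize a set of flagged locations as an ellipse: take the sample covariance of their coordinates and read the axes and orientation from its eigen-decomposition. This is an assumption for illustration only; the abstract does not specify the pattern recognition method ECA uses. The function name `fit_ellipse`, the `scale` parameter, and the sample coordinates (treated as planar x, y values) are hypothetical.

```python
import math


def fit_ellipse(points, scale=2.0):
    """Fit a covariance-based ellipse to 2-D points.

    Assumption: points are planar (x, y) centroids of flagged zip codes.
    Returns the center, semi-axis lengths, and major-axis angle in radians.
    """
    n = len(points)
    if n < 3:
        raise ValueError("need at least 3 points to fit an ellipse")
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    # Sample covariance matrix entries.
    sxx = sum((x - cx) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - cy) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - cx) * (y - cy) for x, y in points) / (n - 1)
    # Closed-form eigenvalues of a symmetric 2x2 matrix.
    half_trace = (sxx + syy) / 2.0
    root = math.sqrt(((sxx - syy) / 2.0) ** 2 + sxy ** 2)
    lam_major = half_trace + root
    lam_minor = max(half_trace - root, 0.0)  # guard against rounding below zero
    return {
        "center": (cx, cy),
        "semi_major": scale * math.sqrt(lam_major),
        "semi_minor": scale * math.sqrt(lam_minor),
        "angle_rad": 0.5 * math.atan2(2.0 * sxy, sxx - syy),
    }


# Hypothetical centroids lying roughly along a diagonal, as a plume might.
flagged_points = [(0.0, 0.0), (1.0, 1.1), (2.0, 1.9), (3.0, 3.1), (4.0, 4.0)]
ellipse = fit_ellipse(flagged_points)
print(ellipse)
```

A large ratio of `semi_major` to `semi_minor` indicates an elongated cluster, and `angle_rad` gives its direction, which is the kind of shape a circular scan window would describe poorly.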

Submitted by elamb on