Data De-Identification Toolkit


Developing effective data-driven algorithms and visualizations for disease surveillance hinges on the ability to provide application developers with realistic data. However, the sensitivity of the data creates a barrier to its distribution. We have created a tool that assists data providers with de-identifying their data in preparation for sharing. The functions in the tool help data providers comply with the HIPAA 'Safe Harbor' de-identification standard by removing or obscuring information such as names, geographic locations, and identifying numbers.
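As a rough illustration of the kind of transformation described above, the following Python sketch removes or coarsens a few Safe Harbor identifier types in a single record. It is not the toolkit's actual code: the function name deidentify_record, the field names (name, ssn, zip, visit_date, age), and the restricted_zip3 parameter are all hypothetical, and the sketch assumes dates are ISO-formatted strings. Free-text fields, which can also contain identifiers, are passed through untouched here and would need separate handling.

```python
# Hypothetical field names; the toolkit's real schema is not given in the source.
DIRECT_IDENTIFIERS = {"name", "ssn", "medical_record_number"}


def deidentify_record(record, restricted_zip3=frozenset()):
    """Return a copy of `record` with a few Safe Harbor rules applied.

    restricted_zip3: 3-digit ZIP prefixes covering 20,000 or fewer people,
    which Safe Harbor requires to be replaced with "000". The caller must
    supply this set from census data; none is built in.
    """
    out = {}
    for key, value in record.items():
        if key in DIRECT_IDENTIFIERS:
            continue  # drop names and identifying numbers entirely
        if key == "zip":
            prefix = str(value)[:3]  # keep only the first three digits
            out[key] = "000" if prefix in restricted_zip3 else prefix
        elif key == "visit_date":
            # Assumes "YYYY-MM-DD"; only the year may be retained.
            out["visit_year"] = str(value)[:4]
        elif key == "age":
            # Ages over 89 are aggregated into a single "90+" category.
            out[key] = "90+" if int(value) >= 90 else int(value)
        else:
            out[key] = value  # non-identifying fields pass through unchanged
    return out


record = {
    "name": "Jane Doe",
    "ssn": "123-45-6789",
    "zip": "21205",
    "visit_date": "2014-03-17",
    "age": 93,
    "chief_complaint": "fever",
}
result = deidentify_record(record)
```

In this sketch the clinically useful fields survive in coarsened form (three-digit ZIP, year of visit, capped age), which reflects the trade-off the tool is meant to manage: the data stay realistic enough for developers while direct identifiers are removed.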


The objective is to develop a robust, flexible, and easy-to-use data de-identification tool that makes it easier for data providers to create data sets that can be shared with external collaborators.
