Skip to main content

Generous Nicholas

Description

Government reporting of notifiable disease data is common and widespread, though most countries do not report in a machine-readable format. This is despite the WHO International Health Regulations stating that [e]ach State Party shall notify WHO, by the most efficient means of communication available. Data are often in the form of a file that contains text, tables and graphs summarizing weekly or monthly disease counts. This presents a problem when information is needed for more data intensive approaches to epidemiology, biosurveillance and public health. While most nations likely store incident data in a machine-readable format, governments can be hesitant to share data openly for a variety of reasons that include technical, political, economic, and motivational. A survey conducted by LANL of notifiable disease data reporting in over fifty countries identified only a few websites that report data in a machine-readable format. The majority (>70%) produce reports as PDF files on a regular basis. The bulk of the PDF reports present data in a structured tabular format, while some report in natural language or graphical charts. The structure and format of PDF reports change often; this adds to the complexity of identifying and parsing the desired data. Not all websites publish in English, and it is common to find typos and clerical errors. LANL has developed a tool, Epi Archive, to collect global notifiable disease data automatically and continuously and make it uniform and readily accessible.

Objective: Automatically collect and synthesize global notifiable disease data and make it available to humans and computers. Provide the data on the web and within the Biosurveillance Ecosystem (BSVE) as a novel data stream. These data have many applications including improving the prediction and early warning of disease events.

Submitted by elamb on
Description

The National Strategy for Biosurveillance defines biosurveillance as 'the process of gathering, integrating, interpreting, and communicating essential information related to all-hazards threats or disease activity affecting human, animal, or plant health to achieve early detection and warning, contribute to overall situational awareness of the health aspects of an incident, and to enable better decision-making at all levels.' However, the strategy leaves unanswered how 'essential information' is to be identified and integrated, or what the metrics qualify information as being 'essential'. Multi-Attribute Utility Theory (MAUT), a type of multi-criteria decision analysis, provides a structured approach that can offer solutions to this problem. While the use of MAUT has been demonstrated in a variety of fields, this method has never been applied to decision support in biosurveillance. We have developed a decision support analytic framework using MAUT that can facilitate identifying data streams for use in biosurveillance. We applied this framework to the problem of evaluating data streams for use in a global infectious disease surveillance system.

Objective

To describe how multi-criteria decision analysis can be applied to identifying essential biosurveillance information and demonstrate feasibility by applying it to prioritize data streams.

Submitted by elamb on
Description

Los Alamos National Laboratory (LANL) was tasked with developing methods to determine the relevance of data streams for an integrated global biosurveillance system. We used a novel method of evaluating the effectiveness of data streams called the 'surveillance window'. We defined a surveillance window as the brief period of time when information gathered can be used to assist decision makers in effectively responding to an impending outbreak. Information obtained for data streams beyond this window is deemed to have limited use.

Objective

The goal of this project was to provide an approach and evaluation of data stream utility for integrated, global disease surveillance. This effort is part of a larger project which is developing tools to aid decision-makers with timely information to predict, prepare for, and mitigate the spread of disease.

Submitted by knowledge_repo… on
Description

Most countries do not report national notifiable disease data in a machine-readable format. Data are often in the form of a file that contains text, tables and graphs summarizing weekly or monthly disease counts. This presents a problem when information is needed for more data intensive approaches to epidemiology, biosurveillance and public health. While most nations likely store incident data in a machine-readable format, governments are often hesitant to share data openly for a variety of reasons that include technical, political, economic, and motivational issues1. A survey conducted by LANL of notifiable disease data reporting in over fifty countries identified only a few websites that report data in a machine-readable format. The majority (>70%) produce reports as PDF files on a regular basis. The bulk of the PDF reports present data in a structured tabular format, while some report in natural language. The structure and format of PDF reports change often; this adds to the complexity of identifying and parsing the desired data. Not all websites publish in English, and it is common to find typos and clerical errors. LANL has developed a tool, Epi Archive, to collect global notifiable disease data automatically and continuously and make it uniform and readily accessible.

Objective:

LANL has built software that automatically collects global notifiable disease data, synthesizes the data, and makes it available to humans and computers within the Biosurveillance Ecosystem (BSVE) as a novel data stream. These data have many applications including improving the prediction and early warning of disease events.

Submitted by elamb on
Description

The re-emergence of an infectious disease is dependent on social, political, behavioral, and disease-specific factors. Global disease surveillance is a requisite of early detection that facilitates coordinated interventions to these events. Novel informatics tools developed from publicly available data are constantly evolving with the incorporation of new data streams. Re-emerging Infectious Disease (RED) Alert is an open-source tool designed to help analysts develop a contextual framework when planning for future events, given what has occurred in the past. Geospatial methods assist researchers in making informed decisions by incorporating the power of place to better explain the relationships between variables.

Objective:

The application of spatial analysis to improve the awareness and use of surveillance data.

Submitted by elamb on
Description

Infectious disease remains costly in human and economic terms. Effective and timely disease surveillance is a critical component of prevention and mitigation strategies. The limitations of traditional disease surveillance systems have motivated new techniques based upon internet data sources such as search queries and social media. However, 4 challenges remain before internet-based disease surveillance models can be reliably integrated into an operational system: openness, breadth, transferability, and forecasting. We evaluated a new data source, Wikipedia access logs, in these 4 challenges for global disease surveillance and forecasting

Objective

To explore the use of Wikipedia as a data source for disease surveillance.

Submitted by aising on
Description

Most countries do not report national notifiable disease data in a machine-readable format. Data are often in the form of a file that contains text, tables and graphs summarizing weekly or monthly disease counts. This presents a problem when information is needed for more data intensive approaches to epidemiology, biosurveillance and public health as exemplified by the Biosurveillance Ecosystem (BSVE). While most nations do likely store their data in a machine-readable format, the governments are often hesitant to share data openly for a variety of reasons that include technical, political, economic, and motivational issues. For example, an attempt by LANL to obtain a weekly version of openly available monthly data, reported by the Australian government, resulted in an onerous bureaucratic reply. The obstacles to obtaining data included: paperwork to request data from each of the Australian states and territories, a long delay to obtain data (up to 3 months) and extensive limitations on the data’s use that prohibit collaboration and sharing. This type of experience when attempting to contact public health departments or ministries of health for data is not uncommon. A survey conducted by LANL of notifiable disease data reporting in 52 countries identified only 10 as being machine-readable and 42 being reported in pdf files on a regular basis. Within the 42 nations that report in pdf files, 32 report in a structured, tabular format and 10 in a non-structured way. As a result, LANL has developed a tool-Epi Archive (formerly known as EPIC)-to automatically and continuously collect global notifiable disease data and make it readily accesible.

Objective

LANL has built a software program that automatically collects global notifiable disease data—particularly data stored in files—and makes it available and shareable within the Biosurveillance Ecosystem (BSVE) as a new data source. This will improve the prediction and early warning of disease events and other applications.

Submitted by teresa.hamby@d… on