142nd APHA Annual Meeting and Exposition

Annual Meeting Recordings are now available for purchase

309155
Public Health Dashboard powered by Digital Disease Detection for Local Epidemiologists: A Gastrointestinal illness case study

142nd APHA Annual Meeting and Exposition (November 15 - November 19, 2014): http://www.apha.org/events-and-meetings/annual
Tuesday, November 18, 2014 : 1:30 PM - 1:50 PM

Elizabeth Musser, MPH candidate , Network Dynamics Science Simulaton Laboratory, Virginia Bioinformatics Inst, Blacksburg, VA
James Schlitt, Computational epidemiologist , Network Dynamics Science Simulation, Virginia Bioinformatics Inst, Blacksburg, VA
Bryan Lewis, MPH, PhD , Social and Decision Informatics Laboratory, Virginia Bioinformatics Institute at Virginia Tech, Arlington, VA
Stephen Eubank, PhD , Network Dynamics and Simulation Science Laboratory, Virginia Bioinformatics Institute at Virginia Tech, Blacksburg, VA
A toolkit harnessing high-throughput machine learning classification on publicly available data has been developed. A dashboard powered by this toolkit has been designed to provide digital disease detection for public health action. The disease detection algorithms rely on guided machine learning to filter for statements of interest via a human-trained Natural Language ToolKit (NLTK) classifier. The initial case study uses data from both Twitter and online media reports to identify outbreaks of gastro-intestinal illness. Statements are assigned a level of relevance and then a clustering algorithm detects elevations in disease activity across geographic regions. Relevant statements are then visualized on the dashboard as timeseries, annotated maps, and word flow series. Contextual meta-data for each statement is also presented along with more traditional sources like health care surveillance data, CDC reports, and school absenteeism. Importantly, the dashboard is designed to crowdsource relevance of the statements and incorporate additional data that isn’t present in the original statements.  This allows humans to augment the automated methods and share information through the dashboard itself. For example, an Emergency Medical Technician may know that a mentioned restaurant is operated by the same owner as restaurants previously cited for health violations, with this knowledge the local epidemiologist decides to take action thus quickly halting the outbreak. To ensure usability feedback from an expert panel of epidemiologists, pharmacists, culture and linguistic specialists, and infectious disease physicians was integrated in the dashboard’s design reflecting geographical norms regarding illness and disease. Additionally an evaluation component of the project will capture the utility of identified events and will be used to iteratively refine the dashboard.  The toolkit was designed to be readily adaptable for detection of diverse disease risk factors, languages, and public data sources. This modularity ensures an effective and efficient social media surveillance dashboard tailored for local use.

Learning Areas:

Epidemiology

Learning Objectives:
Analyze the utility of automatic high throughput Machine Learning Technology in Digital Disease Detection and Surveillance for the local epidemiologist

Keyword(s): Information Technology, Ethics

Presenting author's disclosure statement:

Qualified on the content I am responsible for because: Pursuing academic work in computational epidemiology. I have worked with multiple federally funded computational epidemiology grant projects involving social media mining and digital disease detection. My focus is on the practical application of IT projects to local and state health districts.
Any relevant financial relationships? No

I agree to comply with the American Public Health Association Conflict of Interest and Commercial Support Guidelines, and to disclose to the participants any off-label or experimental uses of a commercial product or service discussed in my presentation.