Abstract

Both Mirror and Complement: A Comparison of Social Media Data and Survey Data about Flu Vaccination

David Broniatowski, Ph.D.1, Mark Dredze, PhD2, Karen Hilyard, PhD3, Maeghan Dessecker, M.A.4, Sandra Crouse Quinn5, Amelia Jamison, M.A.6 and Michael Paul, PhD7
(1)George Washington University, Washington, DC, (2)Johns Hopkins University, Baltimore, MD, (3)FHI 360, Atlanta, GA, (4)University of Georgia, Athens, GA, (5)University of Maryland, School of Public Health, College Park, MD, (6)University of Maryland, College Park, MD, (7)University of Colorado, Boulder, Boulder, CO

APHA 2016 Annual Meeting & Expo (Oct. 29 - Nov. 2, 2016)

While social media can provide real time insight into knowledge, attitudes and behaviors, twin challenges for health communication researchers are gathering accurate, detailed demographic information about users and grappling with a large quantity of qualitative information. Surveys aptly handle these challenges, but may be too slow in an emerging public health crisis such as an infectious disease outbreak. Further, surveys often underrepresent young, urban participants and minorities, groups which can readily be captured via social media platforms. Using the 2014-15 influenza season as a test case, we employ rapid, computational Big Data techniques that use advanced natural language algorithms and geolocation to parse demographics and psychographics of social media. We compare our social media findings to published CDC survey data about vaccination, reporting where the data complement and/or overlap each other, where they diverge, and importantly, for which demographic groups. Our results suggest opportunities for researchers to track emerging epidemics more quickly and cheaply via social media, as well as suggesting other instances where social media may be insufficient and survey data is necessary to provide depth of detail in easily quantifiable and analyzable formats.

Communication and informatics Public health or related research Social and behavioral sciences