335435
Comparing social media comments about children's vaccines with national survey findings
Objective—To compare findings from analysis of social media content with findings from a national survey.
Methods--We used Natural Language Processing to analyze comments about childhood vaccinations from a social media forum popular with new and expecting mothers. We developed a sentiment analysis classification tool by manually coding 300 postings on the forum and training a computer learning algorithm to classify about 33,000 postings into positive, negative, and neutral sentiments. We conducted topic modeling using Latent Dirichlet Allocation on the social media postings to determine a number of distinct topics.
Results--The sentiment analysis classifier was not accurate, perhaps because attitudes toward childhood vaccination is complicated—parents may agree with the vaccine schedule but cringe when their children are stuck with needles. Topic modeling proved more useful; 25 topics were produced, each representing a profile of parents’ positions on vaccination. Some topics found by modeling were absent from the survey, which indicates that this analysis may be useful for checking content validity of survey instruments.
Discussion—Our next step will compare the findings from the social media site with responses to the NIS. Analyses will assess the extent to which the data from social media and surveys correlate, and will provide guidelines to communication professionals and researchers on opportunities and limitations for using social media content analysis to help prepare messages for public health campaigns.
Learning Areas:
Assessment of individual and community needs for health educationCommunication and informatics
Social and behavioral sciences
Learning Objectives:
Analyze social media content to help guide health communication campaigns
Compare quality of information from social media with survey responses
Explain strengths and limitations of information available from social media
Keyword(s): Social Media, Research
Qualified on the content I am responsible for because: I have over 20 years of experience in research and program evaluation. In my role as a social scientist for federal and state public health initiatives, I have used natural language processing techniques with social media content to enhance findings from traditional methods of formative research.
Any relevant financial relationships? No
I agree to comply with the American Public Health Association Conflict of Interest and Commercial Support Guidelines, and to disclose to the participants any off-label or experimental uses of a commercial product or service discussed in my presentation.