In this Section |
249420 Identifying Health-Related Topics on Twitter: An Exploration of Tobacco-Related Tweets as a Test TopicTuesday, November 1, 2011: 11:10 AM
Public health-related topics are difficult to identify in large conversational datasets like Twitter. This study examines how to model and discover public health topics and themes in tweets. Tobacco use is chosen as a test case to demonstrate the effectiveness of topic modeling via LDA across a large, representational dataset from the United States, as well as across a smaller subset that was seeded by tobacco-related queries. Topic modeling across the large dataset uncovers several public health-related topics, although tobacco is not detected by this method. However, topic modeling across the tobacco subset provides valuable insight about tobacco use in the United States. The methods used in this paper provide a possible toolset for public health researchers and practitioners to better understand public health problems through large datasets of conversational data.
Learning Areas:
Assessment of individual and community needs for health educationCommunication and informatics Planning of health education strategies, interventions, and programs Social and behavioral sciences Learning Objectives: Keywords: Tobacco, Internet
Presenting author's disclosure statement:
Qualified on the content I am responsible for because: I have experience in data mining, social networks, and research regarding social influences of health behavior. I agree to comply with the American Public Health Association Conflict of Interest and Commercial Support Guidelines, and to disclose to the participants any off-label or experimental uses of a commercial product or service discussed in my presentation.
See more of: Data Mining Technologies and Other Applications
See more of: Health Informatics Information Technology |