TY - JOUR
T1 - Topicality boosts popularity
T2 - a comparative analysis of NYT articles and Reddit memes
AU - Barnes, Kate
AU - Juhász, Péter
AU - Nagy, Marcell
AU - Molontay, Roland
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/12
Y1 - 2024/12
N2 - This study sheds light on interconnected topic dynamics across traditional news sources and social media platforms, emphasizing the influential role of topicality in shaping content popularity in social media. Using the Latent Dirichlet Allocation and BERTopic models, we define sets of 120 New York Times (NYT) topics to compare with 899,766 image-with-text memes from Reddit, showing that social media content aligns with many of the same topical patterns observed in news outlets. Topicality is formalized based on the temporal distributions of topics over the past 5 years. Using these topicality features, the investigation reveals significant correlations between the rising popularity of NYT topics and increased average upvotes on Reddit, particularly evident in “innovator” memes posted during the early stages of a topic’s prevalence in the NYT. Furthermore, topicality features show significant predictive power over other content-based control features in a CatBoost classifier prediction of viral Reddit memes.
AB - This study sheds light on interconnected topic dynamics across traditional news sources and social media platforms, emphasizing the influential role of topicality in shaping content popularity in social media. Using the Latent Dirichlet Allocation and BERTopic models, we define sets of 120 New York Times (NYT) topics to compare with 899,766 image-with-text memes from Reddit, showing that social media content aligns with many of the same topical patterns observed in news outlets. Topicality is formalized based on the temporal distributions of topics over the past 5 years. Using these topicality features, the investigation reveals significant correlations between the rising popularity of NYT topics and increased average upvotes on Reddit, particularly evident in “innovator” memes posted during the early stages of a topic’s prevalence in the NYT. Furthermore, topicality features show significant predictive power over other content-based control features in a CatBoost classifier prediction of viral Reddit memes.
KW - BERTopic
KW - Latent Dirichlet Allocation
KW - Machine learning
KW - Memes
KW - New York Times
KW - Popularity prediction
KW - Reddit
UR - http://www.scopus.com/inward/record.url?scp=85196630979&partnerID=8YFLogxK
U2 - 10.1007/s13278-024-01272-3
DO - 10.1007/s13278-024-01272-3
M3 - Journal article
AN - SCOPUS:85196630979
SN - 1869-5450
VL - 14
JO - Social Network Analysis and Mining
JF - Social Network Analysis and Mining
IS - 1
M1 - 119
ER -