Working Paper.
During the 2016 US Presidential election, Internet users created a huge quantity of memes about the candidates. In October 2016, a meme aggregator called Sizzle shared a subset of their meme data that had been filtered for political content. The data contain over 38,000 memes spanning four social networks: Facebook, Instagram, Twitter, and Imgur. This report first checks the relative quantity of candidate memes across social media platforms. We then test whether memes about Clinton or Trump receive more likes. Finally, we visualize the change in meme liking over time. Although liking a candidate meme is not equivalent to liking the candidate, more likes can reasonably be read as more attention. The plot of candidate meme likes over time reveals the month-to-month competition for attention among US presidential candidates.
Social media have played an ever-larger role in US Presidential elections. During Howard Dean’s 2004 election bid, campaign manager Joe Trippi leveraged online services, including blogs, to organize a “grassroots” fundraising movement that was surprisingly successful. Barack Obama’s 2008 and 2012 campaigns made extensive use of existing social media platforms, including Facebook, but also developed their own platforms for specific operations, as the Dean campaign had. In the lead-up to the 2016 campaign, Eric Schmidt guided the formation of The Groundwork, a team of web specialists who would develop specific expertise for consulting with political campaigns. The 2016 Clinton campaign has, in all likelihood, inherited much of this IT expertise. Less is understood about the 2016 Trump campaign, but it is already clear that social media are central to the campaign’s strategy.
Even as social media platforms consolidate power, those platforms have been leveraged by bot networks to increase the quantity of certain messages, alter the visibility of messages, spam comment threads, and vote to influence discussions. Bot networks, just like media platforms, represent the consolidation of control over network messaging. Therefore, our Agent model must allow for malicious actors who generate fake behaviours that are mixed with real behaviours in a way that we cannot distinguish.
The social media landscape is complex and the political battle currently being waged through these media is very messy, resulting in lots of noise in our social media data. Vast quantities of social media “likes” are attributable to non-human agents. We therefore proceed knowing our data consist of meticulously recorded, occasionally fraudulent behaviour indicators.
Data are from Sizzle, an online meme aggregator. Sizzle have coded the meme caption content and exported certain memes that matched candidate names. Each meme has the following data properties:
The general approach for this study is to import all the available CSV data, categorize it by candidate, and standardize likes within each social media platform so that they can be compared between platforms. This report is built with the R Language version 3.3.1.
The Sizzle data are separated by keyword, with a separate keyword corresponding to a separate CSV file. Because some candidates have results that are split across multiple files, we must aggregate across files. The data don’t load perfectly, so we also need to convert a few columns and parse timestamps.
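The aggregation step described above can be sketched as follows. This is a minimal illustration in Python using only the standard library, not the R code the report was built with. The file names, the keyword-to-candidate mapping, the column names (`network`, `likes`, `timestamp`), and the timestamp format are all assumptions made for the example, since the actual Sizzle export layout is not shown here.

```python
import csv
import io
from datetime import datetime

# Hypothetical mapping from keyword file to candidate; the real file
# names in the Sizzle export are not given in this report.
KEYWORD_TO_CANDIDATE = {
    "clinton.csv": "Clinton",
    "hillary.csv": "Clinton",
    "trump.csv": "Trump",
}

def load_memes(files):
    """Combine per-keyword CSV files into one list of meme records.

    `files` maps a file name to an open text handle. Column names and
    the timestamp format are assumed for illustration.
    """
    memes = []
    for name, handle in files.items():
        for row in csv.DictReader(handle):
            memes.append({
                "candidate": KEYWORD_TO_CANDIDATE[name],
                "network": row["network"].strip().lower(),
                # Convert text columns to usable types.
                "likes": int(row["likes"]),
                "timestamp": datetime.strptime(
                    row["timestamp"], "%Y-%m-%d %H:%M:%S"
                ),
            })
    return memes

# In-memory stand-ins for the CSV files, so the sketch runs on its own.
sample_files = {
    "clinton.csv": io.StringIO(
        "network,likes,timestamp\nTwitter,12,2016-09-01 10:00:00\n"
    ),
    "hillary.csv": io.StringIO(
        "network,likes,timestamp\nImgur,40,2016-10-02 08:30:00\n"
    ),
    "trump.csv": io.StringIO(
        "network,likes,timestamp\nFacebook,300,2016-03-15 18:45:00\n"
    ),
}
memes = load_memes(sample_files)
```

Two keyword files map to the same candidate here, which mirrors the case where one candidate's results are split across multiple files.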
The earliest meme in the set was shared on December 16, 2011 and the latest meme comes from October 27, 2016.
We look to see whether the networks exhibit similar patterns of liking behaviour.
These boxplots simply display raw scores.
Because social media platforms vary in their users and capabilities, liking dynamics are not the same across networks. As a result, we must standardize likes within networks so that we can make comparisons with all the available data.
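One common way to standardize within networks is to convert each meme's likes to a z-score using the mean and standard deviation of its own network. The sketch below, in Python rather than the report's R, illustrates that idea. It assumes a plain z-score with the population standard deviation, and the report's exact transformation may differ. The field names `network`, `likes`, and `z_likes` are hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev

def standardize_within_network(memes):
    """Return copies of the meme records with a per-network z-score added."""
    likes_by_network = defaultdict(list)
    for m in memes:
        likes_by_network[m["network"]].append(m["likes"])

    # Mean and population standard deviation for each network.
    stats = {
        net: (mean(values), pstdev(values))
        for net, values in likes_by_network.items()
    }

    scored = []
    for m in memes:
        mu, sd = stats[m["network"]]
        # Guard against a network whose memes all have identical likes.
        z = (m["likes"] - mu) / sd if sd > 0 else 0.0
        scored.append({**m, "z_likes": z})
    return scored

memes = [
    {"network": "twitter", "likes": 10},
    {"network": "twitter", "likes": 30},
    {"network": "facebook", "likes": 1000},
    {"network": "facebook", "likes": 3000},
]
scored = standardize_within_network(memes)
```

In this toy input the Facebook counts are a hundred times larger than the Twitter counts, yet both networks yield the same pair of standardized scores, which is what makes cross-network comparison possible.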
First, we will briefly describe the distribution of memes. Next, we’ll look for differences between the two main candidates. Finally, we’ll observe the candidate memes over time.
A simple visualization displays the total quantity of memes made about the candidates, as well as the platform where those memes were shared.
If anything, there may be a surprising number of Trump memes shared on Instagram, but otherwise the candidates appear to be proportionally distributed across platforms.
We compare the standardized, centred scores of the two main candidates, Clinton and Trump, using a t-test:
\(t(32632) = -5.23,\ p < 0.001\)
From these data, it appears Clinton memes are liked a little bit less than average, whereas Trump memes are liked a little bit more. There is a small but statistically significant difference between mean likes among these candidates.
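For readers who want to see how a statistic of this kind is computed, here is a minimal Python sketch of a two-sample t-test. It assumes Welch's unequal-variance form; the report does not state which variant was used, so this is an illustration and not a reproduction of the analysis. The two score lists are made-up toy values, not the Sizzle data.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(a, b):
    """Return Welch's t statistic and approximate degrees of freedom."""
    # Squared standard error of each group's mean (sample variance / n).
    va = variance(a) / len(a)
    vb = variance(b) / len(b)
    t = (mean(a) - mean(b)) / sqrt(va + vb)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df

# Toy standardized scores, invented for illustration only.
clinton_scores = [-0.3, -0.1, 0.0, -0.2, 0.1]
trump_scores = [0.2, 0.4, 0.1, 0.3, 0.5]
t, df = welch_t(clinton_scores, trump_scores)
```

A negative t with the first group listed first means that group's mean is lower, which matches the direction reported above for Clinton relative to Trump.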
Campaigns unfold over time and over the course of many media events. Candidates alternately have their moments of media spotlight on the basis of the attention they can attract. This attention may manifest as memes, which in turn create opportunities for likes. Since attention and likes will vary over time, we broke down meme scores by month to investigate time effects.
The plot depicts monthly average like scores for each candidate, with a locally weighted regression line (LOESS) fit to the data. The ups and downs of candidates over time, despite being called likes, should probably be interpreted as attention over time. Trump memes achieve a huge amount of attention during the early part of 2016. It is only in the most recent months that Clinton memes begin to gain relatively more likes (and therefore attention).
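The monthly breakdown behind a plot like this can be sketched in a few lines. The Python below groups standardized scores by candidate and calendar month and averages each group. It is an illustration only: the field names (`candidate`, `timestamp`, `z_likes`) and the sample records are assumptions, and the LOESS smoothing step is left out.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

def monthly_means(memes):
    """Average standardized like scores per (candidate, YYYY-MM) bucket."""
    buckets = defaultdict(list)
    for m in memes:
        key = (m["candidate"], m["timestamp"].strftime("%Y-%m"))
        buckets[key].append(m["z_likes"])
    # Sort keys so months come out in chronological order per candidate.
    return {key: mean(values) for key, values in sorted(buckets.items())}

# Invented records for illustration; not taken from the Sizzle data.
memes = [
    {"candidate": "Trump", "timestamp": datetime(2016, 3, 1), "z_likes": 0.5},
    {"candidate": "Trump", "timestamp": datetime(2016, 3, 20), "z_likes": 1.5},
    {"candidate": "Clinton", "timestamp": datetime(2016, 3, 5), "z_likes": -0.25},
    {"candidate": "Clinton", "timestamp": datetime(2016, 10, 5), "z_likes": 0.75},
]
averages = monthly_means(memes)
```

Each key in `averages` corresponds to one point on a monthly plot, one series per candidate.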
The story of the Trump campaign’s success with social media is clearly spelled out in this plot. The huge surge in likes from mid-2015 through mid-2016 corresponded to a huge amount of attention for Trump. It is not clear whether the memes are pro-Trump or anti-Trump, but either way we can reasonably infer that Trump memes were reaching more viewers.
Do people like Trump or Clinton? We can’t say from these data, at least not directly. Even if we were to employ sentiment tagging (e.g. with LIWC or another tool), it would not be possible to say with certainty whether the act of liking a meme translates to liking a candidate.
However, we can qualitatively digest the meme attention plot and conclude that it looks about right. It sort of looks like an opinion poll, although we can be certain it does not reflect popular opinion. It sort of looks like a popularity plot for trending tags or terms, but again, it’s not quite that either. Quite simply, the meme attention plot depicts how much general attention was given to memes about candidates over the course of months, and that’s interesting on its own.
Does the last-minute uptick in Clinton memes mean something important? Again, it is not possible to determine whether these memes are pro- or anti-candidate, so all we can say is that Clinton is receiving more likes and attention. Perhaps this is a good thing, as with “mere exposure” advertising. On the other hand, perhaps these last-minute Clinton memes are all critical, suggesting a turn in the tide. Only time will tell.
Be sure to vote by November 8, 2016.