One of my biggest pet peeves in this “data journalism” movement that is afoot are the crappy graphs the pop up on major news sites. The biggest perpetrator is the Wonkblog, a policy blog on the Washington Post. They love using Excel or Google Docs, a byproduct of needing a quick and easy statistical program in order to keep posting content.
Most of the time, Excel of Google charts offend my sensibilities because they are not visually pleasing. Other times, they offend because they are wrong. For instance, in this recent Wonkblog post from February 18, Zachary Goldfarb included a graph from Health Affairs whose y-axis is completely wrong.
I’m pretty sure this graph doesn’t even need the right y-axis, but I don’t know. It’s just sloppy, and I can’t tell if its the fault of the source, Health Affairs, or WaPo because the article it comes from is behind a paywall.
Another instance of an incorrect or misleading graph is in a seemingly innocuous New York Times piece about House of Cards viewership through the lens of Twitter mentions. The Times hired a social analytics company named General Sentiment to analyze the social media impact of the release of House of Cards. Netflix doesn’t release their viewership numbers, so one has to scrape data from Twitter to get some idea. (Side note: Since Netflix is a public company, shouldn’t shareholders force management to show views?)
Looking at the Twitter stats is semi-interesting, but the Excel graphs that General Sentiment generated are lacking for the NYT:
- The graphs are comparing the same thing – tweets about House of Cards during the first 11 days of the release of seasons one and two – but the x-axes and y-axes are different.
- The Season One graph (the blue bars) starts at 10,000 instead of zero.
- I can understand using the dates for the y-axis, but using Day 1, Day 2, etc., would probably be better since, like I said before, they are comparing the same thing.
- Separating the seasons doesn’t make much sense because, everyone say it, they are comparing the same thing.
Below would be my Excel graphs of the same information in two different ways. I had to ballpark the numerical values. The article did mention totals over the 11 days, and I came close to those.
It’s great that news outlets are adding more and more graphs to their articles. They are really helpful to understand information that uses numbers. I hope that they pay just a little more attention to clarity, especially if they are relying on an outside source for their visualizations.