Breaking the Real Chart Rules To Follow

Stephanie’s original chart.

I inadvertently sparked a debate on Stephanie Evergreen’s blog, “How to show down is good” (go read it – it’s great). In the post, she showed a bar chart whose axis didn’t start at zero (above). The horror!

Not only is this a flagrant breaking of all The Laws Of Dataviz, it came the very week Nathan Yau published an excellent post “Real Chart Rules to Follow”. How dare she!

I commented on Stephanie’s blog that her bar chart was a valid example showing that you could break one of Nathan’s rules, because there’s no such thing as zero weight for an adult human.

Stephen Few made a great point that for Stephanie’s chart, a dot plot would probably be a better option. I agree. Jeffrey made a great point that half the bar length suggests that the person is half as heavy. Fair enough: I agree with that too.

I also agree with my original comment. I’ll defend my comment, but not to the death. Maybe to first scratch but not beyond.

Let’s move on. And anyway, I’ve also written before about zeroes on axes (skilfully avoiding the bar chart pitfall, you’ll notice). And I fundamentally believe there is an exception to every dataviz rule. We should educate people about the guidelines and teach them when they can be broken.

Finally, let me pose a question, to which I am genuinely intrigued to know the answer. Is the chart below valid? All I did was change the title and y-axis of Stephanie’s original chart. Now I am showing zero. Is this ok? 0lbs to target is still 150lbs in real weight.

Is this valid? It uses zero, but that zero still represents 150lbs?
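To make the question concrete, here is a minimal sketch of the rebasing, assuming some made-up weigh-ins (the numbers and the names `weights` and `lbs_to_target` are illustrative, not Stephanie’s data; only the 150lbs target comes from the chart). It shows why the two axes tell different stories: the same two bars have very different length ratios depending on where zero sits.

```python
# Hypothetical weigh-ins in lbs; illustrative only, not Stephanie's data.
TARGET_LBS = 150  # the weight that "0lbs to target" stands for

weights = {"Jan": 180, "Feb": 172, "Mar": 165, "Apr": 158}


def lbs_to_target(weight_lbs, target_lbs=TARGET_LBS):
    """Distance from the target weight; 0 means the target has been reached."""
    return weight_lbs - target_lbs


# Rebase every weigh-in so the axis starts at the target instead of at 0lbs.
rebased = {month: lbs_to_target(w) for month, w in weights.items()}

# Ratio of the Jan bar to the Mar bar on each scale.
ratio_real_weight = weights["Jan"] / weights["Mar"]   # about 1.09
ratio_to_target = rebased["Jan"] / rebased["Mar"]     # 2.0

print(rebased)
print(round(ratio_real_weight, 2), ratio_to_target)
```

On the rebased axis, half the bar length means half the distance left to the target, not half the body weight. Whether that makes the bar chart honest depends on whether the title and axis label make that reading obvious.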

Which chart should you use to show this data?

 

What’s the best way to show this data?

I’ve blogged before about there being no “correct” way to visualize a dataset. The video below shows how this is the case. Even when data is extremely simple, there are many ways to view it, each being better at answering a different question.

Conclusion? The trick isn’t to think “a line is the best way to show time data.” It’s to consider the question you want to answer. Manipulate and play with the data until the answer is clear.

How to drive the message home with the right dashboard

Today (Thu 16 April) I did a webinar for Tableau, “How to drive the message home with the right dashboard” (the recording will be available on that page very soon).

The slides are available here.

And here are the links to the resources I shared:

Design books and projects

Ranking UK political parties according to mentions on twitter by the media

Tableau Dashboards

How come we see bigfoot fewer times, despite us all now having smartphones?

Inspiration and further use cases

Viz of the Day: great messages, every day

Killing the paired bar chart

In which years did B outsell A?

Jon Schwabish just posted a nice solution to the problem of the side-by-side bar chart. I won’t go into why they’re A Bad Thing – he does that just fine.

I wanted to put this post together because it’s something I’ve been thinking about too. My solution is slightly different. Consider the side-by-side bar chart at the top showing sales of Product A and B over ten years. Too much ink! It’s confusing, and it’s really hard to see anything.

How else can we show this info and ask “in which years did B outsell A?” Simple. Do something heretical and connect the dots using a line (what? Use a line to connect discrete values? But you can’t do that!):

side by side slope

Because we’re so well evolved to see slopes, we quickly and easily see the three years in which B outsold A:

side by side highlight
Click here to download a workbook with this chart in it
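The slope chart answers the question by eye; the same comparison can be sketched in a few lines of code. This is a minimal illustration with invented sales figures (the `sales` values and the names `slope` and `years_b_outsold_a` are assumptions, not the data in the workbook). A line that rises from A to B marks a year in which B outsold A.

```python
# Hypothetical (A, B) sales per year; illustrative only, not the workbook's data.
sales = {
    2005: (120, 100),
    2006: (130, 135),
    2007: (125, 110),
    2008: (140, 120),
    2009: (150, 155),
    2010: (145, 130),
    2011: (160, 140),
    2012: (155, 150),
    2013: (170, 175),
    2014: (165, 160),
}


def slope(a_sales, b_sales):
    """Direction of the line drawn from A to B: positive means it rises."""
    return b_sales - a_sales


# Years whose A-to-B line slopes upwards, i.e. B outsold A.
years_b_outsold_a = [year for year, (a, b) in sales.items() if slope(a, b) > 0]

print(years_b_outsold_a)
```

The sign of the slope is the only thing the reader needs here, which is why the line works even though it connects two discrete categories.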

In this example, because it’s sales over time, I kept the years as separate panes.

With slightly different data, you can achieve the same results using a categorical slope chart. I’m doing this as part of my analytics based around the UK General Election (http://impartialityuk.tumblr.com/).

SNP and Lib Dem Mentions

A new role: evangelist

The Ancient Mariner
I pass, like night, from land to land;
I have strange power of speech;
That moment that his face I see,
I know the man that must hear me:
To him my tale I teach.
– The Rime of the Ancient Mariner, Samuel Taylor Coleridge

Today’s my birthday and Tableau gave me a nice present: a new role. I’m now Technical Evangelist for the company. I’m humbled, thrilled and delighted with this.

In many ways it’s what I’ve been doing, unofficially, since first buying Tableau 7 years ago, starting this blog in 2010 (here’s my first post) and then joining Tableau in 2011.

I’ve kinda always felt a little like the Ancient Mariner. He went through a crazy experience and then felt driven to share it with everyone he sees. That’s how I feel about visual analysis and Tableau.

(another reason for the quote? The Rime of the Ancient Mariner is my favourite poem. I even tried to learn it while riding a bike around New Zealand ten years ago. If you’ve not read it, I recommend you do. Then go read the hilarious version by Hunt Emerson. Finally, go listen to Iron Maiden’s epic tribute to the poem; go on, you know you want to!)

 

How many data points are too many? In praise of the small multiple.

My latest Huffington Post article (published Wed 28 Jan) discusses how amazing our visual system is at seeing very granular levels of detail. Here’s a rather shaky GIF of the different views going from 1 data point to over 10,000:

howmanymarks narrow

The inspiration for the column and this post was Ann K Emery’s 2015 data resolutions. I’ve always been a big fan of small multiples, but her specific statement to “do more small multiples” triggered my efforts to break the data out of the charts I’d been making with the Citibike data.

There have been lots of posts celebrating small multiples recently. My favourite is “A Big Article About Wee Things” by Propublica. Go read it! Go on.

What I really need to emphasise is that no single view is the “right” one. There’s no such thing as the “right” view. Being able to cycle through these very quickly in Tableau is immensely powerful – each view teases something else out of the data as you feel your way to insight. Each view shows something different and if you can see 30 views in 5 minutes, who knows what insights your data will reveal? What’s certain is that we can reflect on just how complex and yet clear 10,000+ marks appear:

All 10,246 marks in one place!
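For a feel of how quickly the mark count grows as you break the data out, here is a rough sketch. It assumes a view draws one mark per distinct combination of the dimensions placed in it; the records are generated and the field names (`weekday`, `hour`, `user_type`) are hypothetical, not the real Citibike data.

```python
from itertools import product

# Generated trip records: (weekday, hour, user_type). Not real Citibike data.
records = list(product(range(7), range(24), ("member", "casual")))

# Position of each hypothetical field within a record tuple.
FIELDS = {"weekday": 0, "hour": 1, "user_type": 2}


def count_marks(rows, dimensions):
    """Count distinct combinations of the chosen dimensions (one mark each)."""
    positions = [FIELDS[name] for name in dimensions]
    return len({tuple(row[i] for i in positions) for row in rows})


# Each extra dimension splits every existing mark into more, smaller ones.
for dims in ([], ["weekday"], ["weekday", "hour"], ["weekday", "hour", "user_type"]):
    print(dims, count_marks(records, dims))
```

With no dimensions there is a single aggregate mark; each added dimension multiplies the count, which is how a view gets from 1 mark to thousands in a handful of steps.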

3 ages of data viz?

I’ve got this idea for a future theme looking at “3 ages of data viz”. I want your thoughts. Is there something in this idea? Am I right? Are there more? What’s the NEXT age going to bring? What does this teach us about dataviz?

Age 1: The Excel disaster (pre 2000)

(image from http://peltiertech.com/)

The early spreadsheet designers got excited about graphics and gave us 3D exploded pie charts. If only they’d read some theory about effective dataviz, maybe we’d not have had 35+ years of fighting back against dataviz disasters. To be fair to Excel, as you can see above, the defaults weren’t really that bad, given the limits of the graphics cards of the day. Unfortunately, people got too excited about the 3D options.

Age 2: the Stephen Few fightback (2000-2010)

(from http://www.perceptualedge.com/)

Stephen Few took on the spreadsheet behemoths in the first decade of this century. He made us all see sense and put science-backed best practice on the pedestal. People saw the light and visual tools began to ditch the dross in favour of charts that actually work.

Age 3: the creative years (2010-present)

What’s the top data dog (http://tabsoft.co/1CF8TAr)

The problem with Stephen Few’s approach is that people found it, well, boring. Unarguably it was functionally correct and just right for operational business dashboards. But many people were left unmoved. They found that following it didn’t engage people. As data journalism flourished and infographics exploded, there was a realization that a balance needed to be struck.

At the extreme end, people like David McCandless found success with a design-trumps-function approach, but others, such as Alberto Cairo (see his Tapestry Conference slides) and Andy Kirk (8 hats), pushed the need to ENGAGE as well as INFORM.

Tell me your thoughts

My ideas are fluid around this. I’m trying to make the point that we’re in a great place with the combining fields of creative power and effective design. What else do I need to know?

 

Seven years ago this week….

I paid for my first Tableau license 7 years ago today (15 Jan 2008). To say that changed my life is no exaggeration.
I was a struggling data analyst using outdated, unsupported, inflexible BI tools provided by an underfunded, overworked IT department. I had a team of people who would spend 3 weeks producing 1 report for 1 faculty at the University of Oxford.
In desperation I searched the web for anything that might help me escape these shackles. I found this post by Stephen Few and clicked the link to this small software company’s website.
By the end of that afternoon, I had produced more useful analytics than we could do in a month. That was the afternoon my life changed. I put together a use case. Here it is:
The notes I made for my business case to buy 1 license of Tableau Desktop
Note that one of my risks is “not enough functionality” (this was v3.5). That was true but what it did was better than any single piece of software I have ever used. The great news is that Tableau is now a super-powerful machine.
Note also that the notes are written on an Oracle notepad. Ha! Take that, Oracle!
Looking back on my initial work, I am amazed I was pleased with what I was producing.
Some of my first work with Tableau
My experienced eyes see these views as unsophisticated and untidy. But the emotion I remember at the time was one of joy. I was playing with my data. I was asking and answering questions as quickly as I could think of them. I was unleashed.
I stayed at the University of Oxford for 4 great years. Over those years at Oxford, I began blogging, organised the first Tableau user group, and spoke at Tableau conferences. I was having more fun using a piece of software than I could have imagined.
Fun?
You’re not supposed to enjoy using business applications. That’s just wrong. But this was FUN. So much that I would use Tableau at home for personal projects. Can you imagine using other BI tools at home for fun?
It. Just. Doesn’t. Happen.
My amazing farewell cake from my colleagues at Oxford.
In 2011 Tableau was growing in Europe and I joined the company as its 6th European employee. My first desk was shared with a photocopier. It’s been just as much fun ever since and I am grateful to have had this opportunity. I’ve made amazing friends at work. I’ve travelled to amazing places. And I’ve been inspired by the incredible community this product has produced.
I now get visibility of the product roadmap – I am very confident the next 7 years are going to be as amazing as the first 7.