Swans and Geese
Ellesmere (Te Waihora) is a huge shallow lake south of Christchurch, which partially connects with the sea—in fact they bulldoze a channel to the ocean when they want to lower the lake level. As part of a conference on the management of te Waihora, I designed a poster for Ken Hughey of Lincoln University.
One of the things Ken wanted to depict was the population fluctuations of Canada geese (Branta canadensis) and black swans (Cygnus atratus) on the lake. He had several years of historical data, though fewer for swans than geese, and wanted to show how goose numbers had dropped below their optimum population level.This is the goose graph; the swan one looked much the same but with fewer bars.

The first thing I decided to do was place both sets of data on the same axes. Because the population data were continuous, it made more sense to link them in a line graph, which I created in Ken’s Excel file and copied into Illustrator. Normally I spend five minutes with the direct selection tool deleting all the crap that Excel adds to its graphs; in this case it was simpler to select the trend lines alone and pull them into a new layer, the create axes and numbers from scratch, using the original graph in the background layer as a guide.
(By the way, not many people seem to know that you can drag whatever’s selected from one layer to another by dragging that little colored square
near the layer’s name.)

To color the goose trend line, I used the eyedropper to sample some brown from a photo of a Canada goose; I just made the swan line black. Those colors were applied directly to the text in the title, as I didn’t want to have a key or legend. Working from Googled photos of each bird, I created silhouettes to label each line, again to avoid a key. It was easy enough to annotate the graph with a level line.
When a graph isn’t working, our temptation is to jazz it up. Excel has any number of ways of making graphs fancier: WordArt, shadows, gradients, 3D, backgrounds and so forth. But this usually makes things worse. Ken had used some of these in his lake level graph, but I thought it needed some simplifying.

I used the same color palette, stroke thicknesses, and font as in the swan and goose graph, so they looked like they belonged together on the poster. It was important, I felt, to translate the rather cryptic numbering scheme for lake openings into English, and to annotate the graph with lines to show the duration of lake openings, rather than just listing dates.

If I were to do this again from scratch, I might use a series of little horizontal lines, one at each sampling time, to represent the lake level, rather than a continuous line.

There’s more information on Ken Hughey’s Waihora research at his group’s web site. Thanks to Ken and to EOS Ecology, for whom I did the design work, for permission to reproduce these graphics. I especially appreciate it when a scientist is brave and altruistic enough to let me post their “before” versions in a forum like this.
My posting frequency has taken a hit since I started working full-time as an information designer, but when the ukulele book is finished I’ll be posting more regularly to Numberpix. The project for 2008 is to finish Pictures of Numbers, my book on data presentation. If readers of this blog have any suggestions for content, you’re welcome to email me: I’m mike, at numberpix.com.










People are poor at accurately judging areas; they do much better comparing linear measures like the lengths of a bar or the heights of a point. Areas can be useful where precision’s not important—circles can be scattered over a map, for example, to allow readers to scan for trends. But too often designers indicate data with areas because shapes are cooler than lines and you can arrange them in pretty patterns.
Note the largest value (892) and the fourth largest (436). One is just over twice the size of the other, and a circle twice the size of another should have a diameter √2 as big: about 1.4 times as wide. The larger circle in the graphic is actually about twice as wide, and it’s about four times as wide as the 204–225 circles. To see this amount of distortion this creates, compare the original proportions of the two largest circles, (right, top), with the corrected ones (right, below). I bet the designer just halved or doubled the circle diameters rather than actually calculated the areas required, which is pretty inexcusable.



Not particularly wanting to harsh on the same design company twice, but the New York Times Magazine included another screwed-up chart on Sunday, February 18th. In this one there are only nine actual data points, which could have been adequately shown with a plain bar chart, but that wouldn’t have looked cool enough, would it? So the designer decided to groove things up by repeating each very thin bar multiple times, and pulling the whole thing into a circle.
Sure enough, the bars weren’t even remotely to scale. I rotated them all to vertical, turning on the invisible grid to help, then typed the actual data into Excel and produced a quick bar chart, and juxtaposed the two (the Excel bars are flipped to make comparison easier).
But there’s another problem. If there were just one bar for each value, we’d at least all agree we should be comparing their heights. But using multiple bars creates a sort of exploded pie chart, with a wedge for each datum. Pie charts, clunky as they are, are a type of chart most people recognize, and we’re used to comparing areas, even if we don’t do it very accurately. But look at the exaggeration caused by mistaking the wedges for pie slices. I traced over the largest and smallest wedges and compared their areas; the larger has what Tufte calls a Lie Factor of 6.9 (doesn’t that sound imposing?), meaning it’s nearly seven times as large as it should be.
Almost every weekend the New York Times Magazine accompanies their first main story with a relevant infographic. They tend to be commissioned from outside agencies, and sometimes lack the good design one sees in
A good thing the designer did label the points, though, or we wouldn’t be able to see how misleading the graphic is. Absolute height doesn’t correspond to value, for example (see A). I’m guessing he or she did this to stop lines 2 and 3 from crossing—they did cross in inconvenient old reality, but that messes up the pretty pattern. Note that line 1 should be about three times the height of line 2, but I suppose that would create an ugly gap.
To redo this, I first generated a basic chart in Excel. I pasted this into a background layer in Illustrator, locked it, and just traced over all the components in a new layer (the chart is so simple it’s hardly worth ungrouping and deleting all the junk that Excel puts in its graphs). I came up with category names that were a bit more meaningful, and created a y-axis, which really only needs to be anchored by a few values.































