<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
   <channel>
      <title>Pictures of Numbers</title>
      <link>http://www.numberpix.com/</link>
      <description>Practical tips for better data graphics.</description>
      <language>en</language>
      <copyright>Copyright 2008</copyright>
      <lastBuildDate>Sun, 20 Jan 2008 21:14:43 +1100</lastBuildDate>
      <generator>http://www.sixapart.com/movabletype/?v=3.33</generator>
      <docs>http://blogs.law.harvard.edu/tech/rss</docs> 

            <item>
         <title>Swans and Geese</title>
         <description><![CDATA[<p>Ellesmere (Te Waihora) is a huge shallow lake south of Christchurch, which partially connects with the sea&#8212;in fact they bulldoze a channel to the ocean when they want to lower the lake level. As part of a conference on the management of te Waihora, I designed a poster for Ken Hughey of Lincoln University.</p>

<p>One of the things Ken wanted to depict was the population fluctuations of Canada geese (<em>Branta canadensis</em>) and black swans (<em>Cygnus atratus</em>) on the lake. He had several years of historical data, though fewer for swans than geese, and wanted to show how goose numbers had dropped below their optimum population level.This is the goose graph; the swan one looked much the same but with fewer bars.</p>

<p><img alt="ken_geese.gif" src="http://www.numberpix.com/ken_geese.gif" width="560" height="327" /></p>

<p>The first thing I decided to do was place both sets of data on the same axes. Because the population data were continuous, it made more sense to link them in a line graph, which I created in Ken’s Excel file and copied into Illustrator. Normally I spend five minutes with the direct selection tool deleting all the crap that Excel adds to its graphs; in this case it was simpler to select the trend lines alone and pull them into a new layer, the create axes and numbers from scratch, using the original graph in the background layer as a guide.</p>

<p>(By the way, not many people seem to know that you can drag whatever’s selected from one layer to another by dragging that little colored square <img alt="layer_square.gif" src="http://www.numberpix.com/layer_square.gif" width="110" height="21" /> near the layer’s name.) </p>

<p><img alt="swangoose.gif" src="http://www.numberpix.com/swangoose.gif" width="544" height="303" /></p>

<p>To color the goose trend line, I used the eyedropper to sample some brown from a photo of a Canada goose; I just made the swan line black. Those colors were applied directly to the text in the title, as I didn’t want to have a key or legend. Working from Googled photos of each bird, I created silhouettes to label each line, again to avoid a key. It was easy enough to annotate the graph with a level line.</p>

<p>When a graph isn’t working, our temptation is to jazz it up. Excel has any number of ways of making graphs fancier: WordArt, shadows, gradients, 3D, backgrounds and so forth. But this usually makes things worse. Ken had used some of these in his lake level graph, but I thought it needed some simplifying.</p>

<p><img alt="ken_levels.gif" src="http://www.numberpix.com/ken_levels.gif" width="560" height="332" /></p>

<p>I used the same color palette, stroke thicknesses, and font as in the swan and goose graph, so they looked like they belonged together on the poster. It was important, I felt, to translate the rather cryptic numbering scheme for lake openings into English, and to annotate the graph with lines to show the duration of lake openings, rather than just listing dates.</p>

<p><img alt="ellesmere_depth.gif" src="http://www.numberpix.com/ellesmere_depth.gif" width="549" height="260" /></p>

<p>If I were to do this again from scratch, I might use a series of little horizontal lines, one at each sampling time, to represent the lake level, rather than a continuous line.</p>

<p><img alt="alt_levels.gif" src="http://www.numberpix.com/alt_levels.gif" width="311" height="113" /></p>

<p>There’s more information on Ken Hughey’s Waihora research at his group’s <a href="http://www.wet.org.nz/">web site</a>. Thanks to Ken and to <a href="http://www.eosecology.co.nz/">EOS Ecology</a>, for whom I did the design work, for permission to reproduce these graphics. I especially appreciate it when a scientist is brave and altruistic enough to let me post their &#8220;before&#8221; versions in a forum like this.</p>

<p><em>
My posting frequency has taken a hit since I started working full-time as an information designer, but when the <a href="http://www.keteukulele.co.nz">ukulele book</a> is finished I&#8217;ll be posting more regularly to Numberpix. The project for 2008 is to finish Pictures of Numbers, my book on data presentation. If readers of this blog have any suggestions for content, you&#8217;re welcome to email me: I&#8217;m mike, at numberpix.com.</em></p>
]]></description>
         <link>http://www.numberpix.com/2008/01/swans_and_geese.html</link>
         <guid>http://www.numberpix.com/2008/01/swans_and_geese.html</guid>
         <category>Makeovers</category>
         <pubDate>Sun, 20 Jan 2008 21:14:43 +1100</pubDate>
      </item>
            <item>
         <title>Robbins and Few</title>
         <description><![CDATA[<p><img alt="robbins.gif" src="http://www.numberpix.com/robbins.gif" width="40" height="60" class="cover"/>
<span class="title">Creating More Effective Graphs</span> <br />
<span class="author">Naomi Robbins</span> <br />
<span class="published">Wiley-Interscience, 2005 &bull; ISBN: 047127402X</span></p>

<p>This is a perfectly sensible, reasonable book, but it irritated me. First, it&#8217;s not that original: there&#8217;s extensive quoting from Cleveland&#8217;s books, including reproduction of many examples, while Playfair and Minard make their usual appearances, second-hand from Tufte. Recycling of previous authors&#8217; examples seems to be endemic in this literature; Tufte even recycles himself. Perhaps recycling examples fights global warming. Anyway, Robbins&#8217; debt to Cleveland is notable in the use of dot charts, and the proliferation of boxes around elements. I wouldn&#8217;t say the figures are particularly innovative or well-designed, and most of the text is just common sense, like &#8220;large markers are easier to read than small ones&#8221;, and &#8220;overlapping things are harder to read&#8221; (I&#8217;m paraphrasing, or course&#8212;the style is not this pithy.) There&#8217;s actually surprisingly little content; often a short paragraph will float in the middle of a page, padded with ornamental margin art, and a page or two after the graph it&#8217;s discussing. The whole thing could have easily been a quarter of the length. Plus it&#8217;s over US$40 and contains no color, which, given the fundamental importance and frequent misuse of color in amateur graph-making, seems unforgivable. So not a recommended purchase, particularly to the readers of this blog who may well have come across all this advice already.</p>

<p><img alt="few.gif" src="http://www.numberpix.com/few.gif" width="51" height="60" class="cover"/>
<span class="title">Information Dashboard Design</span> <br />
<span class="author">Stephen Few</span> <br />
<span class="published">O&#8217;Reilly, 2006 &bull; ISBN 0596100167</span></p>

<p>Few has a seemingly quite specific aim: producing a series of small charts and tables  that fit onto a single screen&#8212;an information dashboard. It&#8217;s aimed at, but not limited to, businesses who need to summarize performance indicators at a glance&#8212;many of the principles are quite general, and in fact scientists presenting data would do well to think in terms of a cluster of small figures that present related information, rather than a few big independent ones. There is plenty of good sensible advice on toning down garish colors, reducing chartjunk, trimming away superfluous labels and gridlines and so forth, in much the same vein as Tufte. I wouldn&#8217;t say it&#8217;s an essential book, particularly if you already have a work or two by the usual suspects. Nor does it add much that&#8217;s new. It does, though, recontextualize good practices, differing from Tufte in that the writing is very clear and straightforward, and so far more likely to be tolerated by a down-to-earth managerial or administrative type. The best strategy would be to show them the chapter on simplifying charts, then Few&#8217;s elegant high-density redesigns at the end, so they leave you alone and stop asking for colorful banners and animated logos. </p>
]]></description>
         <link>http://www.numberpix.com/2007/09/kosslyn_and_robbins.html</link>
         <guid>http://www.numberpix.com/2007/09/kosslyn_and_robbins.html</guid>
         <category>Reading List</category>
         <pubDate>Tue, 11 Sep 2007 14:51:43 +1100</pubDate>
      </item>
            <item>
         <title>Disproportionate Risk</title>
         <description><![CDATA[<p>Here&#8217;s another graphic from those folks at Catalogtree, published a while ago in the <em>New York Times Magazine</em>, in an article on the influence Shia Islam is likely to have on Middle East elections. So where are all the Shiites then?</p>

<p><img align="center" alt="nyt_shiites.gif" src="http://www.numberpix.com/nyt_shiites.gif" width="400" height="237" /></p>

<p>If you said Iran and Bahrain, you are so, so wrong. Pakistan actually has 60 times as many Shiites as Bahrain, a tiny country with far fewer Shiites than Iraq. What the graphic is showing is percentages, not numbers; percentages of a total population we&#8217;re not told. The designers probably decided that percentages were most important when you&#8217;re discussing elections, but the convention in these types of graphics is that each little person stands for an actual amount (say, a million people), rather than 1%.</p>

<p>If we go the the <em>Foreign Affairs</em> <a href="http://www.foreignaffairs.org/20060701faessay85405-p0/vali-nasr/when-the-shiites-rise.html">article</a> cited, we find the following data.</p>

<p><img align="center" alt="shiite_table.gif" src="http://www.numberpix.com/shiite_table.gif" width="380" height="325" /></p>

<p>Note that Catalogtree chose just a few of these countries to graph; Bahrain rather than, say, Afghanistan or Syria. Why? Well, it rather looks like they read no further than paragraph 2 of the multi-page article (their choices are the <strong>bold</strong> ones):</p>

<blockquote>&#8220;Shiites account for about 90 percent of <strong>Iran</strong>ians, some 70 percent of the people living in the Persian Gulf region, and approximately 50 percent of those in the arc from <strong>Lebanon</strong> to <strong>Pakistan</strong> &#8212; some 140 million people in all.… Recent events in <strong>Iraq</strong> have already mobilized the Shiites of <strong>Saudi Arabia</strong> (about 10 percent of the population); during the 2005 Saudi municipal elections, turnout in Shiite-dominated regions was twice as high as it was elsewhere. … The Shiites of Lebanon (who amount to about 45 percent of the country&#8217;s population) have touted the formula, as have the Shiites in <strong>Bahrain</strong> (who represent about 75 percent of the population there), who will cast their ballots in parliamentary elections in the fall.&#8221;</blockquote>

<p>The main problem with including <em>all</em> the countries in the table is obvious when you graph it: India dwarfs everybody else. In other circumstances, one might log-transform the <em>x</em> axis, but that&#8217;s just silly when you&#8217;re trying to compare numbers and percentages.</p>

<p><img align="center" alt="shiite_bars.gif" src="http://www.numberpix.com/shiite_bars.gif" width="403" height="271" /></p>

<p>One solution, and it&#8217;s the one used in the <em>Foreign Affairs</em> article, is to indicate percentages on a map rather than on a bar graph. The size of countries is very roughly proportional to their population anyway, and by cropping the map one keeps India from dominating it.</p>

<p><img align="center" alt="shiitesFA.jpg" src="http://www.numberpix.com/shiitesFA.jpg" width="475" height="500" /></p>

<p>This map has a few problems, though. Because it uses no colors, it relies on rather odd patterns to convey the percentage of Shiites. It&#8217;s also a bit of a tangle of coastlines, borders, and pointers. I tried a redesign with a more intuitive color palette, scaling back boundaries as much as possible. Of course you&#8217;d want to label countries as well.</p>

<p><img alt="new_shiite_map.gif" src="http://www.numberpix.com/new_shiite_map.gif" width="449" height="318" /></p>

<p>We still don&#8217;t know the actual numbers of Shiites, though. That could be done by overlaying little person-markers proportional to numbers, the way the Catalogtree graph seemed to be doing but wasn&#8217;t. I intentionally didn&#8217;t arrange the Shiites in serried ranks, like an army on a parade ground presumably about to march on the effete West.</p>

<p><img alt="new_shiite_map2.gif" src="http://www.numberpix.com/new_shiite_map2.gif" width="449" height="318" /></p>

<p>Of course, it ends up looking like a certain board game. But I guess when you want to show hordes of figures camped on countries, you just have to run that Risk.</p>

<p><img alt="shiites_risk.jpg" src="http://www.numberpix.com/shiites_risk.jpg" width="318" height="160" /></p>

<p>Reference: Vali Nasr, <a href="http://www.foreignaffairs.org/20060701faessay85405-p0/vali-nasr/when-the-shiites-rise.html">When the Shiites Rise</a>,
<em>Foreign Affairs</em>, July/August 2006</p>
]]></description>
         <link>http://www.numberpix.com/2007/09/disproportionate_risk_1.html</link>
         <guid>http://www.numberpix.com/2007/09/disproportionate_risk_1.html</guid>
         <category>Makeovers</category>
         <pubDate>Tue, 04 Sep 2007 23:49:35 +1100</pubDate>
      </item>
            <item>
         <title>Chameleon Ray Grid</title>
         <description><![CDATA[<p>David Shiffman, an undergraduate at Duke University, was working with Dr. Dan Rittschof on stingray feeding behavior for his honors thesis, which required catching several rays of two different species and keeping them in captivity. While they were holding them in two different kinds of tank, they noticed to their surprise a color change in both species&#8212;a result completely incidental to research, but interesting enough for David to summarize for me thus:</p>

<p><img alt="stingray1.jpg" src="http://www.numberpix.com/stingray1.jpg" width="560" height="408" /></p>

<p>The problem with this graphic is that it separates different individuals of the same species, and it&#8217;s a little confusing to see how the tank acts as a treatment and the ray color is the result. I suggested he rearrange the photos into a matrix, which is often a good way to show the interaction between factors (in this case, ray color and tank color).</p>

<p><img align="center" alt="stingray2.jpg" src="http://www.numberpix.com/stingray2.jpg" width="440" height="516" /></p>

<p>I&#8217;d suggest some pretty standard changes: removing boxes around things wherever possible, paying attention to the typeface, and showing rather than labeling the tank color. </p>

<p><img align="center" alt="newrays.jpg" src="http://www.numberpix.com/newrays.jpg" width="485" height="536" /></p>

<p>I&#8217;ve chosen here to make a swatch of the ray color, which can be an excellent way to show color change, but the sacrifice is the distinctive silhouettes of the two species, which make it much easier for us to follow what&#8217;s happened. The ideal situation would be better-quality photos of each ray, showing the whole fish without reflections to distract the reader. The rays should be photographed on a neutral background, or using each of the actual tanks as a background (the photos currently all use the same gray background, which just confuses things). The advantage of using the actual tanks is that you&#8217;d no longer need to label the colors&#8212;they&#8217;d be self-evident. The only text necessary would be the species names. Show, not tell.</p>
]]></description>
         <link>http://www.numberpix.com/2007/08/chameleon_ray_grid.html</link>
         <guid>http://www.numberpix.com/2007/08/chameleon_ray_grid.html</guid>
         <category>Makeovers</category>
         <pubDate>Tue, 28 Aug 2007 17:44:48 +1100</pubDate>
      </item>
            <item>
         <title>Maps for Scientists: Using</title>
         <description><![CDATA[<p>You&#8217;ll usually be wanting to work with maps that are EPS graphics, which are the standard for the publishing industry. If you have a Windows Metafile (.wmf), it&#8217;s fine for editing inside Word or PowerPoint (here&#8217;s a <a href="http://www.presentationmaps.com/PPtut.html">tutorial</a>) but not much else. Personally I prefer to work with EPS files and export them as GIF images when I need to use them in a PowerPoint show. The workflow below is for Illustrator, but other vector graphics packages should work similarly.</p>

<p><img alt="maptextfile.gif" src="http://www.numberpix.com/maptextfile.gif" width="94" height="57" style="float: left; margin-right: 20px; margin-bottom: 0px; margin-top: 0px;"/>Your map may start off in PostScript or Illustrator format. PostScript is a language used to define the lines and curves that make up vector graphics; a PostScript file looks like an ordinary text file, and indeed you can open it with a word processor, although it&#8217;s scarcely enthralling reading. Once you open it in Illustrator or any other vector graphics program that speaks <img alt="openmaptextfile.gif" src="http://www.numberpix.com/openmaptextfile.gif" width="143" height="59" style="float: right; margin-left: 10px; margin-bottom: 10px; margin-top: 10px;"/>PostScript, it will display as a series of shapes with lines and fills. It&#8217;s also possible to buy maps already saved in Illustrator format, which allows them to have complex layering and makes them easier for you to edit. Older Illustrator maps will need to have their fonts updated when you open them in Illustrator CS; this is an option you&#8217;re given when you first open them, and there&#8217;s no going back. Use Save As to make a working copy of the original. Of course, you should be using Save As all the time anyway to make multiple working versions of any graphic you&#8217;re editing. Go on, you&#8217;ll thank me one day.</p>

<p><img alt="northatlantic1.gif" src="http://www.numberpix.com/northatlantic1.gif" width="560" height="232" /></p>

<p>If the graphic has layers, turn off or delete the ones you don&#8217;t want. If it doesn&#8217;t have layers, similar objects (land, sea) are probably arranged in groups, so you can select them all with a single click. Sometimes, though, there are just too many points on a complex path (for example, a detailed coastline) for it to be a single Illustrator object. If that&#8217;s the case, and the things you want to change are tiled or split into several groups, you have two choices. You can shift-click to select multiple objects, or you can do a Select > Same (for example, Same > Fill Color) to catch everything sharing the properties of what&#8217;s already selected. This is useful for catching every last one of a cloud of things like rivers or placenames if they&#8217;re not handily grouped.</p>

<p><img alt="selectsame.gif" src="http://www.numberpix.com/selectsame.gif" width="560" height="261" /></p>

<p>When you have a group of objects selected, I&#8217;d recommend converting them to grayscale, since most maps you make will have to be reproduced in black and white at some point. You can pick a color model from the popup menu in the Color palette.  (You know those little triangles <img alt="popup.gif" src="http://www.numberpix.com/popup.gif" width="14" height="12" /> are popup menus, don&#8217;t you? Not everyone does.) Make sure the shades of gray are far enough apart to remain distinctive after printing or photocopying (perhaps at least 20%), and avoid less than 15% or greater than 85%, as these will degrade to solid white or black after a couple of rounds of photocopying. In this map, the pale blue ocean converted to 9% Black when I switched to grayscale, so I bumped it up to 15%.</p>

<p><img alt="northatlantic2.gif" src="http://www.numberpix.com/northatlantic2.gif" width="560" height="154" /></p>

<p>If you don&#8217;t need all the layers in an object, you can delete them, but sometimes you&#8217;ll just want to hide them in case they&#8217;re needed one day. These invisible layers are still included when you send something to the printer or export it as an EPS, which can have a big effect on file size. To keep an invisible layer available but not have it included when you print or export, uncheck the Print checkbox in the Layer Options dialog, which is in the Layers popup menu.</p>

<p><img alt="layeroptions.gif" src="http://www.numberpix.com/layeroptions.gif" width="560" height="95" /></p>

<p>Many of the shapes will have a line and a fill, and you almost never need both. Remove unnecessary lines; I set the lines of latitude and longitude to white, and eliminated the black box around the whole map. I selected the lakes and made their line and fill the same 15% Black as the ocean, then changed the rivers to make them blend in with the land (so they don&#8217;t show up, but don&#8217;t leave gaps). The countries now have a 60% stroke and fill; turning off the stroke left some gaps along state borders, so I just set it to the same color as the fill. I deleted the placenames, and changed the color of the coastlines to 60% Black again.</p>

<p><img alt="northatlantic3.gif" src="http://www.numberpix.com/northatlantic3.gif" width="557" height="230" /></p>

<p>It&#8217;s perfectly possible to simplify things further, of course. Feel free to eliminate lines and labels and fade back land and sea as required to make your point. To crop away parts of the map you don&#8217;t want, draw a box over the desired portion (this needs to be the topmost object), select everything, then choose Crop from the <img alt="cropmap.gif" src="http://www.numberpix.com/cropmap.gif" width="229" height="155" style="float: left; margin-right: 20px; margin-bottom: 0px; margin-top: 0px;"/> Pathfinder palette. This will trim away everything outside the boundaries of the topmost object. (Remember to save a backup version!) For the map below, I&#8217;ve added a colored shape with a Gaussian blur, and used the Brush tool with the Arrows brush library to draw the currents. Clicking on an arrow with the text tool turns it into a path for labels to march along.</p>

<p><img alt="northatlantic4.gif" src="http://www.numberpix.com/northatlantic4.gif" width="560" height="250" /></p>

<p>You can now use Save As to create an EPS for print publication, or Save For Web if you want a GIF or JPEG (the former is probably a better choice with maps) for PowerPoint or the Web. Remember, if you&#8217;re using color make sure it&#8217;s CMYK if the map&#8217;s for print, RGB if it&#8217;s for a computer screen (another reason why we save multiple versions&#8230;) There&#8217;s plenty more we could say about map editing, so feel free to mail me tips or queries, or comment below.</p>
]]></description>
         <link>http://www.numberpix.com/2007/08/maps_for_scientists_using_1.html</link>
         <guid>http://www.numberpix.com/2007/08/maps_for_scientists_using_1.html</guid>
         <category>Tips and Techniques</category>
         <pubDate>Tue, 21 Aug 2007 23:59:49 +1100</pubDate>
      </item>
            <item>
         <title>Maps for Scientists: Choosing</title>
         <description><![CDATA[<p>If you&#8217;re creating a map, you&#8217;re best to start with an illustration in vector, not bitmap, format (unless of course it&#8217;s an aerial photograph)—you want to be able to scale it, make global changes to simple color schemes, hide and show layers containing place names, distributions, and the like, and keep the file size relatively small and the resolution high. All this is what vector graphics are designed for. </p>

<p>Most vector maps commercially available will be EPS files, though PDF is becoming more common; Illustrator can read them both. There are any number of map art vendors on the web; <a href="http://www.mapresources.com/">Map Resources</a> and <a href="http://digital-vector-maps.com/">Digital Vector Maps</a> are typical. Expect to pay $50 or more for a single map, and several hundred dollars for a CD with a selection of them. It&#8217;s worthwhile for a whole department or lab to invest in a full set of countries of the world, plus globes in a variety of projections, but this is often too expensive for an individual. There are very few places where you can get inexpensive quality vector maps online; here are two.</p>

<p><strong>National Geographic</strong>&#8217;s website <a href="http://www.nationalgeographic.com/xpeditions/atlas/index.html">gives away</a> world and country map, with placenames or just plain vanilla, in PDF format. Illustrator can convert these and open them, though you probably don&#8217;t own the fancy fonts that National Geographic uses, so it will substitute Arial or somesuch. It&#8217;s not too difficult to select and edit the fonts, though. The maps are plainly marked as copyrighted, though modifying them for non-commercial purposes would seem to be allowed under Fair Use provisions. They&#8217;re being given away explicitly for educational use, after all.</p>

<p><strong>iStockPhoto</strong> has an enormous amount of artwork available very cheaply; a <a href="http://www.istockphoto.com/file_search.php?action=file&amp;majortermsList=&amp;majortermsConjunction=all&amp;advancedSearchState=&amp;browserSearchState=&amp;oldtext=&amp;orientation=7&amp;fileTypeSizePrice=%5B%7B%22type%22%3A%22Illustration+%5BVector%5D%22%2C%22size%22%3A%22Vector+Image%22%2C%22priceOption%22%3A%22All%22%7D%2C%7B%22type%22%3A%22Flash%22%2C%22size%22%3A%22Flash+Document%22%2C%22priceOption%22%3A%22None%22%7D%2C%7B%22type%22%3A%22Video%22%2C%22size%22%3A%22None%22%2C%22priceOption%22%3A1%7D%2C%7B%22type%22%3A%22Image%22%2C%22size%22%3A%22None%22%2C%22priceOption%22%3A%221%22%7D%5D&amp;minWidth=0&amp;minHeight=0&amp;showPeople=0&amp;printAvailable=0&amp;exclusiveArtists=0&amp;extendedLicense=&amp;illustrationLimit=Exactly&amp;flashLimit=Exactly&amp;perPage=20&amp;showTitle=&amp;showContributor=&amp;showFileNumber=1&amp;showDownload=&amp;enableLoupe=1&amp;order=Downloads&amp;9a4c2fe8af44c2603f7d31ea58a71f12_1_202740=cartography&amp;color=&amp;copySpace=%7B%22Tolerance%22%3A1%2C%22Matrix%22%3A%5B%5D%7D&amp;text=cartography&amp;within=4">basic cartography search</a> brings up a bunch of globes, and you can search more specifically for <em>Mollweide</em> and so on. Most cost around a buck. This is an excellent site for quickly finding quality images of any kind, especially generic advertising photography useful for illustrating brochures and blog postings; it&#8217;s worth bookmarking and buying a bunch of credits just in case you need a cheap photo in a hurry.</p>

<p><img alt="natgeo_istock.jpg" src="http://www.numberpix.com/natgeo_istock.jpg" width="559" height="231" /></p>

<p>The next big decision is choosing an appropriate map projection. The Earth is a sphere and paper is flat, so any representation of the planet has to distort something. Distances, shapes, or areas: what gets distorted depends on what you want to use the map for.</p>

<p>Regardless, rectangular world maps are generally bad. The traditional <strong>Mercator</strong> projection (which radically distorts sizes), and the <strong>Peters</strong> projection (which distorts shapes and distances) should both be avoided. The Mercator is designed for sailors plotting a compass bearing, but isn&#8217;t much good for anything else. Because it overemphasizes polar area, it has to be cropped to not look silly, and the Southern hemisphere usually comes off worse in the cropping because, well, there&#8217;s less stuff down there (it&#8217;s really not an insidious plot by European mapmakers). The Peters projection is a political rebuttal to the Mercator, but in keeping relative land area equal it sacrifices everything else. Peters didn&#8217;t invent  equal-area maps; there are many different one, going back to the 19th century, and the Peters version is by no means the best, though it&#8217;s certainly the most <a href="http://en.wikipedia.org/wiki/Peters_projection">controversial</a>; see the discussion in Monmonier (2004).</p>

<p>
<img alt="mercator_peters.gif" src="http://www.numberpix.com/mercator_peters.gif" width="560" height="235" />
</p>

<p>If you allow the poles to contract a little, and depart from a rectangle, you distort relative sizes far less. The <strong>Robinson</strong> projection was adopted by the National Geographic as a good general-purpose world map, and later replaced by the <strong>Winkel Tripel</strong> projection. For most of us, as long as you avoid rectangles, it doesn&#8217;t make much difference which projection you use (there are plenty more available, all different solutions to fitting a sphere on a page).</p>

<p><img alt="robinson_winkel.jpg" src="http://www.numberpix.com/robinson_winkel.jpg" width="560" height="187" /></p>

<p>Here&#8217;s another one. The <strong>Mollweide</strong> is one of a number of projections that preserves relative area, but in the process distorts shape. One way to reduce the distortion is to interrupt it at the oceans (assuming, of course, you&#8217;re not interested in oceans). These orange-peel <strong>interrupted</strong> maps come in many different variations, and should be used more widely. I&#8217;ve never seen one for marine biologists, carving up land masses so as to accurately portray the oceans, but it must exist.</p>

<p><img alt="Mollweide-interrupted.gif" src="http://www.numberpix.com/Mollweide-interrupted.gif" width="560" height="280" /></p>

<p>If instead of portraying the whole planet you only want to focus on one area, a <strong>planar</strong> projection might be better. The usual ones we see are the round maps of the North or South poles, but they can be centered on any part of the world; they&#8217;re most accurate in the middle and become progressively more distorted as you reach the edges. In the past I&#8217;ve used Antarctic planar projections to depict the breakup of Gondwana and the circumpolar distribution of the ratites. If you&#8217;re buying a set of maps, make sure it comes with a good variety of planar projections (usually just categorized as &#8220;globes&#8221;), including one centered on your area of interest.</p>

<p><img alt="globes.gif" src="http://www.numberpix.com/globes.gif" width="560" height="295" /></p>

<p>References: Map projections are of course well covered in <a href="http://en.wikipedia.org/wiki/Map_projection">Wikipedia</a> and a little more accessibly on <a href="http://kartoweb.itc.nl/geometrics/Map%20projections/body.htm">this page</a>. Also check out <em>Flattening the Earth: Two Thousand Years of Map Projections</em> by John P. Snyder (University of Chicago Press, 1997 new ed.) and <em>Rhumb Lines and Map Wars: a Social History of the Mercator Projection</em> by Mark Monmonier (University of Chicago Press, 2004).</p>
]]></description>
         <link>http://www.numberpix.com/2007/08/maps_for_scientists_choosing_1.html</link>
         <guid>http://www.numberpix.com/2007/08/maps_for_scientists_choosing_1.html</guid>
         <category>Tips and Techniques</category>
         <pubDate>Tue, 14 Aug 2007 01:47:50 +1100</pubDate>
      </item>
            <item>
         <title>How to Silhouette</title>
         <description><![CDATA[<p>If you&#8217;re a biologist, having an nice representative outline of your study organism is very handy, especially if it can be scaled to any size without losing sharpness, while remaining svelte enough to attach to an email. If you know your graphic formats, I&#8217;m talking about Encapsulated PostScript, or EPS, the vector graphics format that Illustrator and other drawing programs use. Unfortunately, most of us only have photos or sketches of our organisms, in a bitmapped format like TIFF or JPEG. How do we go from one to the other? In <a href="http://www.numberpix.com/2005/12/march_of_the_ratites.html">March of the Ratites</a>, I briefly discussed how to make an EPS silhouette, but here&#8217;s the step by step version. </p>

<p><img alt="pentool.gif" src="http://www.numberpix.com/pentool.gif" width="26" height="23" style="float: left; margin-right: 20px; margin-bottom: 0px; margin-top: 5px;"/>You&#8217;ll start in Illustrator, Freehand, or any other drawing program that has a pen tool. If you&#8217;re new to Illustrator, learning how to use the pen tool to create and edit paths is very strongly recommended: there are pen tool tutorials <a href="http://www.sketchpad.net/drawing8.htm">here</a>, <a href="http://dx-xel.blogspot.com/2007/07/illustrator-tutorial-how-to-use-pen.html">here</a>, 
<a href="http://veerle.duoh.com/index.php/blog/comments/illustration_from_sketch_to_finish_part_one/">here</a>, and <a href="http://www.tutorialbuzz.com/2007/07/using-pen-tool-in-illustrator.html">here</a>. <a href="http://www.creativetechs.com/tips/tip_resources/AdobePenTool-Cheatsheet.pdf">Here&#8217;s</a> a cheat sheet.</p>

<p>Start with a photo. Google&#8217;s <a href="http://images.google.com">Image search</a> is good; large images are best. You&#8217;re looking for a view from the side with good strong edges. If you&#8217;re working from a fuzzy sketch or indistinct picture, print it large, color it in with a magic marker, and scan it back in. It&#8217;s also possible to find a few images and composite them together in Photoshop, as I&#8217;ve done below. (I was just at the University of Queensland, where these big ugly megapodes, called <a href="http://en.wikipedia.org/wiki/Australian_brush-turkey">Brush Turkeys</a> (<em>Alectura lathami</em>) steal food from cafeteria tables.)</p>

<p><img alt="brushturkey.jpg" src="http://www.numberpix.com/brushturkey.jpg" width="560" height="347" /></p>

<p><img alt="layer.gif" src="http://www.numberpix.com/layer.gif" width="257" height="120" height="23" style="float: left; margin-right: 20px; margin-bottom: 5px; margin-top: 0px;"/>
File > Place the photo or scan in an Illustrator document. You may want to double-click the layer, lock it, and set its opacity to 50% if you&#8217;re planning to trace round it. If you choose the Template option when placing it, Illustrator will do exactly that for you. Create a new layer to draw on.</p>

<p><img alt="strokefill.gif" src="http://www.numberpix.com/strokefill.gif" width="57" height="52" style="float: right; margin-left: 10px; margin-bottom: 0px; margin-top: 5px;"/>
Choose the Pen tool, set the stroke to solid and the fill to none, and start clicking your way around the outline. Press down to drag out Bézier handles when you want a curve; you can adjust the exact curvature later. Most of your points will be curves; for the few acute or obtuse angles you can just click to make a corner point. If you make a mistake, choose Undo. When you get all the way round, click on your starting point to close the curve. It took me about five minutes to work my way around the Brush Turkey, and that was using a laptop trackpad. A mouse is far easier, and a graphics tablet better still&#8212;even a small graphics tablet, like the Wacom Graphire (look for a used older model) is a worthwhile investment if you plan on making a lot of information graphics.</p>

<p align="center">
<img alt="addingpoints.jpg" src="http://www.numberpix.com/addingpoints.jpg" width="400" height="199" /></p>

<p>Once you&#8217;ve finished the outline, switch to the Direct Selection tool, zoom in, and start adjusting the points and curves. Reposition the points, and lengthen/shorten/reposition the Bézier handles until the outline matches the photo. You don&#8217;t have to get it perfect, as the silhouette will usually be printed quite small. Use the cheat sheet or tutorials above to learn how to add and remove points and change them from curve to corner.</p>

<p><img alt="editingpoints.jpg" src="http://www.numberpix.com/editingpoints.jpg" width="560" height="160"/></p>

<p>Hide the background layer, set the Fill to black, and check the overall look. You may find some weirdness where you&#8217;ve accidentally clicked too many times or dragged the wrong way; you can usually fix this by zooming in very close and removing unwanted points (by clicking on them with the Pen tool). </p>

<p>If you&#8217;re happy, save a just-in-case backup Illustrator file (with the .AI extension), delete the background layer, and use Save As to make an Illustrator EPS (probably best to make it CMYK with a black-and-white TIFF preview; if this is Greek to you, check out my <a href="http://www.giantflightlessbirds.com/workshops/scanning_and_graphics.pdf">handout</a> on graphics formats). You can still edit the EPS in Illustrator, but it&#8217;s a common file type that can be understood and imported by most page layout and image editing programs. Because it&#8217;s a vector graphic, you can also resize it without changing the resolution, and the file size is a bit smaller: mine is 368K, a third the size of the original photo. Once you&#8217;ve made a few silhouettes, you&#8217;ll find a myriad of uses for them, from labelling phylogenies to tagging graphs to sprucing up your grant proposals and letterheads. Enjoy.</p>

<p><p align="center"><img alt="brushturkeysilhouette.gif" src="http://www.numberpix.com/brushturkeysilhouette.gif" width="318" height="199"  /></p.</p>

<p>Many thanks to David Booth and Yvonne Eiby for chatting with me about Brush Turkeys, and to the University of Queensland&#8217;s Department of Integrative Biology (particularly Cynthia Riginos and Lyn Cook) for sponsoring my visit.</p>
]]></description>
         <link>http://www.numberpix.com/2007/08/how_to_silhouette.html</link>
         <guid>http://www.numberpix.com/2007/08/how_to_silhouette.html</guid>
         <category>Tips and Techniques</category>
         <pubDate>Sun, 05 Aug 2007 05:06:17 +1100</pubDate>
      </item>
            <item>
         <title>Beautiful Evidence</title>
         <description><![CDATA[<p><a href="http://www.amazon.com/Beautiful-Evidence-Edward-R-Tufte/dp/0961392177/ref=pd_bbs_sr_1/002-2754518-6016001?ie=UTF8&amp;s=books&amp;qid=1173054240&amp;sr=8-1"><img alt="be.jpg" src="http://www.numberpix.com/be.jpg" width="51" height="60" class="cover"/></a>
<span class="title">Beautiful Evidence</span> <br />
  <span class="author">Edward R. Tufte</span> <br />
  <span class="published">Graphics Press, 2006 • ISBN: 0961392177</span></p>

<p>Of all Tufte’s books, I felt this was most like a curate’s egg. Good in parts, but quite a bit of stuff we’ve heard before, and some less-palatable portions.</p>

<p>It&#8217;s a bit of a grab-bag. One chapter, rather grandiosely titled <em>Fundamental Principles of Analytical Design</em>, is yet another discussion of the Minard graphic showing the retreat from Moscow. The fundamental principles, if you’re interested, are: use multivariate data, documented, integrated with explanation and text, showing causality and explanatory comparisons. There’s a chapter famously and largely justifiably condemning PowerPoint, already available for years as a separate <a href="http://www.edwardtufte.com/tufte/books_pp
">pamphlet</a>. There’s a nice explanation of sparklines, <img alt="sparkline.jpg" src="http://www.numberpix.com/sparkline.jpg" width="68" height="13" /> tiny trend lines that can be integrated into text, also posted <a href="http://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=0001OR&amp;topic_id=1">online</a>. There are photographs of Tufte’s sculptures, for some reason. And there’s a discussion of figure/text integration in the <em>Hypnerotomachia Poliphili</em>. Now, I confess to being a big typography nerd, so will happily read all about the <em>Hypnerotomachia</em>. But I wonder what most of the scientists I know with  Tufte tucked dutifully on their shelves will make of it.</p>

<p>What I found most bothersome was the discussion of cladograms, (evolutionary trees inferred from attributes of living organisms). Tufte initially <a href="http://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=00018e&amp;topic_id=1">posted</a> a much more critical analysis of a fairly typical cladogram, with jabs at the use of well-understood technical terms like “strict consensus” (Tufte wrote, “Sounds like <img alt="cladogramette.jpg" src="http://www.numberpix.com/cladogramette.jpg" width="113" height="86" style="float: right; margin-left: 20px; margin-bottom: 5px; margin-top: 5px;"/>
marketing, not science.”) There was a subsequent pile-on by some very reputable systematists, and Tufte backpedaled, claiming he was just a curious interloper in their field. Nevertheless, the watered-down version of the critique that made it into <em>Beautiful Evidence</em> still accuses biologists of using feel-good pitch words with “broad cheerleading meanings” to cloak their dodgy “editorial judgements” and questionable “statistical crunching”. Yes, cladograms are information-poor, and convey undue certainty to those who aren’t familiar with them, so Tufte’s skepticism is perhaps excusable, if not really justified. Skepticism vanishes a few pages later when lauding Feynman diagrams, which to me seem to contain far more dodgy handwaving than a good cladogram.</p>

<p>If you already have the first three books, this is by no means necessary. There seems to be a trend towards increasing length, cost, use of color, and hodgepodgery in Tufte’s books; perhaps the fifth in the series will buck this trend when it comes out it about (I predict) 2015.</p>
]]></description>
         <link>http://www.numberpix.com/2007/05/beautiful_evidence.html</link>
         <guid>http://www.numberpix.com/2007/05/beautiful_evidence.html</guid>
         <category>Reading List</category>
         <pubDate>Thu, 10 May 2007 17:25:27 +1100</pubDate>
      </item>
            <item>
         <title>Mike’s Tip List</title>
         <description><![CDATA[<p>I was asked to come up with a list of data presentation tips for scientists; here&#8217;s the result. Distilling one&#8217;s philosophy down to bullet points is an exercise I recommend. Not much of the following will be news to regular readers of this site, I suspect, but feel free to pass it on. Or draw up your own, and send me the link.</p>

<p><img alt="bigmike.jpg" src="http://www.numberpix.com/bigmike.jpg" width="560" height="206" /></p>

<ul>
<li>Sketch out ideas on paper first, before you turn on the computer. All graphics used to be drawn by hand. Software reduces creativity; good graphics are created despite your software.</li>
<li>People will look at your pictures before they read your text, if they read it at all. Graphics have to be self-contained. Put your conclusion right there in the caption.</li>
<li>The graphic has to tell a story (if it doesn&#8217;t, don&#8217;t use it) and your job is to keep redesigning it until the story is as clear as possible.</li>
<li>Show the actual data, as much as you can. People can deal with much greater information density than you think. Your job is to help them see the patterns in the data, but&#8230;</li>
<li>Show as little non-data stuff as you can. Remove boxes, lines, colored backgrounds, grids, shadows, and other decoration, except where it&#8217;s essential to understanding the data. If you can&#8217;t remove it, fade it out or make it smaller, thinner, or dotted.</li>
<li>Minimize the number of steps required to interpret your graphic. Don&#8217;t put required information in the text that could go in the caption, or in the caption if it could go in a key, or in a key if you could just label the points or lines directly.</li>
<li>Avoid color; it disappears on photocopying or printing. Use contrasting thicknesses, tints, line styles or shapes first, then color. Your graphic must work in black and white.</li>
<li>If you use color, use an intuitive scale that relates sensibly to your data, not all the colors of the rainbow. Make sure colors vary in intensity, not just hue, and remember some of your readers will be color-blind.</li>
<li>Provide context. Always use a scale and give sources. Six small, related graphs juxtaposed in the space we&#8217;d usually use for just one provide more than six times as much content.</li>
<li>Learn some basic typography, Illustrator, and Photoshop. It&#8217;s not hard to find tutorials, and they&#8217;re wonderful transferable skills.</li>
<li>Never print out your slides. Give people a handout with your contact details, a couple of graphics or tables (including any too detailed for a PowerPoint slide), your conclusions, and a bibliography.</li>
<li>Don&#8217;t make lists of bullet points, like this one. Show, not tell.</li>
</ul>

<p>Manifestos are great for removing nuance and blurring away contention, aren&#8217;t they? That&#8217;s why they&#8217;re so satisfying to write.</p>
]]></description>
         <link>http://www.numberpix.com/2007/02/mikes_tip_list.html</link>
         <guid>http://www.numberpix.com/2007/02/mikes_tip_list.html</guid>
         <category>Tips and Techniques</category>
         <pubDate>Wed, 28 Feb 2007 18:50:53 +1100</pubDate>
      </item>
            <item>
         <title>Deceptive Areas</title>
         <description><![CDATA[<p><img alt="areas_circle.gif" src="http://www.numberpix.com/areas_circle.gif" width="213" height="363" style="float: left; margin-right: 20px; margin-bottom: 5px; margin-top: 5px;"/>People are poor at accurately judging areas; they do much better comparing linear measures like the lengths of a bar or the heights of a point. Areas can be useful where precision&#8217;s not important—circles can be scattered over a map, for example, to allow readers to scan for trends. But too often designers indicate data with areas because shapes are cooler than lines and you can arrange them in pretty patterns. </p>

<p>Regardless of the shape chosen, because we have a hard time judging areas, it&#8217;s vitally important that sizes are calculated accurately: namely, proportional to the value they represent. Otherwise the designer is telling lies.</p>

<p>From the Sunday <em>New York Times</em> magazine for February 25, here&#8217;s another mess from Catalogtree. There are plenty of poor choices made here&#8212;note the particularly ugly way that circles have been doodled on top of the text, making the chart look like a printing error. Leave aside also the fact that half the wording consists of dull qualifiers that could have been easily turned into footnotes, and that the designer could think of no better way to arrange the text than just dumping it on the page in a block, as if they didn&#8217;t actually want anyone to read it. These pale into insignificance beside the circles being the wrong size.</p>

<p><img alt="areas_circle_lie.gif" src="http://www.numberpix.com/areas_circle_lie.gif" width="78" height="149"  style="float: right; margin-left: 20px; margin-bottom: 5px; margin-top: 5px;"/>Note the largest value (892) and the fourth largest (436). One is just over twice the size of the other, and a circle twice the size of another should have a diameter &#8730;2 as big: about 1.4 times as wide. The larger circle in the graphic is actually about twice as wide, and it&#8217;s about four times as wide as the 204–225 circles. To see this amount of distortion this creates, compare the original proportions of the two largest circles, (right, top), with the corrected ones (right, below). I bet the designer just halved or doubled the circle diameters rather than actually calculated the areas required, which is pretty inexcusable.</p>

<p>To see the the way it should be done, check out the circles from an earlier New York Times article reporting frequency of the word &#8220;Iraq&#8221; in different presidential State of the Nation addresses. The first circle is twice the area of the second, not twice the width. Note the elegant overlapping too.</p>

<p><img alt="areas_iraq.jpg" src="http://www.numberpix.com/areas_iraq.jpg" width="114" height="297" style="float: right; margin-left: 20px; margin-bottom: 5px; margin-top: 5px;"/></p>

<p>It&#8217;s understandable that some designers might mess up circular areas, as it takes a little algebra to work back from the desired area to calculate the right diameter to use; namely, 2&#8730;(area/&pi;). But even simple squares can defeat a designer&#8217;s math abilities. Below is another example from a much earlier <em>NYT</em> magazine&#8212;the design firm 5W infographic, discussed in a <a href="http://www.numberpix.com/2006/03/stacking_world_conflict.html">previous posting</a>, is no longer used by the <em>Times</em>.</p>

<p><img alt="areas_immigrant.jpg" src="http://www.numberpix.com/areas_immigrant.jpg" width="234" height="450" style="float: left; margin-right: 20px; margin-bottom: 5px; margin-top: 0px;"/></p>

<p>How do we know if there might be a problem with an information graphic? One clue is weird instructions on how one should interpret it; a good data graphic doesn&#8217;t need instructions in how to read it. Note, for example, a key that points out that area only 25% of the original somehow equals 50% of the value. Again, this looks like a math-phobic designer at work. Illustrator&#8217;s Info palette works in widths and heights, not areas, so it&#8217;s easy for a designer to drag to draw a square where one pixel of linear distance, not area, equals 1% in value. Actually calculating the correct widths, from &#8730;area, was obviously too difficult.</p>

<p><img alt="areas_immigrant_key.gif" src="http://www.numberpix.com/areas_immigrant_key.gif" width="57" height="64" style="float: right; margin-left: 20px; margin-bottom: 5px; margin-top: 5px;"/></p>

<p>The result is a graphic that makes large values too large and small ones too small. In this case, the designer chose to use only some of the data in the survey, mostly the pro-US results, so the graphic is not only inaccurate but biased.</p>

<p>It&#8217;s really not even necessary to use a data graphic at all when there are just a few data points, if you arrange them carefully. I tracked down the original report and summarized the questions in tabular form, arranging the columns in a different and more intuitive order. The margin of error was such that I wasn&#8217;t happy pulling out trends with anything more than some boldface type; in fact, that 41 probably shouldn&#8217;t be highlighted at all, since it&#8217;s not really different from the 37. And I had room to add a couple of extra questions. The whole thing could I&#8217;m sure be squeezed into the area the original graphic occupied, but I&#8217;m getting tired of fixing the work of highly-paid, supposedly-professional designers, so will spend the next couple of posts looking at something else and give Catalogtree a break.</p>

<p><img alt="areas_table.gif" src="http://www.numberpix.com/areas_table.gif" width="386" height="337" /></p>
]]></description>
         <link>http://www.numberpix.com/2007/02/deceptive_areas.html</link>
         <guid>http://www.numberpix.com/2007/02/deceptive_areas.html</guid>
         <category>Makeovers</category>
         <pubDate>Sun, 25 Feb 2007 08:25:43 +1100</pubDate>
      </item>
            <item>
         <title>Wacky Wheel of Wedges</title>
         <description><![CDATA[<p><img alt="preacher.gif" src="http://www.numberpix.com/preacher.gif" width="253" height="441"  style="float: left; margin-right: 20px; margin-bottom: 5px; margin-top: 0px;"/>
Not particularly wanting to harsh on the same design company twice, but the New York Times Magazine included another screwed-up chart on Sunday, February 18th. In this one there are only nine actual data points, which could have been adequately shown with a plain bar chart, but that wouldn&#8217;t have looked cool enough, would it? So the designer decided to groove things up by repeating each very thin bar multiple times, and pulling the whole thing into a circle.</p>

<p>Well, it does look exciting and retro. One problem is there&#8217;s no room to label the bars directly, so we have to laboriously go back and forth to puzzle out which is which, distinguishing 9 from upside-down 6 (if your chart <em>requires</em> upside-down reading to interpret, you&#8217;re probably doing something wrong), making pairwise mental rotations of clumps of bars, and so forth. Quite a bit of work for nine data points; a simple table would be clearer.</p>

<p>But it gets worse. Again the designer helpfully annotated the bars with actual data. The first thing I noticed was that 92% looks rather more than four-and-a-bit times as high as 21%. So I traced over one member of each clump in Illustrator, and measured their lengths. (The easiest way to measure the lengths of angled lines in Illustrator is Window > Document Info > Objects. If I wasn&#8217;t intending to reconstruct the graph, I could have just used the Measure tool, of course.) <img alt="preacher2.gif" src="http://www.numberpix.com/preacher2.gif" width="192" height="260" style="float: left; margin-right: 10px; margin-bottom: 5px; margin-top: 10px;" />Sure enough, the bars weren&#8217;t even remotely to scale. I rotated them all to vertical, turning on the invisible grid to help, then typed the actual data into Excel and produced a quick bar chart, and juxtaposed the two (the Excel bars are flipped to make comparison easier).</p>

<p>We can see straight away that the shorter bars are disproportionately small. This has a pretty serious effect on the political slant of the chart, as it minimizes the amount of time the clergy seem to spend holding forth on immigration and stem-cell research, and overemphasizes their sermonizing on hunger and poverty. (I&#8217;d be interested in knowing if clergy really only talked about nine things, and if this is a subset what was left out and why&#8212;but another time). What&#8217;s caused this distortion? It&#8217;s not from stretching the 92% bar too much; there&#8217;s progressive distortion of all the other bars. <img alt="tinypreacher2.gif" src="http://www.numberpix.com/tinypreacher2.gif" width="47" height="86" style="float: right; margin-left: 20px; margin-bottom: 5px; margin-top: 5px;"/> After playing around with scaling&#8212;see the inset&#8212;it seemed more like an arbitrary fixed chunk has been lopped off each bar&#8230; Then I realized that the bars originally ran all the way to the <em>center</em> of the circle. That&#8217;s where we&#8217;re supposed to be mentally measuring to! Go check the original chart&#8212;was this obvious to you? Or were you fooled by the virtual baseline, and the numbers, into thinking the bars stopped there? (Me, I have enough trouble comparing ink I can see without factoring in imaginary ink that I can&#8217;t.) The designer felt they had to fill the middle with white so they could arrange their numerical labels there, and the numbers were only required because the chart&#8217;s groovy circularity left no room for anything better. So the path to the resulting mess seems clear.</p>

<p><img alt="preacher3.gif" src="http://www.numberpix.com/preacher3.gif" width="183" height="290" style="float: right; margin-left: 20px; margin-bottom: 5px; margin-top: 5px;"/>But there&#8217;s another problem. If there were just one bar for each value, we&#8217;d at least all agree we should be comparing their heights. But using multiple bars creates a sort of exploded pie chart, with a wedge for each datum. Pie charts, clunky as they are, are a type of chart most people recognize, and we&#8217;re used to comparing areas, even if we don&#8217;t do it very accurately. But look at the exaggeration caused by mistaking the wedges for pie slices. I traced over the largest and smallest wedges and compared their areas; the larger has what Tufte calls a Lie Factor of 6.9 (doesn&#8217;t that sound imposing?), meaning it&#8217;s nearly seven times as large as it should be.</p>

<p>(An aside for the geeky: there is in fact a way to measure the area of closed paths in Illustrator, though it seems so s00per-seekret that I&#8217;m reluctant to share it. Briefly, open the debug window (on a Mac, command-option-shift-F12), click on Objects and Object Tree to expand then, select your closed path and see it become bold in the object tree, click its bold underlined name, and lots of terrifying numbers will appear in the Objects section, including the area&#8212;in points, I think, but it scarcely matters.)</p>

<p>So a cascade of bad design choices has led to a needless distortion of the relevant data,  for those readers who weren&#8217;t so rebuffed by this wacky graphic that they just skipped it. At least the designer left the numbers on there, or we&#8217;d never even have known. The whole thing could have been avoided with a simple bar graph&#8212;not as sexy, but comprehensible, no larger, and (most importantly) not a big lie.</p>

<p><img alt="preacher_final.gif" src="http://www.numberpix.com/preacher_final.gif" width="250" height="298" /></p>
]]></description>
         <link>http://www.numberpix.com/2007/02/wacky_wheel_of_wedges.html</link>
         <guid>http://www.numberpix.com/2007/02/wacky_wheel_of_wedges.html</guid>
         <category>Makeovers</category>
         <pubDate>Thu, 22 Feb 2007 18:28:33 +1100</pubDate>
      </item>
            <item>
         <title>False Advertising</title>
         <description><![CDATA[<p><img alt="adbuyers.gif" src="http://www.numberpix.com/adbuyers.gif" width="257" height="449" style="float: left; margin-right: 20px; margin-bottom: 5px; margin-top: 5px;"/>Almost every weekend the New York Times Magazine accompanies their first main story with a relevant infographic. They tend to be commissioned from outside agencies, and sometimes lack the good design one sees in <a href="http://www.numberpix.com/2006/05/new_york_times_style.html">most NYT graphs</a>. I&#8217;ve written about bad examples before, on subjects like <a href="http://www.numberpix.com/2006/03/stacking_world_conflict.html">world conflicts</a> and the <a href="http://www.numberpix.com/2006/08/ditch_the_tables_nyt.html">threat posed by Iran</a>. One of the worst ones I&#8217;ve seen in a while appeared in the Sunday Magazine of February 11th.</p>

<p>What&#8217;s so bad about it? Well, there are some pedestrian faults, the sort of things we find in a lot of graphs. The lines are labeled indirectly, with a key, so one has to jump back and forth to interpret them. It&#8217;s possible the designer felt there wasn&#8217;t room to label the lines directly, yet there&#8217;s a lot of wasted horizontal space between each year&#8212;he or she could have easily fitted them in. (And what are those lines anyway? What&#8217;s the difference between &#8220;Automotive Factory and Dealer Associations&#8221; and plain old &#8220;Auto Dealerships&#8221;? Surely one of the designer&#8217;s jobs is to communicate, not just recite corporate spin that calls fast-food vendors &#8220;Quick-Service Restaurants&#8221;.) The colors are pretty, but don&#8217;t signify anything, serving only to distinguish the lines. And it&#8217;s strange that every point has its value labeled; why not just use a <em>y</em>-axis scale? After all, with a general newspaper readership surely it&#8217;s the trends and relative magnitudes that matter here, not the exact values.</p>

<p><img alt="adbuyers_notes.gif" src="http://www.numberpix.com/adbuyers_notes.gif" width="257" height="235" style="float: left; margin-right: 20px; margin-bottom: 0px;" />A good thing the designer did label the points, though, or we wouldn&#8217;t be able to see how misleading the graphic is. Absolute height doesn&#8217;t correspond to value, for example (see <strong>A</strong>). I&#8217;m guessing he or she did this to stop lines 2 and 3 from crossing&#8212;they did cross in inconvenient old reality, but that messes up the pretty pattern. Note that line 1 should be about three times the height of line 2, but I suppose that would create an ugly gap.</p>

<p>Change in height doesn&#8217;t match change in value either. The two lines in <strong>B</strong> correspond to the two changes in line 5. Since both are 0.3 billion dollars, the lines should be the same height, and they obviously aren&#8217;t. Look at the magnitude of change in line 1 as well. It looks like every line is using a different scale, and the designer just made the ends all join up so it looked nice, like a subway map.</p>

<p>And what the heck&#8217;s going on with <strong>C</strong>? 1.2 should equal 1.2! Perhaps it&#8217;s 1.25, and the digit was left off so it would match the others, and incidentally make the graph absurd. Doesn&#8217;t anyone at the Times proofread these things? </p>

<p><img alt="my_adbuyers.gif" src="http://www.numberpix.com/my_adbuyers.gif" width="248" height="305" style="float: left; margin-right: 20px; margin-bottom: 0px;"/> To redo this, I first generated a basic chart in Excel. I pasted this into a background layer in Illustrator, locked it, and just traced over all the components in a new layer (the chart is so simple it&#8217;s hardly worth ungrouping and deleting all the junk that Excel puts in its graphs). I came up with category names that were a bit more meaningful, and created a <em>y</em>-axis, which really only needs to be anchored by a few values.</p>

<p>One thing very obvious now is the dominance of auto advertising. I&#8217;m sure I&#8217;ve oversimplified the two auto categories; I&#8217;d want to see to what extent they overlap or could be lumped. Another thing to note is how wildly the original graph overemphasized changes; increases and decreases now look much more modest (although I wish we had ten or twenty more years of data; it would easily fit in the same space.) The color-coding is still meaningless; perhaps rising and declining categories could be colored differently, or color could encode information, like predominantly-print vs. TV advertising. The main thing, though, is that the graph&#8217;s no longer telling lies. (And we got to keep the groovy rounded orange lines.)</p>
]]></description>
         <link>http://www.numberpix.com/2007/02/false_advertising_1.html</link>
         <guid>http://www.numberpix.com/2007/02/false_advertising_1.html</guid>
         <category>Makeovers</category>
         <pubDate>Sun, 11 Feb 2007 16:21:19 +1100</pubDate>
      </item>
            <item>
         <title>The Scientist’s Rainbow</title>
         <description><![CDATA[<p>How many colors do you need? Color costs money to print, and disappears when you laser print, fax, or photocopy a graphic, and some people (like me) have trouble seeing it. This is why I use grayscale graphics wherever possible, saving color for emphasis (I&#8217;ll say more about this in a future post).</p>

<p>The opposite of this approach is a graphic that uses <em>every color in the visible spectrum</em> for no apparent reason. One finds this so often in computer-generated charts in scientific publications that I call it <strong>The Scientist’s Rainbow</strong>. Once identified, you&#8217;ll see it everywhere. Here&#8217;s a good example.</p>

<p><img alt="rainbow1.jpg" src="http://www.numberpix.com/rainbow1.jpg" width="560" height="582" /></p>

<p>The original had no key to the colors, and they seem to correspond only to the scale on the <em>y</em>-axis. One way of testing whether color is conveying any information is to convert the graphic to grayscale.</p>

<p><img alt="rainbow1b.jpg" src="http://www.numberpix.com/rainbow1b.jpg" width="560" height="582" /></p>

<p>So this color was just for decoration&#8212;yes, rainbows are pretty. But sometimes the Scientist&#8217;s Rainbow is actually impeding communication. The visual spectrum isn&#8217;t arranged in an intuitive order, and if there&#8217;s a scattering of colors it can be pretty hard to extract values for the mess.</p>

<p><img alt="rainbow2.jpg" src="http://www.numberpix.com/rainbow2.jpg" width="560" height="460" /></p>

<p>We sould start by extracting the meaningful stuff from the empty space. I expanded the vertical scale until the pixels were squares, not rectangles, converted everything to grayscale, blurred away the sharp edges, and messed with the contrast a little. </p>

<p><img alt="rainbow2b.jpg" src="http://www.numberpix.com/rainbow2b.jpg" width="528" height="494" /></p>

<p>Note that the lightest areas are not the highest intensity, but correspond to yellow and green in the <em>middle</em> of the spectrum. The highest values on the colored graph are reds, which are as dark as the blues at the other end (so the original would be meaningless as soon as it was photocopied).  We&#8217;d be best to plot the raw data again using a black-to-white scale, although some clever color substitution in Photoshop could also convert a rainbow to shades of gray. That sort of work is best left to interns though, with their love of drudgery.</p>
]]></description>
         <link>http://www.numberpix.com/2007/01/the_scientists_rainbow.html</link>
         <guid>http://www.numberpix.com/2007/01/the_scientists_rainbow.html</guid>
         <category>Makeovers</category>
         <pubDate>Wed, 10 Jan 2007 19:29:28 +1100</pubDate>
      </item>
            <item>
         <title>Ditch the Tables, NYT!</title>
         <description><![CDATA[<p>Tables are often better than charts, especially with just a few data points. But sometimes a graph is what you need. Here&#8217;s an example from the Sunday <em>New York Times</em> magazine of August 6th. It shows American opinons on what to do about Iran.</p>

<p  align="center">
<img alt="nyt-table.gif" src="http://www.numberpix.com/nyt-table.gif" width="250" height="312" />
</p>

<p>Quick, what&#8217;s the message of the table? That most people think we should use diplomacy against Iran? If so, then why show three surveys? There must be some kind of trend here. Wait, why is the timeline going from right to left? OK, there seems to have been a dip in support for military action back in May, or was that a rise in the number of people who don&#8217;t think Iran is a threat? But there&#8217;s a 4% margin of error, isn&#8217;t there? Does that matter?</p>

<p>Let&#8217;s try a graphical depiction. To reflect the 4% margin of error (presumably a 95% confidence interval 4 percentile points either side of the reported figure) I used an 8 pt line, where 1 pt = 1 pixel = 1 per cent when I originally constructed the graph in Illustrator. It would have been better to apply a linear gradient to each line that fades from the midpoint to the edges, but Illustrator can&#8217;t do that very easily.</p>

<p><img alt="nyttablelines.gif" src="http://www.numberpix.com/nyttablelines.gif" width="513" height="173" /></p>

<p>I went into the <a href="http://www.pollingreport.com/iran.htm">public records</a> and added the poll results for February, which are almost exactly the same as June. For some reason the <em>Times</em> left them out, though they had plenty of room. So is the dip in May significant? Did something happen then that the <em>Times</em> should be telling us about? Who knows?</p>

<p>But before we start over-analyzing the data, here are two different polls, both taken in February 2006. Note the difference in responses to essentially the same questions.</p>

<p><img alt="nyttablebars.gif" src="http://www.numberpix.com/nyttablebars.gif" width="474" height="175" /></p>

<p>So the biggest differences are caused by pollsters; any supposed &#8220;trend&#8221; in the <em>Times</em> data is no greater than the difference between two simultaneous polls with slightly different questions. If the media just put a few polls side by side, maybe we&#8217;d lose our ill-founded confidence in them.</p>

<hr />

<p>Why are we polling random Americans about what to do in Iran, anyway? Here&#8217;s the opinion poll <em>I&#8217;d</em> like to administer:  </p>

<ol>
<li>Should the United States take military action against Iran?  </li>
<li>Who&#8217;s the President of Iran, anyway? Starts with A. No googling!  </li>
<li>Name two cities in Iran, and two countries that border it.  </li>
<li>Out of France, China, Israel, and India, which has the most nuclear weapons, and which the fewest?</li>
</ol>

<p>Scoring note: No correct answers for questions 2, 3, or 4 scores an automatic &#8220;N/A&#8221; for question 1.</p>

<p>(Answers: <a href="http://en.wikipedia.org/wiki/President_of_iran">2</a>, <a href="http://www.mideastweb.org/Iran.gif">3</a>, <a href="http://en.wikipedia.org/wiki/List_of_countries_with_nuclear_weapons">4</a>)</p>
]]></description>
         <link>http://www.numberpix.com/2006/08/ditch_the_tables_nyt.html</link>
         <guid>http://www.numberpix.com/2006/08/ditch_the_tables_nyt.html</guid>
         <category>Makeovers</category>
         <pubDate>Mon, 28 Aug 2006 09:20:44 +1100</pubDate>
      </item>
            <item>
         <title>Beach Mouse Pelt Map</title>
         <description><![CDATA[<p>UC San Diego biologist Hopi Hoekstra and her co-authors found that the light-colored beach mice of Florida differed from their darker cousin by a single nucleotide in one gene—at least in Western Florida. The Eastern beach mice seem to have evolved their color some other way. She produced a very nice graphic mapping coat color and light/dark allele frequency (her colleague Bill Lynn did the mouse pelts, in Photoshop).</p>

<p><img alt="beachmouse.gif" src="http://www.numberpix.com/beachmouse.gif" width="550" height="300" /></p>

<p>Doesn&#8217;t this lay out their argument well? I couldn&#8217;t resist making a few changes, of course, because I&#8217;m fussy: </p>

<ul>
<li>Fading back the coastline and pointers so the data stood out</li>
<li>Choosing different and related colors for the mouse ranges</li>
<li>Making the color of the pointers match the ranges</li>
<li>Changing the circle fills to a dark brown</li>
<li>Extending the coastline and range into adjacent states, and labelling the states</li>
<li>Making the state border a little different from the coastline</li>
<li>And, in a bit of typographic pickiness, raising the baseline of all the &#8220;=&#8221; by half a point and putting a thin space either side. </li>
</ul>

<p>(I notice it all looks very Tufte now, with his patented Tufte beige, but that wasn&#8217;t the intention.) If one were to do a serious redesign, my first suggestion would be to move the Oldfield mouse up into Georgia, so that it&#8217;s physically separated from the beach mice, and on land while they&#8217;re in the ocean. Adding a key to the two allele colors would be nice, and would pretty much remove the need for an explanatory caption. But I think the graphic works fine as is.</p>

<p><img alt="beachmousenew.gif" src="http://www.numberpix.com/beachmousenew.gif" width="550" height="313" /></p>

<p>Reference: There&#8217;s a nice <a href="http://www-biology.ucsd.edu/news/article_071806.html">popular article</a> on the findings, and the original paper is: <br />
Hopi E. Hoekstra, Rachel J. Hirschmann, Richard A. Bundey, Paul A. Insel, Janet P. Crossland. 2006. A Single Amino Acid Mutation Contributes to Adaptive Beach Mouse Color Pattern. <em>Science,</em> 313(5783): 101–104. 7 July 2006, DOI: 10.1126/science.1126121. (<a href="http://www.sciencemag.org/cgi/reprint/313/5783/101.pdf">PDF</a>)</p>
]]></description>
         <link>http://www.numberpix.com/2006/08/beach_mouse_pelt_map.html</link>
         <guid>http://www.numberpix.com/2006/08/beach_mouse_pelt_map.html</guid>
         <category>Makeovers</category>
         <pubDate>Mon, 21 Aug 2006 22:11:21 +1100</pubDate>
      </item>
            <item>
         <title>Adding Variables</title>
         <description><![CDATA[<p>Two surveys recently looked at public acceptance of evolution in a range of countries. This one arranged the data in order of decreasing acceptance.</p>

<p><img alt="evolution1.gif" src="http://www.numberpix.com/evolution1.gif" width="421" height="661" /></p>

<p>Note the pretty good color choice: red for rejection, something wishy-washy for &#8220;don&#8217;t know.&#8221; I would have gone with a dark green for acceptance, if only because the red/blue contrast is so over-used. The white spaces between the bars are a little obtrusive. Here&#8217;s what the <em>New York Times</em> did with it in the August 15th issue. They dropped the numbers, and left the &#8220;don&#8217;t know&#8221; category the same color as the white space between bars, which breaks up those obtrusive horizontal lines. Fading back the colors also helped.</p>

<p><img alt="nyt_evn_cropped.png" src="http://www.numberpix.com/nyt_evn_cropped.png" width="298" height="139" /> </p>

<p>How would the graph have been different if the bars had been sorted by <em>increasing rejection</em> of evolution? Greece would bump lower, Japan higher, and there&#8217;d be some shuffling in the middle, but the key point I think the authors wanted to make&#8212;that the USA is way down with Turkey&#8212;would have been preserved. But perusing the list a little raises all sorts of questions. Why are the Czechs and Slovaks more likely to reject evolution than the Bulgarians? Why are the Finns twice as likely as the Swedes and Danes? Poland and Ireland are two of the most religious countries in Europe, but they&#8217;re near the middle of this list. The graphic raises more questions than it answers. Part of the problem is that it&#8217;s only depicting three variables (not four; one category is just the remainder of the other two, as the <em>Times</em> recognized). And two of those are not really independent, having a roughly inverse relationship. So we don&#8217;t have enough information to do any analysis ourselves. </p>

<p>Here&#8217;s another presentation of similar data (I&#8217;ve added a key). </p>

<p><img alt="evolution2.gif" src="http://www.numberpix.com/evolution2.gif" width="500" height="436" /></p>

<p>Now, this is not a slick graphic. I would certainly have played with the spacing, font size, alignment, ALL CAPS labels, and colors. But by choosing a scatter plot instead of a bar chart, and making the country name into the marker, seven variables are being depicted in the same space the previous graph could only manage two-and-a-bit. Moreover, we&#8217;re now equipped to evaluate trends ourselves, look at outliers (the churchgoing Irish, the oddly evolution-rejecting Dutch), and examine our preconceptions (there may be very few atheists in the USA, but that alone doesn&#8217;t account for the lack of acceptance of evolution). And note that the simple take-home message of the first graph is not being sacrificed, either: the USA stands out each time.</p>

<p>(The <a href="http://moses.creighton.edu/JRS/2005/2005-11.html">site itself</a> has another 8 sets of charts, each with a different variable on the y-axis, and you can click through to see them appear one at a time in the same space, for easier comparison.)</p>

<p>We sometimes forget just how much information can be depicted in a data graphic. In a <a href="http://www.numberpix.com/2006/06/the_graphical_octants.html">previous post</a> I noted that exploratory and educational graphics are the extremes of a single axis. But surely a worthy goal is to do both: make your point, and include enough information for the reader to do their own hypothesis testing. Adding some variables is a good way to start.</p>

<p>References: <br />
Jon D. Miller, Eugenie Scott, and Shinji Okomoto. Public Acceptance of Evolution. <em>Science</em>, August 11. 765–766. <a href="http://www.sciencemag.org/cgi/reprint/313/5788/765.pdf">PDF online.</a> <br />
Paul, Gregory S. 2005. Cross-National Correlations of Quantifiable Societal Health with Popular Religiosity and Secularism in the Prosperous Democracies. <em>Journal of Religion and Society</em>, vol. 7. <a href="http://moses.creighton.edu/JRS/2005/2005-11.html">Online</a>.</p>
]]></description>
         <link>http://www.numberpix.com/2006/08/adding_variables.html</link>
         <guid>http://www.numberpix.com/2006/08/adding_variables.html</guid>
         <category>Nice Work</category>
         <pubDate>Mon, 14 Aug 2006 13:34:29 +1100</pubDate>
      </item>
            <item>
         <title>Charting HTML</title>
         <description><![CDATA[<p>The artist/coder Sala, responsible for the <a href="http://www.onethousandpaintings.com/home/">1000 numbers</a> project, has created an applet that turns an web page into a <a href="http://www.aharef.info/2006/05/websites_as_graphs.htm">color-coded diagram</a> that depicts the HTML hierarchy. Here&#8217;s the main page of <em>Pictures of Numbers</em> when it was displaying the Tufte Library post.</p>

<p><img alt="siteasgraph.gif" src="http://www.giantflightlessbirds.com/numberpix/siteasgraph.gif" width="560" height="369" /></p>

<p>Here&#8217;s Sala&#8217;s key to the tags:</p>

<p><img alt="siteasgraphkey.gif" src="http://www.giantflightlessbirds.com/numberpix/siteasgraphkey.gif" width="434" height="153" /></p>

<p>It&#8217;s easy enough to spot out the three book reviews and their covers, the lists of links down the side, and the flurry of forms, text, and image in the colophon.</p>

<p><img alt="siteasgraphnotes.gif" src="http://www.giantflightlessbirds.com/numberpix/siteasgraphnotes.gif" width="560" height="378" /></p>

<p>I know what I&#8217;d like: a label beside each node giving the tag class or ID. That would make untangling endless nested DIV tags so much easier. Why doesn&#8217;t Dreamweaver have a graphical interface to its code?</p>

<p>Some interesting patterns can be seen on Sala&#8217;s site when large sites are compared, and design choices (such as tables vs divs) are revealed&#8212;the results reminded me of the haplotype diagrams my officemate generates as part of her research. If you&#8217;re interested in other graphical depictions of networks, check out the <a href="http://www.visualcomplexity.com/vc/">Visual Complexity</a> site.</p>
]]></description>
         <link>http://www.numberpix.com/2006/08/charting_html.html</link>
         <guid>http://www.numberpix.com/2006/08/charting_html.html</guid>
         <category>Nice Work</category>
         <pubDate>Mon, 07 Aug 2006 13:42:13 +1100</pubDate>
      </item>
            <item>
         <title>A Tufte Library</title>
         <description><![CDATA[<p>To me, the most important books on presenting data graphics are by Edward Tufte. People who create charts as part of their job should keep one or more of them close by, and regularly reread them.</p>

<p><a href="http://www.amazon.com/gp/product/0961392142/qid=1149550726/sr=1-1/ref=sr_1_1/002-2754518-6016001?s=books&amp;v=glance&amp;n=283155"><img alt="vdqi.jpg" src="http://www.giantflightlessbirds.com/numberpix/vdqi.jpg" width="49" height="60" class="cover"/></a>
<span class="title">The Visual Display of Quantitative Information</span>, 2nd ed. <br />
  <span class="author">Edward R. Tufte</span> <br />
  <span class="published">Graphics Press, 2001 (orig. 1983) • ISBN: 0961392142</span></p>

<p>This is the most essential Tufte book, published over 20 years ago but so far ahead of its time it looks absolutely contemporary. (Compare it with some of the other data graphics books from the mid-1980s and you&#8217;ll see). Tufte here introduces his recurring themes: maximizing the data-ink ratio, stripping away unnecessary furniture and &#8220;chartjunk&#8221;, showing all the data, and graying out what&#8217;s less important. There&#8217;s elegant discussion of how readers perceive area changes, why we shouldn&#8217;t think of data as boring stuff that needs livening up, why pie charts suck, and when tables are better than graphs. Some of my favorite examples from here are Minard&#8217;s famous chart of the retreat from Moscow, the step-by-step erasure of needless ink from a bar chart, and the use of one of his own early graphs as an example of poor design. And the whole book is beautifully typeset and produced, with restrained use of color and plenty of white space.</p>

<p><a href="http://www.amazon.com/gp/product/0961392118/qid=1149550758/sr=1-2/ref=sr_1_2/002-2754518-6016001?s=books&amp;v=glance&amp;n=283155">
<img alt="ei.jpg" src="http://www.giantflightlessbirds.com/numberpix/ei.jpg" width="46" height="60" class="cover"/></a>
<span class="title">Envisioning Information </span> <br />
<span class="author">Edward R. Tufte</span> <br />
<span class="published">Graphics Press, 1990 • 0961392118</span> </p>

<p>Similar in subject and tone to the first book, though there&#8217;s more discussion of mapmaking. The most important concept discussed here is <em>small multiples</em>, the fruitful idea that six small graphs instead of one large one can add a new layer of information and reveal large patterns in the data. I&#8217;ve personally found this the most enlightening technique to share with newcomers to information design, and it&#8217;s often the best way to break out of the prison of an unsatisfactory design solution. Tufte also discusses how to use color effectively and with restraint, and how to successively reveal a process in a linked sequence of diagrams (such as dance notation or calligraphy). I particularly liked his coverage of tabular data, from train and bus timetables to the Vietnam War Memorial. Another fine book.</p>

<p><a href="http://www.amazon.com/gp/product/0201353946/ref=pd_sim_b_2/102-0225633-0954523?%5Fencoding=UTF8&amp;v=glance&amp;n=283155">
<img alt="ve.jpg" src="http://www.giantflightlessbirds.com/numberpix/ve.jpg" width="47" height="60" class="cover"/></a>
<span class="title">Visual Explanations</span> <br />
<span class="author">Edward R. Tufte</span> <br />
<span class="published">Graphics Press, 1997• 0961392126</span></p>

<p>This book wanders further out of strict chart-and-graph territory to cover depictions of processes, exemplified by the diagramming of magic tricks; if we understand how misdirection works, we can turn it around to draw the reader&#8217;s attention to what really matters. There are nice examples of the practical consequences of information graphics, discussing their role in the Challenger disaster and John Snow&#8217;s investigation of the London cholera epidemic. The most valuable concept to me was the notion of the smallest effective difference, with a useful discussion of just how little emphasis is needed to make a point clear in a graphic, and the power of using gray shades and faint lines. (There&#8217;s a fun genealogy of rock ’n’ roll too, illustrating parallelism as a graphical tool.) Lovely, but a less immediately practical book than the two preceding it.</p>

<p>Tufte&#8217;s newest, <em>Beautiful Evidence</em>, has just been published, and a review will appear here as soon as my copy arrives.</p>
]]></description>
         <link>http://www.numberpix.com/2006/07/a_tufte_library_1.html</link>
         <guid>http://www.numberpix.com/2006/07/a_tufte_library_1.html</guid>
         <category>Reading List</category>
         <pubDate>Sun, 30 Jul 2006 14:44:39 +1100</pubDate>
      </item>
            <item>
         <title>Better Axes</title>
         <description><![CDATA[<p>A good rule when making graphs is to remove needless impediments. Every extra act of interpretation we ask of the reader is a chance for them to misunderstand, be baffled, or get frustrated and move on. There should be as little standing between the reader and the data as possible. One level of interpretation all readers have to grapple with is the humble axis; here are some guidelines.</p>

<h3>Label directly</h3>

<p>If you can, put units right there on the axis, not on the axis label. In general, getting information out of the label and caption and putting it on the graph where people can see it a good idea. </p>

<p><img alt="axes1.gif" src="http://www.giantflightlessbirds.com/numberpix/axes1.gif" width="559" height="50"  style="margin-top: 10px; margin-bottom: 0px;"/></p>

<p>(By the way, that’s a real degrees sign above; all fonts have one. See <a href="http://www.giantflightlessbirds.com/numberpix/2006/01/the_robin_williams_library.html">Robin Williams</a> for tips on finding it and other special characters. Never try to fake it with a superscripted o!)</p>

<h3>Rotate for readability</h3>

<p><img alt="axes3.gif" src="http://www.giantflightlessbirds.com/numberpix/axes3.gif" width="287" height="171"  style="float: right; margin-left: 20px; margin-bottom: 10px;"/></p>

<p>All the graphing software I know generates vertical labels on the <em>y</em>-axis by default, but these are really almost unreadable. It’s a good idea to make them horizontal wherever possible, moving them to the top if there’s no space to the left. </p>

<p>I’m also a fan of getting axes out of log form. Real units are what we’re used to reading, and forcing people to calculate antilogs in their head increases the risk they’ll misread your numbers. We’re pretty terrible at comparing logarithmic values as it is, so it’s almost deceptive to hide them behind a linear axis.</p>

<p>When you expand units to make them comprehensible, they do take up more room. (For example, see below. But who doesn’t understand what Ma means, I hear you ask? Well, your mother, administrators, your congressman, journalists, and the voting taxpayers or undergraduates who pay your salary perhaps.) One solution is to rotate them a little: -30° in this case. It’s also possible to make the axis smarter; why not show geological periods, for example?</p>

<p><img alt="axes2.gif" src="http://www.giantflightlessbirds.com/numberpix/axes2.gif" width="558" height="54" style="margin-top: 5px; margin-bottom: 0px;"/></p>

<h3>Use sensible units</h3>

<p>If you’re using non-decimal units for some reason don’t use a decimal scale, even if it seems more “scientific”. Why make needless work for the reader? </p>

<p><img alt="axes4.gif" src="http://www.giantflightlessbirds.com/numberpix/axes4.gif" width="560" height="41"  style="margin-top: 10px; margin-bottom: 10px;"/></p>

<p>(The foot and inch marks used here, by the way, are the prime marks, not the typewriter quotes next to the semicolon key—see Robin Williams again.)</p>

<p>When we count days, we think in months and years, not base 10. If you turn the scale into a calendar, it no longer needs a silly axis label like “day of year”.  Of course, always identify months with a word or roman numeral, because 01/05/06 can mean Jan 5 or May 1. We’re all used to reading calendars, so a detailed scale is fine—note that a sufficiently-detailed one doesn’t need an axis line.</p>

<p><img alt="axes5.gif" src="http://www.giantflightlessbirds.com/numberpix/axes5.gif" width="557" height="47" style="margin-top: 10px; margin-bottom: 10px;"/></p>

<h3>You Don’t Have to Start with Zero</h3>

<p><img alt="axes6.gif" src="http://www.giantflightlessbirds.com/numberpix/axes6.gif" width="144" height="131" style="float: right; margin-left: 20px; margin-bottom: 10px;"/>
Honestly.  In some cases, it’s just meaningless, as there is no “value zero” to graph, such as with the days to the right. And having the <em>y</em>-axis pass through 1 means a data point might get tangled up with the tick marks on the axis, so there’s no reason not to leave a small gap.</p>

<p>There are also cases where beginning at zero would add pointless empty space to the graph; consider how little trend we’d be able to see if the graph on the right’s <em>y</em>-axis went from 0 to 110. So the answer is to eliminate empty space from the axis as much as possible without being actively deceptive.</p>

<p>William Cleveland, in <em>The Elements of Graphing Data</em>, often allows the scale to continue below zero, to “avoid interference” between the perpendicular axis and any zero-value data; he uses a dotted reference line to stand in for the dropped axis. Unfortunately, in the following graph this implies that ozone could exist in a concentration of less than zero parts per billion! The space below the reference line is in fact a misleading and uninhabitable no-man’s land.</p>

<p><img alt="axes7.gif" src="http://www.giantflightlessbirds.com/numberpix/axes7.gif" width="560" height="201" /></p>

<p>Leaving a gap so points don't hit the axis is OK, but extending the scale implies data values also continue. These days, with a better color palette, data on the axis no longer has to be a problem. Giving data points a thin white stroke allows them to intersect lines (and each other) while remaining visible; a little better than the jittered circles Cleveland was forced to use. (In the above makeover, you can see I also added units to the axes, rescued the labels from that horrible ALL-CAPS computer font, and condensed the empty space.)</p>

<p>Comments and suggestions welcomed, as always.</p>
]]></description>
         <link>http://www.numberpix.com/2006/06/better_axes.html</link>
         <guid>http://www.numberpix.com/2006/06/better_axes.html</guid>
         <category>Tips and Techniques</category>
         <pubDate>Mon, 26 Jun 2006 14:43:06 +1100</pubDate>
      </item>
            <item>
         <title>The Graphical Octants</title>
         <description><![CDATA[<p>At a recent all-day <a href="http://www.sigmaxi.org/programs/public/im2-1.shtml">workshop</a> devoted to information presentation, some bright sparks dreamed up three axes along which you can position any information graphic.</p>

<p align="center"><br />
<img alt="octant.gif" src="http://www.giantflightlessbirds.com/numberpix/octant.gif" width="400" height="337" /><br />&nbsp;</p>

<p>We normally just assume the <strong>novice</strong>–<strong>expert</strong> distinction is a function of how qualitative or educational a graphic is, so I strongly support putting it on an independent axis. Both novices and experts need <strong>exploratory</strong> graphics, that let them find patterns in the data, and <strong>educational</strong> ones, intended to make a point. These could equally be <strong>quantitative</strong>, where every data value could be read off, or <strong>qualitative</strong>, showing general patterns. </p>

<p>Three variables make a three-dimensional space divided into eight cubical sectors; technically, <em>octants</em> (though everyone at the meeting, shamefully, was calling them quadrants). </p>

<p>Which octant is your graphic in? Now what would the other seven versions of it look like? This is a good exercise, because it makes you realize there isn’t one “best” solution to a design problem, something we can forget if we read too many makeovers (including the ones on this site). It’s a good mental exercise, for you and your client, to try shifting graphics along just one axis, so you can put your finger on exactly which elements are doing which job.</p>

<p>Of course, heuristics like these octants are just mental tools, not the Revealed Truth. For example, are we sure these axes necessarily represent mutually exclusive goals? In the best graphics, it’s possible to display all your data quantitatively, yet still discern overall qualitative trends. Perhaps we could challenge ourselves to make graphics that are comprehensible to lay people (which includes almost <em>everybody</em>, even experts from disciplines just outside yours) while containing levels of richness for the informed; avoiding needlessly technical language and conventions would be a start. And a sufficiently well-crafted graphic would let people find more patterns than just the ones we want them to see, perhaps ones we hadn’t even noticed. How about as a goal for the year trying to <em>span multiple octants</em>? (Now that would make a befuddling bumper sticker.)</p>
]]></description>
         <link>http://www.numberpix.com/2006/06/the_graphical_octants.html</link>
         <guid>http://www.numberpix.com/2006/06/the_graphical_octants.html</guid>
         <category>Tips and Techniques</category>
         <pubDate>Sun, 04 Jun 2006 13:41:14 +1100</pubDate>
      </item>
            <item>
         <title>More on the Planets</title>
         <description><![CDATA[<p>One problem with depicting the solar system in an information graphic is that the enormous sizes and distances are hard to grasp. The usual solution is to use logarithms to compress things, but these can be hard to decipher, and sometimes it&#8217;s just better to show things to scale. Here&#8217;s an elegantly minimalist graph that shows the planetary diameters and distances (using two scales, though&#8212;otherwise you&#8217;d need a somewhat wider monitor), created by someone identified only as &#8220;Brian0918&#8221;.</p>

<p align="center"><img alt="albedobrian.gif" src="http://www.giantflightlessbirds.com/numberpix/albedobrian.gif" width="282" height="437" /></p>

<p>Practically every pixel is data, and even the <i>x</i> axis could be grayed out or dropped entirely, giving an unsurpassable data/ink ratio (which would also help Pluto show up).</p>

<p>The problem element here is the sun, which is so gigantic it swamps everything else. If we were to eliminate it, or just show a chunk, the graphic would become much more concise. I&#8217;ve done this below; note the two scales, implicit in the original but I think necessary here&#8212;one is five thousand times the size of the other.<br />&nbsp;</p>

<p><img alt="diamsemimajor.gif" src="http://www.giantflightlessbirds.com/numberpix/diamsemimajor.gif" width="560" height="151" /><br />&nbsp;</p>

<p>That version&#8217;s OK for print, but for the web one needs slightly more contrast and more solid typefaces.<br />&nbsp;</p>

<p><img alt="diamsemimajor2.gif" src="http://www.giantflightlessbirds.com/numberpix/diamsemimajor2.gif" width="560" height="168" /></p>

<p>The other solution for dealing with scale is breaking the graphic into chunks and recalibrating the scale by a sensible amount in each. <a href="http://www.rense.com/general72/size.htm">These</a> comparisons do this (note the common reference object in each successive picture), and use lovely 3D depictions as well, letting us go beyond the solar system to compare our sun with other stars.</p>

<p><img alt="planetmodels.jpg" src="http://www.giantflightlessbirds.com/numberpix/planetmodels.jpg" width="560" height="316" /></p>

<p>References: Wikipedia has <a href="http://en.wikipedia.org/wiki/Image:Solar_system%2C_equatorial_diameter_versus_semi-major_axis.png">Brian0918&#8217;s chart</a>: if anyone has more information on its origins, please leave a comment. The models were found via <a href="http://www.boingboing.net">BoingBoing</a> at <a href="http://www.rense.com">Rense.com,</a> which goes to show there are jewels in dross on the Web, but I don’t know the site they were taken from originally. And a practical activity that communicates the scale of the solar system to kids is the <a href="http://www.noao.edu/education/peppercorn/pcmain.html">Earth is a Peppercorn</a>.</p>
]]></description>
         <link>http://www.numberpix.com/2006/05/more_on_the_planets.html</link>
         <guid>http://www.numberpix.com/2006/05/more_on_the_planets.html</guid>
         <category>Makeovers</category>
         <pubDate>Wed, 17 May 2006 17:19:55 +1100</pubDate>
      </item>
            <item>
         <title>New York Times Style</title>
         <description><![CDATA[<p>The New York Times generally has excellent information graphics, and uses a distinctive house style. Here are three typical graphs, from <a href="http://www.nytimes.com/2006/05/07/washington/07ohio.html?hp&ex=1147060800&en=4f2d87863fbbaff9&ei=5094&partner=homepage.webloc">this article</a> in the May 7 edition.</p>

<p align="center"><img alt="nyt_margins.gif" src="http://www.giantflightlessbirds.com/numberpix/nyt_margins.gif" width="237" height="263" /><p>

<ul>
<li>Note the strong contrast between headings (in bold caps), graph title (upper and lower case) and series labels (caps).
<li>The same font is used throughout, the same size in axis labels as in graph titles. This means the labels are larger and the graphs smaller than Excel's defaults.
<li>Only two shades of black are used throughout, and no color or patterns.
<li>The shades and labels used here is repeated in the next two graphs (see below), consistently referring to the same series. The choice of shades intuitively corresponds to specific (Ohio) and general (U.S.).
<li>The bars here sensibly overlap instead of being offset as is usual for bar charts.
<li>Direct labelling of sample values is used instead of the little labelled swatches that Excel produces&#151;one less interpretative task for the reader. Note how the sample data is offset, so it can't be confused with real numbers.
<li>Explanations are right on the chart: what's a WIN is noted by the data, instead of in a caption or title.
</ul>

<p align="center"><img alt="nyt_barline.gif" src="http://www.giantflightlessbirds.com/numberpix/nyt_barline.gif" width="353" height="263" /></p>

<ul>
<li>All these graphs are as small as they reasonably can be, yet the bar and especially the line chart contain quite a bit of data.
<li>No frame around the charts, and no background colors or decoration.
<li>The bars have no stroke, just a fill.
<li>No <i>y</i> axis line.
<li>Lightly dotted <i>y</i> gridlines.
<li>Directly labelling one of the values on the <i>y</i> axis with the unit (in this case, percentage, but it could be dollars or kg or anything else) rather than using a rotated axis label (which is usually hard to read and a waste of space).
<li>Abbreviating years as '01 rather than needlessly writing them in full. 
<li>No tickmarks on the <i>x</i> axis; instead, dividers are used between years.
<li>The Ohio series is thinner where it has to be (overlapping bars), but the same width where it doesn't (overlapping lines). All differences should be for a reason; otherwise, keep things the same.
<li>Unobtrusive gaps are used for the U.S. series in the bar chart, just thick enough to tell the bars apart, but not so think the series can't be viewed as a single shape.
</ul>

<p align="center"><img alt="nyt_bars_arrows.gif" src="http://www.giantflightlessbirds.com/numberpix/nyt_bars_arrows.gif" width="196" height="181" /></p>

<p>And note some nice little touches: the % symbol in the <i>y</i> axis label extending into the chart a little, so the numbers align properly; the way the line for the label "OHIO" knocks out the gridline so they don't clash; and how one of the 2004 values is allowed to dip below the <i>x</i> axis.</p>

<p>A very close reading of a well-designed information graphic can yield all sorts of good ideas. Good design <i>should</i> require a close reading to unpack; anything immediately obvious is probably too flashy and getting in the way of the data. These examples show how a chart can be simple, beautiful, and functional at the same time.</p>
</ul>]]></description>
         <link>http://www.numberpix.com/2006/05/new_york_times_style.html</link>
         <guid>http://www.numberpix.com/2006/05/new_york_times_style.html</guid>
         <category>Nice Work</category>
         <pubDate>Sun, 07 May 2006 17:38:48 +1100</pubDate>
      </item>
            <item>
         <title>Fixing Excel’s Charts</title>
         <description><![CDATA[<p>Everyone makes fun of Excel, but sometimes it’s all you have. How can we fix its dreadful graphs?</p>

<p>Here’s a data set showing the male and female body masses in kilograms of the ratite birds. Let’s graph it as a scatter plot. (You can <a href="http://www.giantflightlessbirds.com/numberpix/ratite.xls">download the Excel file</a> and follow along if you like).</p>

<p><img alt="excel1.gif" src="http://www.giantflightlessbirds.com/numberpix/excel1.gif" width="560" height="256" /></p>

<p>There are a few things we can fix while in the Chart Wizard.</p>

<ul>
<li>Type in sensible axis labels and title.</li>
<li>Turn off the gridlines (or, if they’re absolutely necessary, make them pale gray and dotted.)</li>
<li>Turn off the legend, since we only have one data series.</li>
</ul>

<p><img alt="excel2.gif" src="http://www.giantflightlessbirds.com/numberpix/excel2.gif" width="533" height="275" /></p>

<p>Once the chart is placed on the page, this is what you get. Still a lot of work to do, isn’t there? Time to start double-clicking on various elements and altering them.</p>

<p><img alt="excel2b.gif" src="http://www.giantflightlessbirds.com/numberpix/excel2b.gif" width="560" height="335" /></p>

<ul>
<li>Give the plot area no fill and no line, instead of default gray.</li>
<li>Format line color and thickness of each axis; paler and thinner never hurts.</li>
<li>Manually choose the best minimum and maximum numbers for each axis scale to minimize waste space. I made the maximum <em>x</em>-axis value 120.</li>
<li>Change the font for each axis to something readable, turning off auto-scale, that very annoying feature that makes lettering unreadable if you shrink the chart..</li>
<li>Adjust the font and size of all the labels, again turning off auto-scale. I made the heading in Hoefler text plain—bold tends to be over-used in non-professional design, and in fact plain text is often more striking for headings. Georgia is a good alternative to Hoefler Text on Windows machines. Trebuchet in two sizes was used for the axes, since it’s narrow and contrasts well with the heading. Pick the labels up and move them where you need them.</li>
<li>Select the y-axis title and change the alignment so it’s rotated 0&#176;. Insert line breaks as needed to make it narrower.</li>
<li>Select the data points, turn off the default shadow, and use a foreground or background color (the Excel equivalent of stroke and fill); you’d rarely need both. Adjust size or shape as needed.</li>
<li>Start tweaking. Here I resized the plot area so both axes used the same scale (I needed to turn off the <em>x</em>-axis's auto-incrementing). Drawing a square with the drawing tools (shift-rectangle) tu use as a temporary guideline helps get the intervals on each axis the same. I also added a y=x line and text label with the drawing tools. </li>
<li>The next step would be to shift the units to the axes, by floating a “kg” text box next to each one, and then label some of the interesting outliers, such as ostrich and giant moa, with more text boxes. While it’s certainly possible to do all this in Excel, it would be more sensible at this stage to shift to a vector graphics program like Illustrator, and use it to make a self-contained EPS graphic.</li>
</ul>

<p>So that’s a checklist of steps that can easily improve an Excel chart. Below is the cleaned-up version and the Excel default (note what auto-scale did to the labels when I resized the default chart!)</p>

<p><img alt="excel3.gif" src="http://www.giantflightlessbirds.com/numberpix/excel3.gif" width="560" height="420" /></p>

<p>Too much work to do to every graph? I agree. A future posting will provide some already-cleaned-up templates you can download and install.</p>
]]></description>
         <link>http://www.numberpix.com/2006/05/fixing_excels_charts.html</link>
         <guid>http://www.numberpix.com/2006/05/fixing_excels_charts.html</guid>
         <category>Tips and Techniques</category>
         <pubDate>Sun, 07 May 2006 17:36:26 +1100</pubDate>
      </item>
            <item>
         <title>Reflections on the Planets</title>
         <description><![CDATA[<p>Here is a chart from Wilkinson, illustrating the bubble plot method, where a third variable is encoded by the size of the marker. Unfortunately, planets are not a good data set for demonstrating bubble plots; we automatically assume these differently-sized circles are representations of the planets to scale. We&#8217;re also not very good at discriminating between the sizes of small circles: is Earth 0.4 or 0.5? Is Mercury 0.2 or 0.1?</p>

<p><img alt="albedo_wilkinson.gif" src="http://www.giantflightlessbirds.com/numberpix/albedo_wilkinson.gif" width="556" height="518" /></p>

<p>The units are not very friendly, either. Albedo is just the percentage of electromagnetic radiation reflected by a planet, and AU are units equal to the Earth's distance from the sun. Why temperature was chosen as a third variable is unclear (and why use degrees Kelvin?). Sure, it varies linearly with distance from the sun on a log&#8211;log scale, but that&#8217;s no surprise. The only exception is Venus, whose temperature is a consequence of a carbon dioxide atmosphere, not albedo; if anything, its cloud layer lowers the temperature by reflecting sunlight. So the graph is not really telling a coherent story. </p>

<p>What could be improved? We can start by just plotting albedo against distance, using more intuitive units for both. Now some of the variation that was masked by the bubble plot begins to emerge:</p>

<p><img alt="albedochart.gif" src="http://www.giantflightlessbirds.com/numberpix/albedochart.gif" width="560" height="438" /></p>

<p>This is a fairly basic chart, but it still isn&#8217;t telling a story. To flesh it out, I added relative sizes, changed the brightness of the planets to match albedo, and annotated some of the outliers. The graph has become a little too cluttered, because the pattern of data points is being swamped by supplementary information, but at least now there&#8217;s some sort of narrative going on. And it prompts questions, like why does the Earth have such a high albedo when it&#8217;s mostly ocean?</p>

<p><img alt="albedochartannotated2.gif" src="http://www.giantflightlessbirds.com/numberpix/albedochartannotated2.gif" width="560" height="464" /></p>

<p>A little more research, and I realized Earth&#8217;s albedo is mostly determined by cloud cover. That was the key to understanding why Venus and the gas giants were so reflective, and dry balls of rock like Mercury and Mars weren&#8217;t. So I resimplified the chart to make that point, stripping off some of the irrelevant information.

<p><img alt="albedochart2.gif" src="http://www.giantflightlessbirds.com/numberpix/albedochart2.gif" width="560" height="466" /></p>

<p>Now the graph has a point to make, and you can just tell it&#8217;s happier.</p>

<p>References: The original chart is from Leland Wilkinson&#8217;s <i>The Grammar of Graphics</i> (Springer 1999). What piqued my interest was a fascinating discussion of <a href="http://www.donaldedavis.com/2002_addons/SSYCOLRS.html">what color the planets really are.</a> Planetary albedo data are taken from <a href="http://pds.jpl.nasa.gov/planets/special/planets.htm">part of NASA&#8217;s site</a>, Wikipedia supplied the Earth albedo data, which merely lists &#8220;Edward Walker&#8221; as its reference (as does every other site on the internet, blithely copying Wikipedia of course.) The <a href="http://nostalgia.wikipedia.org/wiki/Albedo">old Wikipedia page</a> cites, gulp, &#8220;Walker, E., 1987: Pictures of Preschoolers Out in the Snow. Dishwasher Picture Publishing, Volume 26, 151&#8211;1103.&#8221; So you may want to take those figures with a grain of salt.</p>]]></description>
         <link>http://www.numberpix.com/2006/04/reflections_on_the_planets_1.html</link>
         <guid>http://www.numberpix.com/2006/04/reflections_on_the_planets_1.html</guid>
         <category>Makeovers</category>
         <pubDate>Sun, 09 Apr 2006 12:36:39 +1100</pubDate>
      </item>
            <item>
         <title>Mountains of War</title>
         <description><![CDATA[<p>In the Sunday, March 19th New York Times Magazine, accompanying an article on the decline in global conflict, was the following chart:</p>
<p align="center"><img alt="hsc05nyt.jpg" src="http://www.giantflightlessbirds.com/numberpix/hsc05nyt.jpg" width="184" height="324" /></p>
<p>Apart from the odd terminology (extrastate vs. interstate? And "war between states" means something quite different here in North Carolina...), I was puzzled by the color choice. Was this a stacked area graph, or one where the areas were superimposed, as the color scheme seemed to suggest? I went to the Human Security Centre website and found the original:</p>
<p><img alt="hscoriginal.gif" src="http://www.giantflightlessbirds.com/numberpix/hscoriginal.gif" width="500" height="299" /></p>
<p>It is indeed a stacked area graph. The second paragraph of the caption reads: </p>
<blockquote>Figure 1.1 is a &lsquo;stacked graph&rsquo;, meaning that the number of conflicts in each category is indicated by the depth of the band of colour. The top line indicates the total number of conflicts of all types in each year. Thus in 1946 there were five extrastate conflicts, two interstate conflicts, ten intrastate conflicts, and 17 conflicts in total. </blockquote>
<p>A good rule of thumb is that if you need to spell out how to read the graph, there&rsquo;s a design problem. In this case, we're being misled by the &ldquo;mountain illusion.&rdquo; The human eye is used to distant objects being lighter in color, so sees the area chart as a mountain range, and hence layered.</p>

<p><a href="http://si.ist.psu.edu/pgsit/blogs/node/771&h=768&w=1024&sz=44&tbnid=CzcSQk5o5GKwfM:&tbnh=112&tbnw=150&hl=en&start=27&prev=/images%3Fq%3Dmountains%26start%3D20%26svnum%3D10%26hl%3Den%26lr%3D%26client%3Dsafari%26rls%3Den%26sa%3DN">
<img src="http://www.giantflightlessbirds.com/numberpix/hscmtns.jpg" width="500" height="229" /></a></p>

<p>The New York Times designers, 5W Infographic, exacerbated this illusion by changing the color palette to different percentages of magenta. On the plus side, they simplified the <em>x</em>-axis by only numbering decades, labelled each area directly instead of with a legend, removed the frame, and replaced the <em>x</em>-axis tick marks with pale vertical gridlines.</p>
<p><img src="http://www.giantflightlessbirds.com/numberpix/hscboth.gif" width="500" height="217" /></p>
<p>How to get rid of the mountain illusion? The most obvious solution is to change the color scheme by reversing the light-to-dark direction, but this doesn't seem to help much. Better is a color scheme that flattens out the illusory layering. Making the contrast color as similar as possible to the main color seems to work best.</p>
<p><img src="http://www.giantflightlessbirds.com/numberpix/hsc2colors.gif" width="499" height="261" /></p>
<p>Another option is to change the stacking order, so the smaller series sits on top of the larger, which also fights the mountain illusion. Unfortunately this makes it much harder to discern trends in the smaller series; they're swamped by the larger. Another complication here is that one series disappears, so the trick is to distinguish between the two that remain. I've used a solid line, but a dotted line would be better, and subtle changes in shade best. The final and probably superior alternative is to just unstack everything in Illustrator, by making three graphs and combining them. </p>
<p><img src="http://www.giantflightlessbirds.com/numberpix/hsrstacking.gif" width="498" height="293" /></p>
<p>While unstacking this area graph makes it unambiguous, it&rsquo;s  much harder to see the total conflicts for a year. Does this matter? As always, it depends on the point one is trying to make; in this case, that world conflict has declined. The trend is pretty easy to see, as the majority of the data are in just one series. If total conflicts were important, one could add a line:</p>
<p align="left"><img src="http://www.giantflightlessbirds.com/numberpix/hscunstackedwtotal.gif" width="249" height="277" /></p>
<p>References: The accompanying article was <a href="http://www.humansecurityreport.info/index.php?option=content&task=view&id=197">Wonderful World</a>, by James Traub, NYT Magazine 3/19/2006. The lovely Smoky Mountains photo is by someone called <a href="http://si.ist.psu.edu/pgsit/blogs/node/771&h=768&w=1024&sz=44&tbnid=CzcSQk5o5GKwfM:&tbnh=112&tbnw=150&hl=en&start=27&prev=/images%3Fq%3Dmountains%26start%3D20%26svnum%3D10%26hl%3Den%26lr%3D%26client%3Dsafari%26rls%3Den%26sa%3DN">Melissa</a>, who unfortunately has no contact details, or I'd have asked. The folks at the Human Security Center at the University of British Columbia helpfully supplied the original data from their <a href="http://www.humansecurityreport.info/">Human Security Report 2005</a>. </p>
<p>(Update: charts by 5W Infographics no longer appear in the NYT magazine...)]]></description>
         <link>http://www.numberpix.com/2006/03/stacking_world_conflict.html</link>
         <guid>http://www.numberpix.com/2006/03/stacking_world_conflict.html</guid>
         <category>Makeovers</category>
         <pubDate>Tue, 21 Mar 2006 13:20:50 +1100</pubDate>
      </item>
            <item>
         <title>March of the Monarchs</title>
         <description><![CDATA[<p>Every year, the Journey North Project tracks sightings of the first Monarch butterflies (<i>Danaus plexippus</i>) of spring, and produces maps like this one.</p>

<p><img alt="mm_map2000.gif" src="http://www.giantflightlessbirds.com/numberpix/mm_map2000.gif" width="556" height="332" /></p>

<p align="center"><img alt="mm_legend.gif" src="http://www.giantflightlessbirds.com/numberpix/mm_legend.gif" width="235" height="219" /></p>

<p>The legend shows the first sightings in given two-week periods, but uses overlaid dots, which become a little confusing. The choice of colors is not the best; I had to get very close to my monitor to tell early and late May apart. It could do with some improvement, and Leland Wilkinson in <i>The Grammar of Graphics</i> produces a far superior version, though for an earlier year.</p>

<p><img alt="mm_wilkinson.gif" src="http://www.giantflightlessbirds.com/numberpix/mm_wilkinson.gif" width="560" height="401" /></p>

<p>Could this be further improved? Wilkinson uses the “Scientist&#8217;s Rainbow” in his legend. But since the data are associated with increasing spring temperatures, and the lines are in a definite order, a legend using increasingly warm colors (different saturations of orange, for example) would work better; the original graph does this to some extent. Wilkinson uses colder colors as summer arrives, which seems counter-intuitive. The lines could easily be labelled directly, since the legend is sitting right beside them. Since most of the sightings are Eastern, half the map is blank and could be cropped.</p>

<p><img alt="mm_mymonarchs2000.gif" src="http://www.giantflightlessbirds.com/numberpix/mm_mymonarchs2000.gif" width="560" height="390" /></p>

<p>Instead of ranges, I used a date for the advancing front. Wilkinson’s lines are nonparametrically smoothed contours through the <i>concentration</i> of points in each date range, rather than across the leading edge, because of “random error in the dataset”. The effect is to retard the contour, and it implies that all the points in advance of it are mistakes. Remember, these points are not a population sample from which we’re discerning a mean value, but the edge of a range.</p>

<p>This raises a bigger issue: what’s the purpose of this graphic, anyway? If the goal is to answer the question “When will the Monarchs appear in my state?” then averaging the lines for several successive years would be best&#151;instead of two-week ranges, a less precise time interval would be better like “early April” and “late April”. The data are all online, and the graph is left as an exercise for the reader. Think how useful such a graph will be to our grandchildren, when they’re jaded from seeing Monarchs flitting around in February.</p>

<p>There was a very good <a href="http://www.nytimes.com/2006/10/03/science/03butter.html">article</a> in the October 3rd <em>New York Times</em> science section on Monarch migration. They use a color palette similar to one I suggested above (though these butterflies are heading south), and dots for sightings like the Journey North project. The dotted line is not the wave front of butterflies, as you might think, but a 60°F isotherm&#8212;Monarchs can&#8217;t fly when the temperature drops below 55°F. The isotherm is labeled a little way offscreen, and I missed the label on my first reading of the map, so that would be my only criticism.</p>

<p><img alt="mm_nytimes.jpg" src="http://www.numberpix.com/mm_nytimes.jpg" width="560" height="323" /></p>

<p>References. The Journey North Project archives can be found at at Annenberg/CPB Learner Online (<a href="http://www.learner.org">www.learner.org</a>). Wilkinson’s thorough and analytical book <i>The Grammar of Graphics</i> (Springer, 1999) is also worth a look if you&#8217;re interested in programming graph-generating applications.</p> 
]]></description>
         <link>http://www.numberpix.com/2006/02/march_of_the_monarchs.html</link>
         <guid>http://www.numberpix.com/2006/02/march_of_the_monarchs.html</guid>
         <category>Makeovers</category>
         <pubDate>Wed, 15 Feb 2006 12:21:06 +1100</pubDate>
      </item>
            <item>
         <title>A Robin Williams Library</title>
         <description><![CDATA[<p>Robin Williams (no, not the once-funny-now-supposedly-heartwarming-but-actually-mawkish actor) writes about design and type for non-professionals. Not many years ago, authors did the typing and someone else made the figures and layout look nice. Now we&rsquo;re all suddenly designers. Going to art school isn&rsquo;t an option for most of us, but a crash course in page layout and print production is a good idea if you produce graphics and type as part of your job. A few hours invested in the following  can save you from days of anguish and frustration when your print job doesn&rsquo;t work, or your table columns aren&rsquo;t lining up properly no matter what you do. </p>
<p><a href="http://www.amazon.com/gp/product/0201782634/sr=8-1/qid=1147460180/ref=pd_bbs_1/102-0225633-0954523?%5Fencoding=UTF8"><img alt="macnottypewriter.jpg" src="http://www.giantflightlessbirds.com/numberpix/macnottypewriter.jpg" width="53" height="75" class="cover" /></a><span class="title">The Mac is Not a Typewriter</span> (There's a PC version too)  <br />
  <span class="author">Robin Williams</span><br />
  <span class="published">Peachpit Press, 2003 • ISBN: 0201782634  (PC: 0938151495) </span></p>
<p>This wonderful little book is very short, but so good it should be given out with every computer sold. Get three and pass them around. Williams covers not just basic typography (like why you shouldn&rsquo;t type two spaces after periods) but also some more arcane topics, like hanging punctuation and the line spacing of capitals. What I think anyone who works with text for a living should know about type.</p>
<p><a href="http://www.amazon.com/gp/product/0321193857/ref=pd_bxgy_img_b/102-0225633-0954523?%5Fencoding=UTF8"><img alt="nondesignersdesign.jpg" src="http://www.giantflightlessbirds.com/numberpix/nondesignersdesign.jpg" width="53" height="75"  class="cover" /></a><span class="title">The Non-Designer&rsquo;s Design Book</span><br />
  <span class="author">Robin Williams</span><br />
  <span class="published">Peachpit Press, 2003 • 0321193857 </span></p>
<p>I used this as a standard text for a course on desktop publishing and design. The students with pretensions to being graphic designers pooh-poohed it at first, since it was short and approachable, but were won over at the end. Williams walks you through half a dozen different kinds of typeface, so you&rsquo;re able to mix them confidently on the page. Then she lays out the immortal CRAP principles: contrast, repetition, alignment, and proximity. Just these are enough to  improve most people&rsquo;s layouts about 300%.</p>
<p><a href="http://www.amazon.com/gp/product/0201353946/ref=pd_sim_b_2/102-0225633-0954523?%5Fencoding=UTF8&v=glance&n=283155"><img alt="scanandprint.jpg" src="http://www.giantflightlessbirds.com/numberpix/scanandprint.jpg" width="50" height="75" class="cover" /></a><span class="title">The Non-Designer&rsquo;s Scan and Print Book</span><br />
  <span class="author">Robin Williams and Sandee Cohen</span><br />
  <span class="published">Peachpit Press, 1999 • 0201353946 </span></p>
<p>When creating your own graphics for publication, you always have to deal with file format issues (EPS or PDF? Vector or raster?), color models (CMYK or RGB?), resolution, scanning, paper size, embedding fonts, and so forth. Most people muddle through by following a list of arcane demands from their publisher, but wouldn&rsquo;t it be great if you actually understood all this stuff and could chat with the printer ahead of time about spot colors and line screens? That&rsquo;s what this book is for. </p>
<p>If you enjoy these, I recommend you check out:</p>
<p class="title">How to Boss Your Fonts Around<br />
  The Robin Williams Design Workshop<br />
  The Non-Designer&rsquo;s Web Book <br />
  The Non-Designer&rsquo;s Type Book</p>
<p>All are easy reads, and full of good stuff. Get the most up-to-date edition of each of you can, as this field dates fast. By the way, I don&rsquo;t get a cut from Amazon, and recommend you try a price comparison search on <a href="http://www.addall.com">Addall</a> or <a href="http://www.bestwebbuys.com/books/index.html">BestWebBuys</a>.</p>
]]></description>
         <link>http://www.numberpix.com/2006/01/the_robin_williams_library.html</link>
         <guid>http://www.numberpix.com/2006/01/the_robin_williams_library.html</guid>
         <category>Reading List</category>
         <pubDate>Wed, 11 Jan 2006 12:29:41 +1100</pubDate>
      </item>
            <item>
         <title>March of the Ratites</title>
         <description><![CDATA[<p>The job was a chart that would display the fossil history of the ratites. (Ratites are the giant flightless birds that include the ostrich, emu and the like). There was one previous attempt: the diagram of bird fossil histories from Unwin (1993). It&rsquo;s rather daunting, isn&rsquo;t it? </p>
<p><img alt="unwin.gif" src="http://www.giantflightlessbirds.com/numberpix/unwin.gif" width="500" height="688" /></p>
<p>The chart shows the geological timescale up the left (from the Maastrichtian in the late Cretaceous through to the Holocene and the present day), and when different groups of birds occur in the fossil record (the vertical lines). The timescale implies the different geological stages are all the same length, which is certainly not the case, but there is no scale in years to make this clear. The bird fossils only actually occupy half the timescale, though the gridlines and boxes continue all the way to the bottom of the page. The legend takes up almost as much room as the data, and actually reading information back off the chart requires constantly referring to the key, making comparison fairly difficult.</p>
<img alt="unwinall.gif" src="http://www.giantflightlessbirds.com/numberpix/unwinall.gif" width="500" height="300" />
<p> How could this be improved? For its size, the chart contains very little information; most of it is furniture. For each group, there are only three data points: first appearance, last appearance, and how much of the fossil record is patchy or continuous. I was interested in showing just the ratites (roughly groups 18 to 29), and some of these have no fossil record to speak of, so there was plenty of opportunity to add more information. I wanted to show actual names, localities, a picture of the birds concerned indicating relative sizes, flying ability, whether they were actually ratites or not, and some indication of how many bones had been discovered from each. So I started sketching.</p>
<p><img alt="marchsketch.gif" src="http://www.giantflightlessbirds.com/numberpix/marchsketch.gif" width="500" height="168" />
</p>
<p>I realized I could combine the bone information with the outlines, and indicate flying ability by showing the wings of flying ratites. At this point I put together a geological timeline, to scale, in Illustrator. I decided to show the actual boundary dates of geological periods as well as a linear timescale.</p>
<p><img alt="marchsketch.jpg" src="http://www.giantflightlessbirds.com/numberpix/marchsketch.jpg" width="500" height="128" />
</p>
<p>I roughly sketched each ratite (in some cases reconstructing them from partial skeletons, in other cases working from Googled photographs). I then photocopied them up to full-page size and traced the outline with magic marker. Each ratite was then scanned into Photoshop, the outlines filled in, and the contrast adjusted so only the black silhouette remained. This was then placed into Illustrator as a template. Illustrator has autotrace tools for converting a scanned image into an outline, but I found it much easier to click and drag my way around the silhouette with the pen tool. The nice thing about working large like this is that when you shrink the graphic back down it looks pretty sharp.</p>
<p align="center"><img alt="ostrichsketch.jpg" src="http://www.giantflightlessbirds.com/numberpix/ostrichsketch.jpg" width="300" height="247" />
 </p>
<p>Here&rsquo;s a version of the final graphic. I simplified the crowd of ratites clustered around the Holocene by noting in the text which groups did not have a significant fossil record. The shades of gray indicate how closely-related each species is to modern ratites. </p>
<p><img alt="ratiteslines.gif" src="http://www.giantflightlessbirds.com/numberpix/ratiteslines.gif" width="497" height="210" />
 </p>
<p>Where only partial skeletons are known, bones were added to the silhouettes. Where the fossil record becomes continuous, the line for each group becomes solid. I hoped these conventions would be intuitive enough that the chart would pretty much speak for itself.</p>
<p align="center"><img alt="ratitesbones.gif" src="http://www.giantflightlessbirds.com/numberpix/ratitesbones.gif" width="500" height="300" />
</p>
<p align="left">One side effect is that I now have a bunch of scalable silhouettes I can use to label other diagrams, phylogenetic trees, and the like, so it was worth the effort.</p>
<p align="center"><img alt="tree.gif" src="http://www.giantflightlessbirds.com/numberpix/tree.gif" width="377" height="297" />
</p>
<p>Reference: Unwin, D.M., 1993. Aves. In Benton, M.J. <i>The Fossil Record II.</i> London: Benton & Hall.</ br>
Criticisms of this chart should in no way be seen as a reflection on the expertise of Dave Unwin, a splendid fellow, and not just because he let me hold the Berlin <i>Archaeopteryx</i>.</p>]]></description>
         <link>http://www.numberpix.com/2005/12/march_of_the_ratites.html</link>
         <guid>http://www.numberpix.com/2005/12/march_of_the_ratites.html</guid>
         <category>Makeovers</category>
         <pubDate>Thu, 01 Dec 2005 14:53:48 +1100</pubDate>
      </item>
            <item>
         <title>The Density of Deer Heels</title>
         <description><![CDATA[<p>This graph shows the density of osteocytes, or bone cells, in different parts of the calcaneum (heel) of a mule deer. Cranial, caudal, medial, and lateral translate as front, back, inside, and outside. </p>

<p><img alt="oldcalc.gif" src="http://www.giantflightlessbirds.com/numberpix/oldcalc.gif" /></p>

<p>The biggest problem is that the arrangement of the bars doesn't correspond with the parts of the bone in question. Nor are the colors of the bars meaningful; they're only there to make the legend work. The cryptic y-axis label is just number of osteocytes per square millimeter. The three cases where there are significantly more osteocytes are indicated on the graph with an asterisk (*), the conventional indicator of statistical significance. Note that the asterisks are in a different position in each case; one might even think the the three black bars are being referred to, but in young fawns it's in fact the white bar.</p>
<p> It's so confusing that the caption takes up nearly half as much space as the graph itself.</p>

<p><img alt="oldcalcstext.gif" src="http://www.giantflightlessbirds.com/numberpix/oldcalcstext.gif"  /></p>

<p>How to make this more comprehensible? A good goal would be to reduce the amount of interpretation we were requiring of the reader. I noticed the article had a diagram showing cross-sections of the bone in question, from young fawn to adult.</p>

<p align="center"><img alt="calcoutlines.gif" src="http://www.giantflightlessbirds.com/numberpix/calcoutlines.gif"  /></p>

<p> I scanned them, placed them as a template in Illustrator, then traced around them with the pen tool. For each bone I made a compound path of the inner and outer circles, and sliced it like a pizza with the knife tool into four chunks, each of which could take a different fill.</p>

<p>I noted significant differences right on the bones themselves, and added a color scale, where shade (40% to 80% black) corresponds intuitively to osteocyte density.</p> 

<p align="center"><img alt="calcs.gif" src="http://www.giantflightlessbirds.com/numberpix/calcs.gif" /></p>

<p>The revised graph takes up less room and is pretty much self explanatory. Once the scales and labels had been given, one could even show multiple small versions of the same four bones side by side, each showing osteocyte density under different conditions.</p>

<p>Reference: J. G. Skedros, K. J. Hunt, and R. D. Bloebaum (1995), in the <i>Journal of Morphology</i> 265(2):244-247.</p>]]></description>
         <link>http://www.numberpix.com/2005/11/deer_bones.html</link>
         <guid>http://www.numberpix.com/2005/11/deer_bones.html</guid>
         <category>Makeovers</category>
         <pubDate>Thu, 03 Nov 2005 16:59:00 +1100</pubDate>
      </item>
      
   </channel>
</rss>
