{"id":146,"date":"2019-01-30T18:31:33","date_gmt":"2019-01-30T18:31:33","guid":{"rendered":"https:\/\/www.allendowney.com\/blog\/?p=146"},"modified":"2019-02-04T20:54:45","modified_gmt":"2019-02-04T20:54:45","slug":"data-visualization-for-academics","status":"publish","type":"post","link":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/","title":{"rendered":"Data visualization for academics"},"content":{"rendered":"\n<p>One of the reasons I am excited about the rise of data journalism is that journalists are doing amazing things with visualization.  At the same time, one of my frustrations with academic research is that the general quality of visualization is so poor.<\/p>\n\n\n\n<p>One of the problems is that most academic papers are published in grayscale, so the figures don&#8217;t use color.  But most papers are read in electronic formats now; the world is safe for color!<\/p>\n\n\n\n<p>Another problem is the convention of putting figures at the end, which is an extreme form of <a href=\"https:\/\/style.mla.org\/dont-bury-the-lede\/\">burying the lede<\/a>.  <\/p>\n\n\n\n<p>Also, many figures are generated by software with bad defaults: lines are too thin, <g class=\"gr_ gr_4 gr-alert gr_gramm gr_inline_cards gr_run_anim Grammar only-ins doubleReplace replaceWithoutSep\" id=\"4\" data-gr-id=\"4\">text<\/g> is too small, axis and grids lines are obtrusive, and when colors are used, they tend to be saturated colors that clash. And I won&#8217;t even mention the gratuitous use of 3-D.<\/p>\n\n\n\n<p>But I think the biggest problem is the simplest: the figures in most academic papers do a <strong>poor job of communicating one point clearly<\/strong>.<\/p>\n\n\n\n<p><a href=\"https:\/\/www.allendowney.com\/blog\/2018\/12\/06\/august-causes-adhd\/\">I wrote about one example a few months ago<\/a>, a paper showing that children who start school relatively young are more likely to be diagnosed with ADHD.<\/p>\n\n\n\n<p>Here&#8217;s the figure from <a href=\"https:\/\/www.nejm.org\/doi\/10.1056\/NEJMoa1806828\">the original paper<\/a>:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"795\" height=\"651\" src=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png\" alt=\"\" class=\"wp-image-161\" srcset=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png 795w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31-300x246.png 300w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31-768x629.png 768w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31-330x270.png 330w\" sizes=\"auto, (max-width: 795px) 100vw, 795px\" \/><\/figure>\n\n\n\n<p>How long does it take you to understand the point of this figure?  Now here&#8217;s my representation of the same data:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"424\" height=\"280\" src=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image.png\" alt=\"\" class=\"wp-image-147\" srcset=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image.png 424w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-300x198.png 300w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-409x270.png 409w\" sizes=\"auto, (max-width: 424px) 100vw, 424px\" \/><\/figure>\n\n\n\n<p>I believe this figure is easier to interpret.  Here&#8217;s what I changed:<\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>Instead of plotting the difference between successive months, I plotted the diagnosis rate for each month, which makes it possible to see the pattern (diagnosis rate increases month over month for the first six months, then levels), and the magnitude of the difference (from 60 to 90 diagnoses per 10,000, an increase of about 50%).<\/li><li>I shifted the horizontal axis to put the cutoff date (September 1) at zero.<\/li><li>I added a vertical line and text to distinguish and interpret the two halves of the plot.<\/li><li>I added a title that states the primary conclusion supported by the figure.  Alternatively, I could have put this text in a caption.<\/li><li>I replaced the error bars with a shaded area, which looks better (in my opinion) and appropriately gives less visual weight to less important information.<\/li><\/ol>\n\n\n\n<p>I came across a similar visualization makeover recently. In <a href=\"https:\/\/www.washingtonpost.com\/opinions\/how-just-a-few-dollars-can-keep-many-kids-from-going-to-college\/2019\/01\/24\/9264c5c2-201b-11e9-9145-3f74070bbdb9_story.html?utm_term=.6f6faccfbca6&amp;wpisrc=nl_ideas&amp;wpmm=1\">this Washington Post article<\/a>, Catherine Rampell writes, &#8220;Colleges have been under pressure to admit needier kids. It\u2019s backfiring.&#8221; <\/p>\n\n\n\n<p>Her article is based on <a href=\"https:\/\/siepr.stanford.edu\/research\/publications\/measuring-opportunity-us-higher-education\">this academic paper<\/a>; here&#8217;s the figure from the original paper:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"613\" height=\"915\" src=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-1.png\" alt=\"\" class=\"wp-image-148\" srcset=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-1.png 613w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-1-201x300.png 201w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-1-181x270.png 181w\" sizes=\"auto, (max-width: 613px) 100vw, 613px\" \/><\/figure>\n\n\n\n<p>It&#8217;s sideways, it&#8217;s on page 29, and it fails to make its point.  So Rampell designed a better figure.  Here&#8217;s the figure from her article:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/www.washingtonpost.com\/resizer\/KFD_gEY-jMtC2ygYys6NebMRA8Y=\/1294x0\/arc-anglerfish-washpost-prod-washpost.s3.amazonaws.com\/public\/ASO3J4BAGAI6TC2ZBIUPEGIRGE.png\" alt=\"\"\/><figcaption><br><\/figcaption><\/figure>\n\n\n\n<p>The title explains what the figure shows clearly: enrollment rates are highest for low-income students that qualify for Pell grants and lowest for low-income students who don&#8217;t qualify for Pell grants.<\/p>\n\n\n\n<p>To nitpick, I might have plotted this data with a line rather than a bar chart, and I might have used a less saturated color.  But more importantly, this figure makes its point clearly and compellingly.<\/p>\n\n\n\n<p>Here&#8217;s one last example, and a challenge: <a href=\"https:\/\/osf.io\/preprints\/socarxiv\/j2tw9\/\"><g class=\"gr_ gr_21 gr-alert gr_spell gr_inline_cards gr_run_anim ContextualSpelling ins-del multiReplace\" id=\"21\" data-gr-id=\"21\">this<\/g> recent paper<\/a> reports, &#8220;the number of scale points used in faculty teaching evaluations (e.g., whether instructors are rated on a scale of 6 vs. a scale of 10) substantially affects the size of the gender gap in evaluations.&#8221;<\/p>\n\n\n\n<p>To demonstrate this effect, they show eight histograms on pages 44 and 45.  Here&#8217;s page 44:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"832\" height=\"839\" src=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-2.png\" alt=\"\" class=\"wp-image-155\" srcset=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-2.png 832w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-2-150x150.png 150w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-2-297x300.png 297w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-2-768x774.png 768w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-2-268x270.png 268w\" sizes=\"auto, (max-width: 832px) 100vw, 832px\" \/><\/figure>\n\n\n\n<p>And here&#8217;s page 45:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"821\" height=\"836\" src=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-3.png\" alt=\"\" class=\"wp-image-156\" srcset=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-3.png 821w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-3-295x300.png 295w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-3-768x782.png 768w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/image-3-265x270.png 265w\" sizes=\"auto, (max-width: 821px) 100vw, 821px\" \/><\/figure>\n\n\n\n<p>With some guidance from the captions, we can extract the message: <\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>Under the 6-point system, there is no visible difference between ratings for male and female instructors.<\/li><li>Under the 10-point system, in the least male-dominated subject areas, there is no visible difference. <\/li><li>Under the 10-point system, in the most male-dominated subject areas, there is a visibly obvious difference: students are substantially less likely to give female instructors a 9 or 10.<\/li><\/ol>\n\n\n\n<p>This is an important result \u2014 it makes me want to read the previous 43 pages. And the visualizations are not bad \u2014 they show the effect clearly, and it is substantial.<\/p>\n\n\n\n<p>But I still think we could do better. So let me pose this challenge to readers: Can you design <g class=\"gr_ gr_7 gr-alert gr_gramm gr_inline_cards gr_disable_anim_appear Grammar only-del replaceWithoutSep\" id=\"7\" data-gr-id=\"7\">a visualization<\/g> of this data that communicates the results so that<\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>Readers can see the effect quickly and easily, and<\/li><li>Understand the magnitude of the effect in practical terms?<\/li><\/ol>\n\n\n\n<p>You can get the data you need from the figures, at least approximately. And your visualization doesn&#8217;t have to be fancy; you can send something hand-drawn if you want. The point of the exercise is the design, not the details.<\/p>\n\n\n\n<p>I will post submissions in a few days.  If you send me something, let me know how you would like to be acknowledged.<\/p>\n\n\n\n<p>UPDATE: We discussed this example in class today and I presented one way we could summarize and visualize the data:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"432\" height=\"288\" src=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/02\/gender_bias-1.png\" alt=\"\" class=\"wp-image-166\" srcset=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/02\/gender_bias-1.png 432w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/02\/gender_bias-1-300x200.png 300w, https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/02\/gender_bias-1-405x270.png 405w\" sizes=\"auto, (max-width: 432px) 100vw, 432px\" \/><figcaption>Students in the most male-dominated fields are less likely to give female instructors top scores, but only on a 10-point scale.  The effect does not appear on a 6-point scale.<\/figcaption><\/figure>\n\n\n\n<p>There are definitely things to do to improve this, but I generated it using Pandas with minimal customization.  All the code is in <a href=\"https:\/\/github.com\/AllenDowney\/ThinkStats2\/blob\/master\/examples\/gender_bias_example.ipynb\">this Jupyter notebook<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>One of the reasons I am excited about the rise of data journalism is that journalists are doing amazing things with visualization. At the same time, one of my frustrations with academic research is that the general quality of visualization is so poor. One of the problems is that most academic papers are published in grayscale, so the figures don&#8217;t use color. But most papers are read in electronic formats now; the world is safe for color! Another problem is&#8230;<\/p>\n<p class=\"read-more\"><a class=\"btn btn-default\" href=\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/\"> Read More<span class=\"screen-reader-text\">  Read More<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[1],"tags":[4,5],"class_list":["post-146","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-data-science","tag-visualization"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Data visualization for academics - Probably Overthinking It<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Data visualization for academics - Probably Overthinking It\" \/>\n<meta property=\"og:description\" content=\"One of the reasons I am excited about the rise of data journalism is that journalists are doing amazing things with visualization. At the same time, one of my frustrations with academic research is that the general quality of visualization is so poor. One of the problems is that most academic papers are published in grayscale, so the figures don&#8217;t use color. But most papers are read in electronic formats now; the world is safe for color! Another problem is... Read More Read More\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/\" \/>\n<meta property=\"og:site_name\" content=\"Probably Overthinking It\" \/>\n<meta property=\"article:published_time\" content=\"2019-01-30T18:31:33+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2019-02-04T20:54:45+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png\" \/>\n<meta name=\"author\" content=\"AllenDowney\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@AllenDowney\" \/>\n<meta name=\"twitter:site\" content=\"@AllenDowney\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"AllenDowney\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/\"},\"author\":{\"name\":\"AllenDowney\",\"@id\":\"https:\/\/www.allendowney.com\/blog\/#\/schema\/person\/4e5bfb2e9af6c3446cb0031a7bf83207\"},\"headline\":\"Data visualization for academics\",\"datePublished\":\"2019-01-30T18:31:33+00:00\",\"dateModified\":\"2019-02-04T20:54:45+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/\"},\"wordCount\":862,\"publisher\":{\"@id\":\"https:\/\/www.allendowney.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png\",\"keywords\":[\"data science\",\"visualization\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/\",\"url\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/\",\"name\":\"Data visualization for academics - Probably Overthinking It\",\"isPartOf\":{\"@id\":\"https:\/\/www.allendowney.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png\",\"datePublished\":\"2019-01-30T18:31:33+00:00\",\"dateModified\":\"2019-02-04T20:54:45+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#primaryimage\",\"url\":\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png\",\"contentUrl\":\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png\",\"width\":795,\"height\":651},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.allendowney.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Data visualization for academics\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.allendowney.com\/blog\/#website\",\"url\":\"https:\/\/www.allendowney.com\/blog\/\",\"name\":\"Probably Overthinking It\",\"description\":\"Data science, Bayesian Statistics, and other ideas\",\"publisher\":{\"@id\":\"https:\/\/www.allendowney.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.allendowney.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.allendowney.com\/blog\/#organization\",\"name\":\"Probably Overthinking It\",\"url\":\"https:\/\/www.allendowney.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.allendowney.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2025\/03\/probably_logo.png\",\"contentUrl\":\"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2025\/03\/probably_logo.png\",\"width\":714,\"height\":784,\"caption\":\"Probably Overthinking It\"},\"image\":{\"@id\":\"https:\/\/www.allendowney.com\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/x.com\/AllenDowney\",\"https:\/\/www.linkedin.com\/in\/allendowney\/\",\"https:\/\/bsky.app\/profile\/allendowney.bsky.social\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.allendowney.com\/blog\/#\/schema\/person\/4e5bfb2e9af6c3446cb0031a7bf83207\",\"name\":\"AllenDowney\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.allendowney.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/fb01b3a7f7190bea1bbf7f0852e686c2f8c03b099222df2ce4bc7926f15bcb43?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/fb01b3a7f7190bea1bbf7f0852e686c2f8c03b099222df2ce4bc7926f15bcb43?s=96&d=mm&r=g\",\"caption\":\"AllenDowney\"},\"url\":\"https:\/\/www.allendowney.com\/blog\/author\/allendowney_6dbrc4\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Data visualization for academics - Probably Overthinking It","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/","og_locale":"en_US","og_type":"article","og_title":"Data visualization for academics - Probably Overthinking It","og_description":"One of the reasons I am excited about the rise of data journalism is that journalists are doing amazing things with visualization. At the same time, one of my frustrations with academic research is that the general quality of visualization is so poor. One of the problems is that most academic papers are published in grayscale, so the figures don&#8217;t use color. But most papers are read in electronic formats now; the world is safe for color! Another problem is... Read More Read More","og_url":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/","og_site_name":"Probably Overthinking It","article_published_time":"2019-01-30T18:31:33+00:00","article_modified_time":"2019-02-04T20:54:45+00:00","og_image":[{"url":"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png","type":"","width":"","height":""}],"author":"AllenDowney","twitter_card":"summary_large_image","twitter_creator":"@AllenDowney","twitter_site":"@AllenDowney","twitter_misc":{"Written by":"AllenDowney","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#article","isPartOf":{"@id":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/"},"author":{"name":"AllenDowney","@id":"https:\/\/www.allendowney.com\/blog\/#\/schema\/person\/4e5bfb2e9af6c3446cb0031a7bf83207"},"headline":"Data visualization for academics","datePublished":"2019-01-30T18:31:33+00:00","dateModified":"2019-02-04T20:54:45+00:00","mainEntityOfPage":{"@id":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/"},"wordCount":862,"publisher":{"@id":"https:\/\/www.allendowney.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#primaryimage"},"thumbnailUrl":"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png","keywords":["data science","visualization"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/","url":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/","name":"Data visualization for academics - Probably Overthinking It","isPartOf":{"@id":"https:\/\/www.allendowney.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#primaryimage"},"image":{"@id":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#primaryimage"},"thumbnailUrl":"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png","datePublished":"2019-01-30T18:31:33+00:00","dateModified":"2019-02-04T20:54:45+00:00","breadcrumb":{"@id":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#primaryimage","url":"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png","contentUrl":"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-31-16-39-31.png","width":795,"height":651},{"@type":"BreadcrumbList","@id":"https:\/\/www.allendowney.com\/blog\/2019\/01\/30\/data-visualization-for-academics\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.allendowney.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Data visualization for academics"}]},{"@type":"WebSite","@id":"https:\/\/www.allendowney.com\/blog\/#website","url":"https:\/\/www.allendowney.com\/blog\/","name":"Probably Overthinking It","description":"Data science, Bayesian Statistics, and other ideas","publisher":{"@id":"https:\/\/www.allendowney.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.allendowney.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.allendowney.com\/blog\/#organization","name":"Probably Overthinking It","url":"https:\/\/www.allendowney.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.allendowney.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2025\/03\/probably_logo.png","contentUrl":"https:\/\/www.allendowney.com\/blog\/wp-content\/uploads\/2025\/03\/probably_logo.png","width":714,"height":784,"caption":"Probably Overthinking It"},"image":{"@id":"https:\/\/www.allendowney.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/AllenDowney","https:\/\/www.linkedin.com\/in\/allendowney\/","https:\/\/bsky.app\/profile\/allendowney.bsky.social"]},{"@type":"Person","@id":"https:\/\/www.allendowney.com\/blog\/#\/schema\/person\/4e5bfb2e9af6c3446cb0031a7bf83207","name":"AllenDowney","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.allendowney.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/fb01b3a7f7190bea1bbf7f0852e686c2f8c03b099222df2ce4bc7926f15bcb43?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/fb01b3a7f7190bea1bbf7f0852e686c2f8c03b099222df2ce4bc7926f15bcb43?s=96&d=mm&r=g","caption":"AllenDowney"},"url":"https:\/\/www.allendowney.com\/blog\/author\/allendowney_6dbrc4\/"}]}},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[{"id":140,"url":"https:\/\/www.allendowney.com\/blog\/2019\/01\/18\/the-library-of-data-visualization\/","url_meta":{"origin":146,"position":0},"title":"The library of data visualization","author":"AllenDowney","date":"January 18, 2019","format":false,"excerpt":"Getting ready for my Data Science class (starting next week!) I am updating my data visualization library, looking for resources to help students learn about visualization. Last week I asked Twitter to help me find resources, especially new ones.\u00a0 Here's the thread.\u00a0\u00a0Thank you to everyone who responded! I'll try to\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/01\/Screenshot-at-2019-01-18-09-56-35-300x3.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":201,"url":"https:\/\/www.allendowney.com\/blog\/2019\/04\/01\/local-regression-in-python\/","url_meta":{"origin":146,"position":1},"title":"Local regression in Python","author":"AllenDowney","date":"April 1, 2019","format":false,"excerpt":"I love data visualization make-overs (like this one I wrote a few months ago), but sometimes the tone can be too negative (like this one I wrote a few months ago). Sarah Leo, a data journalist at The Economist, has found the perfect solution: re-making your own visualizations. Here's her\u2026","rel":"","context":"In \"local regression\"","block_context":{"text":"local regression","link":"https:\/\/www.allendowney.com\/blog\/tag\/local-regression\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2019\/04\/image.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":1097,"url":"https:\/\/www.allendowney.com\/blog\/2023\/11\/07\/we-have-a-book\/","url_meta":{"origin":146,"position":2},"title":"We Have a Book!","author":"AllenDowney","date":"November 7, 2023","format":false,"excerpt":"My copy of Probably Overthinking It has arrived! If you want a copy for yourself, you can get a 30% discount if you order from the publisher and use the code UCPNEW. You can also order from Amazon or, if you want to support independent bookstores, from Bookshop.org. The official\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2023\/11\/IMG_20231107_104338_7.jpg?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2023\/11\/IMG_20231107_104338_7.jpg?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2023\/11\/IMG_20231107_104338_7.jpg?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2023\/11\/IMG_20231107_104338_7.jpg?resize=700%2C400&ssl=1 2x, https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2023\/11\/IMG_20231107_104338_7.jpg?resize=1050%2C600&ssl=1 3x, https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2023\/11\/IMG_20231107_104338_7.jpg?resize=1400%2C800&ssl=1 4x"},"classes":[]},{"id":865,"url":"https:\/\/www.allendowney.com\/blog\/2023\/06\/25\/go-get-the-data\/","url_meta":{"origin":146,"position":3},"title":"Go Get the Data","author":"AllenDowney","date":"June 25, 2023","format":false,"excerpt":"My mantra when I was working on Probably Overthinking It was \"Go Get the Data.\" If I wanted to use a result from prior work, I would get the data whenever possible and make my own visualization. Of course, that's more work than copying and pasting a figure, but there\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2023\/02\/pinker452.png?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2023\/02\/pinker452.png?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2023\/02\/pinker452.png?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2023\/02\/pinker452.png?resize=700%2C400&ssl=1 2x"},"classes":[]},{"id":1541,"url":"https:\/\/www.allendowney.com\/blog\/2025\/04\/06\/announcing-think-stats-3e\/","url_meta":{"origin":146,"position":4},"title":"Announcing Think Stats 3e","author":"AllenDowney","date":"April 6, 2025","format":false,"excerpt":"The third edition of Think Stats is on its way to the printer! You can preorder now from Bookshop.org and Amazon (those are affiliate links), or if you can't wait to get a paper copy, you can read the free, online version here. Here's the new cover, still featuring a\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2025\/04\/image-2.png?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2025\/04\/image-2.png?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2025\/04\/image-2.png?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/www.allendowney.com\/blog\/wp-content\/uploads\/2025\/04\/image-2.png?resize=700%2C400&ssl=1 2x"},"classes":[]},{"id":308,"url":"https:\/\/www.allendowney.com\/blog\/2019\/09\/19\/what-should-i-do\/","url_meta":{"origin":146,"position":5},"title":"What should I do?","author":"AllenDowney","date":"September 19, 2019","format":false,"excerpt":"I am planning to be on sabbatical from June 2020 to August 2021, so I am thinking about how to spend it. Let me tell you what I can do, and you can tell me what I should do. Data Science I consider myself a data scientist, but that means\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/www.allendowney.com\/blog\/wp-json\/wp\/v2\/posts\/146","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.allendowney.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.allendowney.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.allendowney.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.allendowney.com\/blog\/wp-json\/wp\/v2\/comments?post=146"}],"version-history":[{"count":8,"href":"https:\/\/www.allendowney.com\/blog\/wp-json\/wp\/v2\/posts\/146\/revisions"}],"predecessor-version":[{"id":167,"href":"https:\/\/www.allendowney.com\/blog\/wp-json\/wp\/v2\/posts\/146\/revisions\/167"}],"wp:attachment":[{"href":"https:\/\/www.allendowney.com\/blog\/wp-json\/wp\/v2\/media?parent=146"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.allendowney.com\/blog\/wp-json\/wp\/v2\/categories?post=146"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.allendowney.com\/blog\/wp-json\/wp\/v2\/tags?post=146"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}