Summary
In this week’s episode, we delve into the pivotal role of visual clarity in scientific research. Join me and Professor of Clinical Epidemiology Maarten Boers as we discuss his new book, Data Visualization for Biomedical Scientists. If you are at all interested in being a better science communicator—and especially if you are interested in publishing your work in academic journals—this episode is for you! We talk about how Maarten’s book extends beyond the world of biomedical science into good table design, small multiples, and how academic publishing needs to get its act in order.
Topics Discussed
- The Necessity of Clear Experimental Procedures: We highlight the significance of understanding every step within an experiment. Our discussion unpacks the ways in which clear, precise procedures facilitate reproducibility and validation of scientific work.
- Deciphering Scientific Terminology: Maarten’s book emphasizes the importance of demystifying complex scientific jargon. We examine strategies for breaking down terminology barriers for both specialist and general audiences.
- Graphical Excellence in Research Communication: We focus on the power of well-titled, labeled, and annotated graphs in conveying research and analysis.
- Impactful Captions and Visual Storytelling: Captions are more than mere descriptions—they’re a gateway to engaging the reader. We explore how to craft active captions that not only inform but also captivate and retain the reader’s attention.
- Challenges in Academic Publishing: We confront the practical challenges researchers often encounter with journals, their design (or lack thereof), and other publishing pitfalls. We talk about how to effectively intervene when production staff mishandle figures and how to work within the constraints of journal page limits.
- Ensuring Accuracy in the Publication Process: Our conversation also touches on the responsibilities of researchers to ensure their findings are presented accurately and effectively, even in the final stages of publication.
Resources
Book: Data Visualization for Biomedical Scientists
Website: Amsterdam UMC
Check out Maarten’s new data visualization course at EpidM. The course includes:
- 4 hours of prep time on your own and 4 hours of interactive teaching
- Personalized feedback on your own material
- Next course March 2024
- Reserve your spot
Guest Bio
University Medical Centers (1997–2023), and rheumatologist (recently retired as a staff rheumatologist in the Reade Institute). He is director of Epiconsult BV, that hosts his consulting activities. He holds a PhD in Medicine from the University of Leiden, Netherlands and an MSc in Clinical Epidemiology from McMaster University, Hamilton, ON, Canada. Dr. Boers’ main research interests have focused on treatment of rheumatoid arthritis and on outcome measures in clinical research including the early intervention and combination therapy in rheumatoid arthritis (principal investigator of the COBRA trial and follow up studies) and the rehabilitation of glucocorticoids as essential treatment (principal investigator of the GLORIA trial published in 2022). He co-founded the OMERACT (Outcome Measures in Rheumatology) consensus initiative in 1992 (www.omeract.org). He has (co)authored more than 500 full papers on topics both in and outside of rheumatology.
Related Episodes
Sponsor: Tube Buddy
TubeBuddy is the main tool that I use to manage the backend of my YouTube channel. It has been a huge lifesaver when it comes to time management and properly optimizing my channel. They have tons of free tools you can check out right here!
Different Ways to Listen to the Show!
New Ways to Support the Show!
With more than 250 guests and 10 seasons of episodes, the PolicyViz Podcast is one of the longest-running data visualization podcasts around. You can support the show by downloading and listening, following the work of my guests, and sharing the show with your networks. If you’re interested in financially supporting the show, you can sign up for my Patreon platform, make a one-time payment via PayPal, or shop one of the show’s sponsors.
Transcript
00:12 – 00:17
Welcome back to the PolicyViz Podcast. I’m your host, Jon Schwabish. Thanks for tuning in
00:18 – 00:23
to the show. On this week’s episode of the podcast, I welcome Maarten Boers, scientist who’s
00:23 – 00:29
written a great new book on visualizing data, something I really am interested in, obviously,
00:29 – 00:34
and I’m sure you are too having listening to this podcast. Now Why should you read Maarten’s
00:34 – 00:38
book and listen to this podcast in addition to all the other great books that you’re reading?
00:38 – 00:44
I think there are 2 big things to consider with this book. 1st, He spends a lot of time talking
00:44 – 00:48
about table design, and we’re gonna talk about it in the interview. It’s something that doesn’t
00:48 – 00:53
really get a lot of attention in the data visualization library. So from my perspective, it’s
00:53 – 00:59
really Maarten’s new book, my book, and Steven Fuse’s book are really the few books that talk
00:59 – 01:02
about table design. So that’s one thing. If you’re interested in better tables, this is the
01:02 – 01:07
episode for you. The other thing that we talk about a lot is small multiples boers what Maarten
01:07 – 01:12
calls matrix graphs. So we talk about good styles and designs and ideas and concepts about creating
01:13 – 01:18
better small multiple charts, Which if you don’t know are exactly what they sound like, a combination
01:18 – 01:24
of smaller multiple charts. And the last thing that we talked about that is really interesting
01:24 – 01:30
for me, From my perspective as someone who’s, worked in publishing in the academic field and
01:30 – 01:35
in academic journals is how the academic publishing doesn’t focus on data visualization and
01:35 – 01:41
how that’s been a detriment to the ability to communicate, research, and scientific findings
01:41 – 01:46
more effectively. So there’s a lot of information and guidance packed into this interview and
01:46 – 01:52
into Maarten’s book. So I hope you’ll enjoy this week’s episode of the podcast, my interview
01:52 – 01:59
with Maarten Boers, starts right now. Hi, Maarten. Good. Well, afternoon, your time. Morning,
01:59 – 02:01
my time. How are you? Happy New Year. Good to see you.
02:02 – 02:07
Hi. Good to see you too. Thank you for inviting me to do this, interview. I’m really excited.
02:07 – 02:12
This is, terrific. I’m excited to chat with you. I’ve got your book, Data visualization for
02:12 – 02:18
biomedical scientists. We’ll talk about how it’s not just for biomedical scientists. And there
02:18 – 02:24
are some unique pieces in your book that you don’t find in many other data visualization boers.
02:24 – 02:29
So so I’m excited to chat with you. So, Maybe we can start with, your background, and then we
02:29 – 02:34
just jump right in to talk about the book. The first question I have about the book is, What
02:34 – 02:40
are the unique challenges with visualizing data that that biomedical scientists face? And then
02:40 – 02:44
I think we can because the book is definitely broader than than just that, but I think We can
02:44 – 02:48
start there. So, so kind of 2 questions for you. Just just kick the whole thing off.
02:48 – 02:56
Yeah. Well, I I mean, To be really honest, I don’t think there are unique challenges to biomedical
02:57 – 03:06
science. The reason I decided to focus on what is my own field is that there’s lots of general
03:06 – 03:14
data visualization books around, and Some of them also address scientific papers and and and
03:14 – 03:24
presentations. And in the process of writing my book, I had some feedback from some, very notable
03:24 – 03:31
people in the field, And and one of them said, you know, why write another book on data visualization?
03:31 – 03:36
I mean, look at my book. That’s that’s all people need.
03:36 – 03:38
Right now. Yeah. Right. Yeah. Yeah.
03:38 – 03:43
And I was I was I was writing this book because Even though that book was great, and there are
03:43 – 03:49
many great books out there, and I, you know, I reference them all in my book. It’s not like
03:50 – 03:56
I I suddenly invented this field, but, I was struggling with what I could not find in those
03:56 – 04:02
books that were day to day issues in the papers I was trying to write or mostly I was trying
04:02 – 04:10
to supervise in in my in my my fellows and finding out that that There was no guidance at all,
04:10 – 04:16
and everybody did it wrong, including senior scientists. Nothing was being Corrected from the
04:16 – 04:22
journals. If you look at the journal templates, you know, 99% are simply horrible and could
04:22 – 04:28
be easily improved. So I started, you know, Writing about that and writing to journal editors
04:28 – 04:34
and and when I was on editorial board, spending time to to improve papers and writing guidelines,
04:34 – 04:40
and, You know, that sort of expanded into something, then I decided, you know, COVID came along.
04:41 – 04:47
Mhmm. Let’s let’s write a book. So that’s Basically, how it happened. But I think and and and
04:47 – 04:52
biomedical science is my field, so I know what’s wrong Jon my field. And if I were to say, okay,
04:53 – 04:59
and by the way, you know, economics or another field, they have poor graphs as well. Mhmm. Then
04:59 – 05:04
I didn’t really have to go and study those journals to make sure that I wasn’t making a fool
05:04 – 05:08
of myself, and people would fall all over me. Oh, you haven’t seen me done this and that. I
05:08 – 05:15
know my field. Right. So I thought, you know, if I stick to that, nobody can blame me for going
05:15 – 05:18
outside of my, comfort zone. So so
05:18 – 05:22
but let’s talk about that a little bit because, like, I will tell you in the economics field,
05:22 – 05:27
like, the graphs are also fairly terrible in most journals. And so you you spend there’s, like,
05:27 – 05:31
this this section right at the end that’s kinda tucked away, but it was kinda like my favorite
05:31 – 05:38
part because it it talks about this peer review process that nobody talks about. And and I guess,
05:38 – 05:46
I’m not sure what my question is other than why do you think Peer reviewed journals haven’t,
05:47 – 05:55
tapped into what we know is data visualization is a popular means to communicate research and data.
05:56 – 06:07
Because nobody feels enough of an expert to comment on it. And, so it’s it’s from from the junior
06:08 – 06:13
to all the way to the senior level, that nobody knows how to do it.
06:14 – 06:14
Yeah.
06:14 – 06:22
And so so, You know, I I was interested in in data visualization right from the very beginning
06:22 – 06:29
of my scientific career before, So I always spent a lot of time, you know, doing stuff, but
06:29 – 06:32
nobody of my supervisors would ever comment on that part.
06:32 – 06:33
Right.
06:33 – 06:40
And if you made a table, Well, you just made a table, and nobody said, gee, why don’t you you
06:40 – 06:44
know, why is the order of your categories like this? Or why have you ordered your numbers in
06:44 – 06:50
this way? And why is this table so terrible to read? It was sort of a a natural phenomenon that
06:50 – 06:53
tables and graphs would magically appear.
06:53 – 06:54
Yeah.
06:54 – 06:59
And then right from the first version, they would go all the way into the published paper. And,
06:59 – 07:06
unfortunately, That is either still the case, or journals have caught up and put in some guidelines
07:06 – 07:08
that make tables and graphs worse.
07:09 – 07:16
Right. They built these templates that are not built by I mean, I’m sure well intentioned, but
07:16 – 07:19
not people who may have expertise and how to do these well.
07:19 – 07:25
Or they and and, I mean, the most recent version of this is that the the bigger journals have
07:25 – 07:32
put little software machines in place Mhmm. Where, once you are past the acceptance stage, you
07:32 – 07:37
go into the proof. You get your proof, But the proof is no longer a PDF, but it’s actually a
07:37 – 07:43
web page where you can, you know, correct your paper. And the web page has a little table machine
07:43 – 07:48
in there It does not allow you to format the table properly.
07:48 – 07:48
Right.
07:49 – 07:51
Or it makes it so difficult that everybody gives up.
07:52 – 07:57
Right. There’s a revenue stream problem in academic publishing, right? Like my first, the first
07:57 – 08:02
paper I wrote, I remember the first paper that I will coauthored, back in the day, the journal
08:03 – 08:08
helped make all the the graphics. You know, they they had all the editors. I mean, there was
08:08 – 08:11
a team, and now You’re on your own. It’s like, you
08:11 – 08:14
know Exactly. Yeah. Exactly.
08:14 – 08:20
And do you do you think I mean, this is looking ahead, but do you think that Maybe some of these
08:20 – 08:25
artificial intelligence tools and as the, you know, data vis tools maybe start developing it
08:25 – 08:29
better, that will be helpful, or do you think we’re just gonna be spinning our wheels and
08:30 – 08:36
Well, yeah, well, if you say AI, AI needs to be trained. And if AI is gonna be trained with
08:36 – 08:43
the current body of literature, they will Engrained is the the horrible practice that we have now. Right.
08:43 – 08:44
Yeah. That’s right. That’s right.
08:44 – 08:51
I hadn’t even thought of that, but you’ve given me a new nightmare to contemplate. That’s that’s
08:51 – 09:01
not gonna really help. I mean Yeah. The software has improved Immensely. So there’s much better
09:01 – 09:07
tools that you can use today than there were when I started out. I mean, when I started out,
09:07 – 09:10
we didn’t have computers. I mean, I’m that old.
09:10 – 09:11
Punch cards.
09:12 – 09:15
And, actually, I started before we had punch cards.
09:15 – 09:16
Before punch cards. Okay.
09:16 – 09:26
So so so my first, scientific article was drawn by a scientific artist Oh, wow. Who who had
09:26 – 09:28
the, you know, the like architects have?
09:28 – 09:29
Yeah. Yeah. The big drafting.
09:29 – 09:35
And then they they would have paper, and then they would have sets of of dotted and dashed lines
09:35 – 09:41
on on paper, and they they would put it on the line, and then they would scratch the ink onto
09:41 – 09:48
the well, that was that was that was pure art. Yeah. Yeah. And I I remember dreaming of a computer
09:48 – 09:54
program when when When the first Mac started when when the the the Mac first Mac came on the
09:54 – 09:57
market, I dreamt of a program that would do things.
09:58 – 09:59
We’ll do it. Yeah.
09:59 – 10:04
And when that program appeared, it isn’t around anymore, so I can safely name it without sponsoring
10:04 – 10:12
anybody, Cricket Graph. Oh, wow. When that program came along, I bought my first Mac because
10:12 – 10:17
I had dreamed of that program. You know, you click on an axis, And it opens, and you can change
10:17 – 10:18
the scale. That sort
10:18 – 10:19
of thing.
10:19 – 10:24
Well, that’s, you know, routine now. It wasn’t it wasn’t around. So Right. I I’ve seen it all
10:24 – 10:31
come, and, I mean, now there’s such good programs, and and some of the programs are invented
10:31 – 10:38
by people who know about graphs. So the default is already half or three quarters good, which
10:38 – 10:45
means the errors, If you just keep on the default, you already have a pretty good Right. Starting
10:45 – 10:48
point. And many people don’t get past the default, as you know.
10:48 – 10:49
Yeah. Right.
10:49 – 10:56
When Microsoft changed their standard font, everybody uses that font because nobody ever changes
10:56 – 11:01
a font. Right. And it’s with graphs and with tables. It’s it’s the same way. You have one template.
11:01 – 11:03
Oh, that’s the way to do it. Okay. Fine.
11:03 – 11:10
Right. Right. So so you’ve mentioned a couple times tables. In the first chapter after the introduction,
11:10 – 11:17
the first chapter in your book is on table design. And By my count, there are only 3 data visualization
11:17 – 11:22
books that spend any significant time talking about tables. Your book, my book, and Steven Fuse
11:22 – 11:24
book. And those are the only 3 that I’m aware of
11:24 – 11:25
that’s happening in
11:25 – 11:34
the same time. And I’m curious, I guess my curiosity really stemmed, why you decided to do tables
11:34 – 11:35
at the very beginning of the book?
11:37 – 11:39
Well, I was gonna do a book on graphs.
11:39 – 11:40
Uh-huh. Right.
11:40 – 11:48
And then I I thought yeah. But I need to do tables first. Yeah. Because tables in a way, are
11:48 – 11:56
simpler. Mhmm. Although, making a good table is pretty difficult, but tables are simpler because
11:56 – 12:00
you don’t have all these, you know, directions you can go in.
12:00 – 12:01
Yes.
12:01 – 12:04
And they are sort of the basic staple of of science.
12:05 – 12:05
Yeah. Is
12:05 – 12:12
it that’s not graphs. It’s tables. Yeah. And I think nobody writes about them. So I was highly
12:12 – 12:18
Fired when I I didn’t have your book then. I just had Steven Few’s book, and I thought, he is
12:18 – 12:23
a guy who writes about table, and it’s it’s It’s really I mean, his table chapter is really very good.
12:23 – 12:24
Yeah. So,
12:26 – 12:32
like with all things, you know, do I need to add to that? Yes. Because Steven does not address
12:32 – 12:38
the issues I have in my field with tables and and extensively described in the chapter where
12:38 – 12:43
you have, And I’m I’m moving away from that, but that’s what you see very commonly. You have
12:43 – 12:50
multiple numbers within one cells, mhmm, within one cell that are, you know, unclearly separated.
12:50 – 12:56
So you have these number blurbs that almost become words inside a cell, And then on the next
12:56 – 13:01
line, you have the same blur but slightly differently. And, you know, within 4 lines, your table
13:01 – 13:02
is a total mess. So
13:02 – 13:02
Yeah.
13:02 – 13:10
Yeah. There were there were quite a A number of things that Steven Few didn’t have to focus
13:10 – 13:17
on because his tables We’re basically about business and sales and and housing and whatever.
13:17 – 13:22
So it’s Right. Sort of one category, which is either dollars or bricks So whatever.
13:23 – 13:23
Yes.
13:23 – 13:29
And we have all these multi item tables with different things. So Yeah. There’s there’s highly
13:29 – 13:34
specific things that I was struggling with Before I wrote the chapter and I had sort of found
13:34 – 13:39
some solutions boers, which were universally rejected by journals if I submitted, they say,
13:39 – 13:42
yeah. But this is not standard. Yeah. But just down at the zoo.
13:44 – 13:49
I mean, it’s so interesting. Like, they have their own publishing platform, whatever tools they
13:49 – 13:55
use. Yeah. And it’s like, that’s that’s it. And if it doesn’t fit in that little box, then
13:56 – 14:02
Well, it’s it’s worse than that. I think most of the journals have one template, which, you
14:02 – 14:11
know, looks like a format. Mhmm. But what they do is they just Pour your numbers in the cells
14:12 – 14:17
and then forget about it. There isn’t Right. You know, there’s some good journals that make
14:17 – 14:24
very nice tables, but, Usually, it’s just, oh, you’re submitting a table. Okay. Here’s our matrix.
14:24 – 14:28
Just pour it in. There you go. This is our format. This is how we always do it.
14:28 – 14:29
Right.
14:29 – 14:32
So there’s no there’s no thinking behind it. It’s just production.
14:33 – 14:39
What would your top recommendations be for for people when it comes to making good tables.
14:40 – 14:41
I can name only 1.
14:42 – 14:42
Okay.
14:42 – 14:43
Only only 1?
14:43 – 14:44
Love it.
14:44 – 14:45
Because I have a lot of them.
14:45 – 14:47
Yeah. Yeah. Yeah. Yeah. 1 would be good. Yeah.
14:48 – 14:52
The the I think the first recommendation would be proper alignment.
14:53 – 14:54
Yeah.
14:54 – 14:59
It’s it’s about categories and numbers, and if your numbers are not properly aligned, they’re
14:59 – 15:00
difficult to read.
15:00 – 15:06
Yeah. It is fascinating to me when I see this. I’m sure I mean, you know, science is science.
15:06 – 15:10
So they have like you said, you have multiple things in a cell. You’ve got the, You know, a
15:10 – 15:14
coefficient and a standard error with stars and and parentheses around it. And it is shocking
15:14 – 15:20
to me how the numbers are not aligned, and it is just Just objectively difficult to read. I
15:20 – 15:26
don’t think we need any sort of, like, like, real fundamental study to to prove that point.
15:26 – 15:28
It is just harder to read.
15:29 – 15:36
Exactly. So so and and it’s so basic, and it is it is fairly easy to correct. I mean, there’s
15:36 – 15:41
some issues, you know, shall we do it this way, that way? But it’s not like, oh my god. How
15:41 – 15:47
am I going to solve this problem? It’s just saying, this is how you do it and implementing it.
15:47 – 15:54
I think the the other thing about tables is to communicate to people that, you know, tables
15:54 – 16:02
are actually Boers of like very simple graphs. So you have to think about the ordering of the
16:02 – 16:11
information in your table, Which which which I I term after one of the other giants, telling
16:11 – 16:19
a story. So it’s not like you make a table and you order your categories by alphabet, or by
16:19 – 16:30
the order in your database, mhmm, Or by, you know, random? Yes. You know?
16:30 – 16:33
Whatever your code says the variable name is? Yeah.
16:33 – 16:38
Usually, things you find important are gonna be at the top of the table. Yeah. So don’t let
16:38 – 16:46
me scroll through 18 rows of noise before, you know, we hit on the thing you wanted to show
16:46 – 16:50
me in the first place. Yeah. So those those are it’s more than one. That that would be the 2
16:50 – 16:56
things, and then there’s all the other stuff. Because as you rightly noted in the prep for this
16:56 – 17:01
interview, I think the the tables chapter is actually the longest of the whole book.
17:01 – 17:07
Well, it’s it is interesting because Steven’s Chapter on Jon tables. I don’t remember, but exactly
17:07 – 17:11
where, but it’s somewhere in the middle of the book. The chapter I wrote on tables is at the
17:11 – 17:16
end of my book. And I just I did I found it interesting that it was at the beginning of this
17:16 – 17:21
book, but when I think about at least in economics, and I’m guessing it’s similar in your field.
17:21 – 17:28
If I think about picking up any journal and just going page by page and counting graphs versus
17:28 – 17:35
tables, I’m sure there are more tables, than than graphs in any random journal that I select.
17:35 – 17:41
Yeah. And and and in every paper, there’s a table 1 become it always comes before any graph,
17:41 – 17:41
and
17:41 – 17:47
it hits gives you the baseline data, so it’s sort of really, really basic. Yeah.
17:47 – 17:55
Yeah. The other interesting, piece of the book is you have an entire chapter Dedicated to, you
17:55 – 18:00
know, what you call matrix graphs. Other people call it small multiples, trellises, panel chart.
18:00 – 18:06
I mean, whatever. Right? But So, again, I’m I’m curious about the decision to include to to
18:06 – 18:11
write an entire chapter about it. I think it’s, I I don’t I’m not saying I disagree with it.
18:11 – 18:15
I actually agree with it, that it it deserves its oh, it’s one of the one of my pillars of good
18:15 – 18:21
data visualization is Think small multiples and and see where’d you go. So, again, what was
18:21 – 18:25
your thinking behind a whole chapter dedicated to small multiples? And then we could talk about
18:25 – 18:29
what your, you know, Top recommendation would be when people are are creating it.
18:29 – 18:36
Okay. So so that chapter is actually has 2 parts. The Sort of the basic part, the basic principles
18:36 – 18:45
of small multiples, which covers topics like, you know, trying to harmonize your axis So that
18:45 – 18:51
you can do away with a lot of labeling and and trying to, again, get the order right for the
18:51 – 18:57
message you have in your data, Mhmm. Whether it be you know, we like to compare horizontally
18:57 – 19:01
rather than vertically. So if your main comparison is horizontal, they should be like this,
19:01 – 19:08
all that sort of stuff, which goes for, all of those, what I call matrix drafts. But the real
19:08 – 19:22
story is The second maarten. And the second part is my effort to make A good matrix graph for
19:22 – 19:29
basic scientists. So in in my field, basic scientists are the The people in the labs, or they
19:29 – 19:33
call themselves usually translational because they want people to feel that they are connected
19:33 – 19:40
to the bedside. You know, they do stuff with cells and with with with DNA and with, experimental
19:40 – 19:41
animals and what have you.
19:41 – 19:42
Right.
19:43 – 19:52
And within my field, that is a really close shop Mhmm. Of people who have their own methods
19:52 – 20:01
of doing research. And if you’re not In the in crowd, you usually don’t understand anything
20:01 – 20:06
of what’s going Jon, and that’s because they have their own codes. They have their own abbreviations.
20:06 – 20:11
They have their own way of doing things. They have their own way of doing statistical analysis.
20:11 – 20:17
And I’m, you know, sort of a statistician as well, Doing a lot of things that in the rest of
20:17 – 20:24
the world is found to be quite objectionable and not really very right, like doing, experiments
20:24 – 20:31
with very, very small sample sizes and then doing, parametric statistics To see differences,
20:31 – 20:38
significant differences, and then doing multiple tests, and I can go on and on. So There’s a
20:38 – 20:51
lot of things in in methodology, in, my type of methodology, Which is not the lab, mechanistic
20:52 – 20:59
methodology, but really statistics, Yeah. How to set up your experiment, how to do controls,
20:59 – 21:06
and all that sort of thing, which is, Well, difficult to understand, and and usually not the
21:06 – 21:13
way I would recommend, but the other side of it is they do a lot of, So they have this theory
21:13 – 21:24
of, you know, whatever. Agent a, blocks The production of, some sort of stuff that you need
21:24 – 21:31
or not need from, from the raw compound, And we’re gonna prove that that is the fact and also
21:31 – 21:36
that it has a biologic effect that is relevant. Mhmm. And so the so they have this system where
21:36 – 21:42
they say, okay. Let’s first look in the genes, whether this is happening. And then let’s look
21:42 – 21:48
at the gene expression, and then let’s look at whether the cells who have that gene extraction
21:48 – 21:53
are actually in the place where they should be doing the work or not. And then if we have that,
21:53 – 21:59
then let let’s look at, You know, in whole lab animals, and let’s see whether the agent that
21:59 – 22:06
is needed for that process is actually being blocked or not. So they have A lot of ways to buttress
22:06 – 22:11
their theory, which is really very cool. You know, in in human experiments, we have this drug,
22:11 – 22:16
And we put it in a human and see, you know, what happens to the human, and we have to infer
22:16 – 22:23
all those steps in between. So Their if you want statistical methodology, I think is very poor,
22:23 – 22:32
but their biological methodology It’s very, very advanced, very precise, very elegant. They
22:32 – 22:37
always have, you know, 3 or 4 ways of proving their point. Yeah. We maarten here and there and
22:37 – 22:43
there and there, and that’s why this is true. So really good stuff. Beautiful science. And Yeah.
22:43 – 22:49
I always thought when I saw these Horrible mattresses that they produce because they have, like,
22:49 – 22:55
in an 8 minute presentation, they will have Twenty slides, and all those slides are small multiples
22:55 – 23:01
of at least 16 graphs with unreadable letters with no explanation. And they’ll say, So here
23:01 – 23:07
you see that a was logged by b. And by the way, oops, next slide. B was logged by c. Right.
23:07 – 23:14
You know, another 20 graphs. So it was Oh my god. Oh my god. I I can’t So I thought, you know,
23:14 – 23:20
it may just be that I’m too stupid to understand this stuff, And all the rest of these basic
23:20 – 23:24
science people, they understand, and they’re happy with these slides. It’s just me being stupid.
23:24 – 23:24
Yeah. So
23:24 – 23:29
I just said, okay. I don’t understand this graph, and I had a colleague, you know, I work in
23:29 – 23:34
a in a university bath department with a lot of basic scientists, And there was a very nice
23:34 – 23:41
presentation from one of his PhD students. And I saw from the slides okay. Here’s Someone who’s,
23:41 – 23:46
you know, sort of interested in in in trying to convey what’s going on to me who is not in the
23:46 – 23:51
lab. So that was the starting point. And I said, look. I saw this presentation. It’s nice. I
23:51 – 23:57
understood about 25% of it. Mhmm. I want to understand 100%, and I wanna see your graphs, And
23:57 – 24:04
I wanna see whether I can improve it Mhmm. After understanding so that it’s better.
24:04 – 24:04
Yeah. Is
24:04 – 24:09
it okay? So, well, it was a bit of, you know, to and fro before I had the data and I had the
24:09 – 24:15
graphs and I had so I started building this. I think, Yeah. I wasn’t working full time, and
24:15 – 24:16
I think it cost me 3 months
24:17 – 24:17
Oh, wow.
24:18 – 24:23
To understand the science Oh, yeah. Really on the detail level. And, again, you know, the science
24:23 – 24:29
is not really very difficult. It’s just understanding what’s going on in the nucleus with that,
24:30 – 24:37
cytokine and that stuff What is being activated and repressed? Sure. But you have to know what
24:37 – 24:43
kind of abbreviations they’re using. Are they useful? And then you find out they always do the
24:43 – 24:53
same experiment. It’s Jon generic experiment, and they do that In different settings over and
24:53 – 24:53
over and over again.
24:53 – 24:54
Over and over again. Right?
24:54 – 25:03
It’s like this. Okay. I have a negative control. Mhmm. Yeah. It’s normal saline or or or unstimulated
25:03 – 25:11
cells boers, Wild type baby baby calves or whatever. It’s, you know, this. Okay. Let’s see what
25:11 – 25:12
happens. That is my zero condition.
25:13 – 25:13
Yeah.
25:14 – 25:21
And then I have my positive control, which is, let’s inject this baby calf with something which
25:21 – 25:25
will turn All the white blood cells on. We know it does.
25:25 – 25:26
Right.
25:26 – 25:31
So okay. In our system, let’s see what happens. And we have, you know, All the locusoid counts
25:31 – 25:37
going up or all the loop, interleukins being activated, whatever. So that’s my positive control.
25:37 – 25:44
Okay. Mhmm. Now let’s put in the agent we think is going to do something in this system.
25:44 – 25:45
Right.
25:45 – 25:50
Let’s see what happens in this leukocyte kind of whatever the system is I’m using to measure.
25:50 – 25:56
Okay. We have that. Okay. Now let’s see what we what happens If I put in a blocking agent without
25:56 – 26:02
my stimulus, nothing happens because nothing’s being seen. Now let’s put them in together. Is
26:02 – 26:07
it really blocking? Yeah. Because Yeah. The original is going down by 50%.
26:08 – 26:08
Right.
26:08 – 26:15
Okay. So, this is blocking that. Okay. Good. Next experiment. Let’s try the same thing in another,
26:16 – 26:20
setup with different readouts. Is it happening there too? Is it happening there too? Is it happening
26:20 – 26:25
there too? Is it happening there too? And then they cycle through this because once they’ve
26:25 – 26:29
shown this to be a blocking agent, they’re gonna see, okay. If I block it and now I stimulate
26:29 – 26:34
something else, What will be the effect downstream, and it goes on or not? But the basic experiment
26:34 – 26:39
is always positive control, negative control, Action blocking.
26:39 – 26:39
Right.
26:39 – 26:41
So those are the generic labels.
26:41 – 26:42
Yeah. Yeah.
26:42 – 26:47
So why should would I need to read 16 graphs with hrzbq25
26:51 – 26:51
Right.
26:51 – 26:56
And have to go down to the footnote, which is half page long. And halfway, it says, Stimulates,
26:58 – 27:01
a b c l l five o x
27:01 – 27:01
Yeah. Yeah.
27:02 – 27:06
Which is the readout system that I’m using in this little panel.
27:06 – 27:07
Right. Right.
27:07 – 27:11
Anyway, I’m I’m using a lot of words, but that chapter cost me 3 months to make. And in the
27:11 – 27:17
end, well, you saw you’ve seen it in the book. I hope you said, okay. You know, from the original,
27:17 – 27:22
this is quite an improvement because I sort of can see what they’re doing with little text saying,
27:22 – 27:28
in this experiment, we showed blah blah blah, but not that. And then here, this was stimulated,
27:28 – 27:34
and the labels, The text and the labels are as generic as possible with as little abbreviations
27:34 – 27:41
as necessary, and then boers of walks you down that path of little multiples so that at the
27:41 – 27:43
end, you have the story.
27:44 – 27:45
Yeah. It’s kinda like a little comic book,
27:45 – 27:48
Yeah. It’s a comic book. Exactly. That’s the word.
27:48 – 27:53
But it it it’s interesting. Your your story of that what’s interesting about it is that your
27:53 – 28:03
focus is not so much on, you know, should this axis be this width, and should the grid lines
28:03 – 28:09
be this color, And should the line be such and such? It’s it’s interesting to hear you talk
28:09 – 28:17
about it because the focus of of that story was on Making the labels and the language accessible to people.
28:17 – 28:18
Exactly.
28:18 – 28:22
And and and it is interesting to me because this is one of the things that I talk about with
28:22 – 28:28
people that that it’s it’s about the titling and the labels and the annotation because, you
28:28 – 28:34
know, a graph, Whether a graph is is beautiful or not is a lot in the eye of the beholder. Right?
28:34 – 28:41
You like blue, I like green, whatever. But it’s being able to Tell people what’s going on in
28:41 – 28:46
the graph with the words. And and, I guess, I’ll say one last thing, and then I’ll I’ll let
28:46 – 28:50
you just respond. That I think you’ve heard this too where people say, oh, you should be able
28:50 – 28:57
to look at a graph and get it right away. And I just I don’t think that’s true because you need words.
28:58 – 29:04
No. Well, you know, there are rare instances. I mean, they say, you know, one one picture is
29:04 – 29:12
worth a 1,000 words. Okay. But, the the usually, the graphs that are immediately obvious are
29:12 – 29:19
those Situations where I say, did you really need a graph for this? So, you know That’s right.
29:19 – 29:27
Yeah. If I show you a graph with A mean length of men versus women, and you see a bar with men
29:27 – 29:32
being higher than women. You say, okay. So men are bigger
29:33 – 29:33
Right.
29:33 – 29:38
Larger than women. Then I say, okay. You could’ve said that in one sentence.
29:39 – 29:39
Right? Yeah.
29:39 – 29:45
Did you really know who this part this is what what pharma used to do. They’ve moved on from
29:45 – 29:49
that, I have say, but they used to show you these graphs, you know, before, after Mhmm. Big
29:49 – 29:58
big bars. Oh, right. Oh, yes. Oh, now I see it in a graph. Oh, man. No. I I think and and there’s
29:58 – 30:08
also this other thing of, I think more, traditional scientist scientific views that, you know,
30:08 – 30:14
you should not impose on the reader Boers interpretation of the data. I’m just showing you the
30:14 – 30:21
data, and then you can make your own interpretation. And I’m not gonna bias you by telling you,
30:21 – 30:29
Well, that’s all nonsense. Yeah. You know, in in in current times, people spend maybe 2 or 3
30:29 – 30:34
minutes on an abstract before moving on to the next. And if you want to hit them, you hit them
30:34 – 30:42
between the eyes. So the the graph and the caption, and and everything is telling you that message
30:42 – 30:47
again and again and again. So your caption should not read, Here’s the results of my experiments.
30:47 – 30:52
I mean, that’s not gonna help. It’s gonna say, this clearly shows that a is blocking b.
30:52 – 30:59
Yeah. I find this fascinating and it happens in scientific publishing all the time where people
30:59 – 31:06
say, oh, I can’t tell you what the result is in the graph. But if you go to the text that they
31:06 – 31:11
write, the text will tell you exactly what the argument is. But, like, when you the graph, it’s
31:11 – 31:18
like it’s just purely descriptive. And I I never understood that disconnect between What I write
31:18 – 31:23
in the text versus what I write in and around the graph? Like, why don’t they don’t
31:24 – 31:30
Well, yeah. I mean, the same discussion around What one calls declarative titles of papers.
31:31 – 31:40
So, many journals will not allow you to say, A randomized trial showing drug a is good for patients with b.
31:40 – 31:40
Right.
31:40 – 31:47
No. No. No. No. No. You have to say A randomized trial of the effect of a in patients with b.
31:47 – 31:50
Yeah. But in the text?
31:50 – 31:51
In a text, it’ll tell
31:51 – 31:57
It’ll tell you that first piece. It’s it is it’s just so frustrating to me. I just I just don’t understand it.
31:57 – 32:02
But I I I want to I want to come back to what you said earlier Yeah. Because It it is really
32:02 – 32:13
2 things. I think it’s sort of, those two things, labeled, I think Jon Cleveland. You’re you
32:13 – 32:17
know of John Cleveland? It’s an old book, but he was also one of the William.
32:18 – 32:19
William Bill Cleveland.
32:19 – 32:23
Bill Bill Cleveland. Yeah. Yeah. Yeah. Yeah. Yeah. Together with, with Tuft. Yep. Those were
32:23 – 32:31
the 2, you know, Pioneers in the field, the Giants, that we still worship every day. But he
32:31 – 32:37
I think Bill Cleveland said clear vision and clear understanding. And and those 2 are you need
32:37 – 32:44
them both, and and they’re not completely independent. But, first, you think, okay. What’s the
32:44 – 32:50
main message here? And and how I’m gonna tell this story, that is the basic, and and that would
32:50 – 32:55
would be maybe your storyboard of your multiple panels saying, I wanna show the first, this
32:55 – 33:00
experiment, then that experiment, then that. And that these three together leads to a conclusion
33:00 – 33:05
which I can use in my next set of experiments, etcetera, etcetera. Mhmm. If if that’s not there,
33:05 – 33:11
if the order there is wrong, then you’re immediately lost. Yeah. Then then the clear vision
33:11 – 33:21
comes in with Not only clear labels, but also consistent labels Mhmm. And color coding, And
33:22 – 33:30
cues in the symbols that you use, and the thinness of the versus the thickness of the series
33:30 – 33:39
lines Mhmm. And always, making your experimental condition slightly more prominent than your
33:39 – 33:45
control, but either by color or by hue or what have you. Mhmm. And that is the technique of
33:45 – 33:53
clear vision. It it’s driven by the story, but you want I mean, it’s as simple as this happens
33:53 – 34:00
all the time. So You have this order of, you know, a, then b, then c, and then you have a graph
34:00 – 34:09
where the order is c b a. Like, you know, immediately, your brain is, hey. What was first here?
34:09 – 34:11
Right. Right. Yeah.
34:11 – 34:14
And that’s usually the the programs that do that. Right? They they they have a Yeah.
34:14 – 34:15
There’s some
34:15 – 34:17
PolicyViz give you the wrong order.
34:17 – 34:17
Right.
34:17 – 34:25
So all these things is designed. And and like in tables, Ordering is designed. It’s so it it
34:25 – 34:29
boers of fits together, and you need to go back and say, okay. Now I fiddled around with my
34:29 – 34:34
labels, but is the story still clear? Oh, no. Because here, I used another label be because
34:34 – 34:39
it fit better or because it was more appropriate in this graph. But given the story, perhaps
34:39 – 34:45
I should change it to the other label because then it the consistency is is is maintained. Boers
34:45 – 34:47
I can’t, but then I thought about it.
34:47 – 34:52
Right. I wanted to ask you one last question on on the small multiples. So some of the examples
34:52 – 34:59
in the book have, Say an odd number of graphs. But the way, at least some of the examples are,
34:59 – 35:04
is, you know, you might have, let’s just say, 2 squares next to each other than a rectangle
35:04 – 35:10
below it that kind of spans across. What would you recommend to people if they have small multiples,
35:10 – 35:16
but they have an odd number? They have 1, 2, 3, and they have this blank area. Right? That’s
35:16 – 35:22
kinda left over. What do you what do you recommend people do with that blank area?
35:23 – 35:31
That’s a difficult question. Also, because, usually, you’re not the one making the page layout.
35:31 – 35:32
Yes, that’s true.
35:32 – 35:45
So so, Any form you produce will be handled or mishandled by someone In the production process
35:45 – 35:55
Mhmm. Turning it into something different. I I remember, with The tables article that I wrote
35:55 – 36:03
that they got the layout so wrong that I ended up, you know, printing the proof, Mhmm. Cutting
36:03 – 36:04
it into pieces
36:04 – 36:04
Yeah.
36:05 – 36:09
Putting the layout together, and then making photographs that say, I want it in this order.
36:09 – 36:15
Yeah. Sure. So if you have these If you have these 3 panels, they could either make a big, white
36:15 – 36:22
area boers they could just fill it up with text, in which case It maybe it doesn’t matter, or
36:22 – 36:26
they put the caption there, which if it’s big enough, you know, fits inside, then you have to,
36:26 – 36:32
you know, place your, I would say then then, you know, you have a a row on top if I do it in
36:32 – 36:39
mirror image, and I would put the the third one on the right side, and then I put the caption
36:39 – 36:45
on the left below the first well, you know, that sort of thing. If if they allow you to fiddle
36:45 – 36:52
around, that’s That’s fine. I don’t think you need to fill up a square space with a, you know,
36:52 – 36:59
a stretch graph just because then Yeah. You’ll have the block filled. But it it it it boers
37:00 – 37:08
remind me To say that it’s really important that you think about the limitations of the journal
37:08 – 37:15
page, whether it be electronic or paper, it’s it it isn’t really helpful if you make graphs
37:15 – 37:20
that fill 1 and a half column, where you know it has 2 columns because either they’ll blow it
37:20 – 37:24
up, which would be nice, or they’ll make it smaller.
37:24 – 37:25
Smaller, right.
37:25 – 37:30
Where, and then everything becomes unreadable, So you have to think in advance. Okay. What do
37:30 – 37:35
I want for this figure? Can it be in 1 column? Can it be in 2, or should it be a full page?
37:35 – 37:40
And if so, Should it be rotated, which is usually not good for figures because people read upright,
37:41 – 37:42
etcetera, etcetera. So that kind of Yeah.
37:42 – 37:48
Yeah. Yeah. No. But that’s but that’s a good point. And it is it is true that a lot of journals
37:48 – 37:53
don’t tell you that until The very end. Or they never tell you that, and they just say this
37:53 – 37:55
is how it’s gonna be, and then you’re you’re
37:55 – 37:59
And and most authors are so thrilled that their paper is finally accepted.
37:59 – 37:59
Right.
37:59 – 38:08
They don’t wanna Jon difficult where the reality is the, you know, the science editor is usually
38:08 – 38:11
on your side, and it’s just as exasperated with the production process
38:12 – 38:12
Right.
38:12 – 38:19
As as you are, but is out of the the loop. Right. The moment it goes into production, you’re
38:19 – 38:25
dealing with technicians Yeah. Who may or may not be helpful. Right. Perfectly fine just to
38:25 – 38:30
say, you know, Sorry. This proof is not okay because this and that. And then, I’ve had proofs
38:30 – 38:31
come back 4 times.
38:31 – 38:32
Yep.
38:32 – 38:37
But then, of course, I’m a cranky old man. So If you’re a junior felon,
38:37 – 38:45
that that would be a good deal. Well, on that note, So the book is data visualization for biomedical
38:45 – 38:51
scientists, creating tables and graphs that work. Maarten, where can folks find you if they have
38:51 – 38:56
Questions boers they wanna bring you in to talk, where what’s the best way to to find you?
38:56 – 39:01
Well, I think it best it best were if you Just were to project the the slides I sent you with
39:01 – 39:11
my personal details, also the The link to the book, if they wanna buy it, it’s it’s QR codes.
39:11 – 39:16
And I have to say, you know, I’m actually, Next week, I’m gonna be in in in Canada, in Toronto
39:16 – 39:25
to give a a data course, which is, you know, a half day affair with Some prep work, and that’s
39:25 – 39:30
really a lot of work to do. Yeah. You know? So I I do travel.
39:30 – 39:31
Okay.
39:31 – 39:36
And I I I don’t I don’t organize these courses myself mostly. I just tell people, you know,
39:36 – 39:40
you want me to give that boers? Okay. Then I’ll come over, and then this is what you need to
39:40 – 39:42
do. So I’m in.
39:42 – 39:47
Yep. That’s great. Well, I’ll put links to the book and to your to your classes site and, folks
39:47 – 39:51
can get in touch. Maarten, thanks so much for coming on the show. Really appreciate it. It was
39:51 – 39:52
a great conversation.
39:52 – 39:53
Thanks.
39:55 – 39:57
Thanks for tuning in to this week’s episode
39:59 – 39:59
of the show. I hope you enjoyed that.
39:59 – 40:04
I hope you will check out Maarten’s book. I put links to his website and all of the things that
40:04 – 40:10
we talked about on the show notes at PolicyViz. And you can check out other resources, other
40:10 – 40:15
tutorials, and other things around the world of data and data visualization at PolicyViz. Thanks
40:15 – 40:21
so much for listening. Until next time. This has been the Policy of this podcast. A number of
40:21 – 40:26
people helped bring you the Policy of this Podcast music is provided by the NRIs. Audio editing
40:26 – 40:31
is provided by Ken Skaggs. Design and promotion is created with assistance from Sharon Sotsky
40:31 – 40:35
Remirez, And each episode is transcribed by Jon Transcription Services. If you’d like to help
40:35 – 40:40
support the podcast, please share it and review it on iTunes, Stitcher, Spotify, YouTube, or
40:40 – 40:45
wherever you get your podcasts. The Policy Mizz podcast is ad free and supported by listeners.
40:45 – 40:51
If you’d like to help support the show financially, please visit our Paypal page or our Patreon page atpatreon.com/policyviz.