Baseball Toaster Catfish Stew
STOP CASTING POROSITY! An Oakland Athletics blog.
Fresh Pitch Chart, Get it While it's Hot
2007-02-08 13:13
by Ryan Armbrust

Straight from the Ryan Armbrust Graphical Baseball Laboratory, located in a secluded castle on a mountaintop of the Bohemian Alps in southeast Nebraska, I bring you my latest creation. What you see is every Oakland pitch thrown in one game last year.

What began as a simple experiment in visualizing pitch progression for a game ended up looking vaguely like a reject from the Frank Lloyd Wright school of stained glass windows.

Nonetheless, I think there's some promise to what I set out to accomplish. Let me go back to the beginning, and explain what I've got going on there.

On the left-hand side, there are some letters that, while they may look like some kind of acronym from anatomy class in high school, actually indicate the result of each pitch on the rest of the chart. K is a swinging strikeout, backwards K is a called strikeout, O is an out made on a ball in play, S is a swinging strike, C is a called strike, and F is a foul ball. Below the double line that separates good from bad, B is a ball, W is a walk, and H is a hit.
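For anyone who wants to tinker, the legend can be written down as data. This is only a sketch: it assumes the rows run top to bottom in the order the legend lists them, and the names ROWS_ABOVE, ROWS_BELOW, row_index, and is_good_for_pitcher are made up for illustration.

```python
# Chart rows, top to bottom, in the order the legend lists them (an assumption).
# "backwards K" is a stand-in label for the called-strikeout symbol.
ROWS_ABOVE = ["K", "backwards K", "O", "S", "C", "F"]  # above the double line
ROWS_BELOW = ["B", "W", "H"]                           # below the double line
ROWS = ROWS_ABOVE + ROWS_BELOW


def row_index(code):
    """Return the vertical position of a result code, 0 being the top row."""
    return ROWS.index(code)


def is_good_for_pitcher(code):
    """True when the code sits above the double line."""
    return code in ROWS_ABOVE
```

Each pitch then becomes one mark at (pitch number, row_index(code)).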

The vertical lines indicate the end of a plate appearance, and the numbers below everything help define the duration of a specific inning.

Bright red marks are strikeouts, maroon marks are outs on balls in play, and the blue dashes are hits. You can tell at a glance how many K's a pitcher recorded and how many hits he gave up over the course of a game. Speaking of which, I went the easy route for this first chart, and graphed one of the simplest games of 2006. If you can guess which complete game this was, and which Oakland pitcher threw it, I'll consider the chart to be at least somewhat interpretable.

My goal here is to produce a sleek chart that will contain as much relevant info as possible. If you can take a glance at it and say, "Hey, I see that the second batter of the third inning saw a ball, took two called strikes, another ball, and then struck out swinging", that's what I'm aiming for.
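The plate appearance in that example can be worked through in code. The sketch below is one possible way to turn raw pitch results into chart rows, not necessarily how the chart was built. It assumes the input uses the legend's letters (S, C, F, B, O, H), and the function name chart_marks is hypothetical. The pitch that ends the plate appearance is promoted to K, backwards K, or W; foul-tip and foul-bunt strikeouts are ignored.

```python
def chart_marks(pitches):
    """Map raw pitch results to chart rows for one plate appearance.

    A third strike becomes "K" (swinging) or "backwards K" (called),
    and a fourth ball becomes "W". Everything else keeps its own letter.
    """
    balls = strikes = 0
    marks = []
    for p in pitches:
        if p in ("S", "C") and strikes == 2:
            marks.append("K" if p == "S" else "backwards K")
        elif p == "B" and balls == 3:
            marks.append("W")
        else:
            marks.append(p)
            # A foul only counts as a strike with fewer than two strikes.
            if p in ("S", "C") or (p == "F" and strikes < 2):
                strikes += 1
            elif p == "B":
                balls += 1
    return marks


# Ball, two called strikes, another ball, then a swinging third strike.
print(chart_marks(["B", "C", "C", "B", "S"]))
# ['B', 'C', 'C', 'B', 'K']
```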

There are some unresolved issues, such as how to deal with double plays (like the one that ended the first inning) and pitching changes, whether runs should be accounted for, and whether I should differentiate between singles, doubles, etc. I've got some ideas brewing, though.

So, I'm going to open myself up to some criticism here. Keep in mind that I've only invested a couple hours into this prototype, but don't pull your punches. What, if anything, do you like about this? What would you include/exclude/change? Is it similar to an existing thing that I've managed to remain ignorant of? Also, let me know if you'd like to see more of this sort of nerdy numbers and pictures creation thing, or if you'd rather I stick to more... traditional methods.

Comments
2007-02-08 14:12:46
1.   For The Turnstiles
7 Ks and no walks has to be either a Haren start, or Blanton against Seattle.
2007-02-08 14:29:48
2.   Ken Arneson
It's an interesting idea, but it does need more work.

My first suggestion would be to move the "O" column to the top, so that all the different kinds of strikes are grouped together. It also makes it easy to compare BIP outs with BIP hits; they're on the opposite ends of the chart. (That would actually make the chart kind of cylindrical; it wraps around.)

Suggestion #2: it's pretty hard to tell where the innings begin and end. The line looks the same as any other PA. Maybe the innings can alternate background color or something.

2007-02-08 15:06:44
3.   Ryan Armbrust
I agree about the O column. Grouping strikes makes sense.

Alternate colors would work well. I'm thinking of alternating two shades of one color for the SP, and then different colors for relievers.

Oh, and for those curious as to which game this was, it wasn't Haren or Blanton. It was the AL pitcher of the month in August 2006.

2007-02-08 15:24:34
4.   mehmattski
Oh, I am certainly pro-nerdy, I'm fascinated by what you've got here. Is there a way to incorporate the "on base" situations into the graph, so we can see if there's a different strike/ball ratio when there's runners on/in scoring position? I'm thinking specifically of the study that showed that Chien-Ming Wang got a higher ratio of strikeouts with runners in scoring position.

Also, where does this pitch-by-pitch data come from? Is there a free site to access this data during a season? Or even a pay site? I'm tossing around some nerdy studies myself, and am looking for easy data access. Thanks!

2007-02-08 16:15:55
5.   Ryan Armbrust
mehmattski, I like the idea of indicating runners-on-base situations. I don't want the chart to get too busy and difficult to read, but that might be something that can be integrated without too much trouble.

As for the pitch-by-pitch data, it's all courtesy of Retrosheet, one of the greatest things ever invented. Check out retrosheet.org.
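For readers who want to try this themselves, here is a rough sketch of pulling pitch sequences out of Retrosheet event-file text. It assumes the usual layout of a play record (the word play, inning, visitor/home flag, batter id, count, pitch string, event) and keeps only the B, C, F, S, and X pitch letters, dropping every other character in the pitch string. The sample line, the batter id batter001, and the function name pitch_sequences are invented for illustration and do not come from a real game.

```python
import csv
import io

# A made-up play record in the assumed Retrosheet layout.
SAMPLE = "play,3,0,batter001,22,BCCBS,K\n"


def pitch_sequences(lines):
    """Yield (inning, batter, pitches, event) for each play record.

    Only balls (B), called strikes (C), fouls (F), swinging strikes (S),
    and balls put in play (X) are kept; other characters are skipped.
    """
    for rec in csv.reader(lines):
        if len(rec) >= 7 and rec[0] == "play":
            _, inning, _side, batter, _count, pitches, event = rec[:7]
            kept = [ch for ch in pitches if ch in "BCFSX"]
            yield int(inning), batter, kept, event


for row in pitch_sequences(io.StringIO(SAMPLE)):
    print(row)
```

The event field (K in the sample) says how the plate appearance ended, which is what decides whether the last pitch lands in a strikeout, walk, out, or hit row.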

2007-02-08 17:24:56
6.   Padgett
I like this; very Tuftian. Ken's suggestion is good -- create some symmetry by having the more important and less frequent events (outs, Ks, hits) be near the edges, and the more frequent (strikes, balls) clustered in the center. Perhaps you can separate the at-bat-ending events from the individual pitches with a faint dashed or dotted line.

I also agree with more clearly demarcating the ends of innings; this in fact was the one thing that I didn't see right away. Another possibility is to use different shades of blue to indicate whether the hits are for extra bases, and you could do the same with the outs to represent double and triple plays.

Again, this is very cool.

2007-02-08 17:42:36
7.   Phil Bencomo
Here's the game, I believe:

http://www.baseball-reference.com/boxes/TOR/TOR200608230.shtml

Esteban Loaiza complete-game shutout on August 23.

2007-02-08 20:13:31
8.   joejoejoe
A few thoughts...

- add a horizontal scale to see pitch progression (5, 10, 15, 20, etc.) - that way you can correlate pitch counts and the inning at a glance

- add totals to each row at the far left so you can see #K, #BB, #H, etc.

- the slash marks that denote each pitch could actually be a number or character that carries information. For example a double play could have a '2' instead of a slash. Triple play '3'. Infield Fly 'I'. Maybe this is off topic a bit but it's just an idea.

- Add groundout, flyout, and single, double, triple, HR marks. It's not that much more information and it allows you to compile just about every pitching stat from the chart

- more density is good, readers can generally handle far more information than is presented in common charts and graphs
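The first two suggestions above (a pitch-count scale and per-row totals) are simple to compute once the marks are in a list. A minimal sketch, with hypothetical function names and made-up numbers in the usage lines:

```python
from collections import Counter
from itertools import accumulate


def row_totals(marks):
    """Count how many marks fall in each chart row."""
    return Counter(marks)


def pitch_ticks(total_pitches, step=5):
    """Pitch numbers to label along the horizontal axis: 5, 10, 15, ..."""
    return list(range(step, total_pitches + 1, step))


def inning_end_pitches(pitches_per_inning):
    """Running pitch count at the end of each inning."""
    return list(accumulate(pitches_per_inning))


# Illustrative numbers only, not taken from the charted game.
print(row_totals(["B", "C", "K", "B"])["B"])  # 2
print(pitch_ticks(17))                        # [5, 10, 15]
print(inning_end_pitches([12, 9, 15]))        # [12, 21, 36]
```

Lining up the ticks against the inning-end counts is what lets a reader correlate pitch count and inning at a glance.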

I think you are off to a great start and it's always good to reexamine old ideas in new ways and I encourage you in your efforts. Nice work.

2007-02-08 21:08:34
9.   Philip Michaels
Perhaps I'm a bear of very little brain, but if the idea is that I can look at this and, at a glance, figure out how a game progressed, I think strikes and balls and all their attendant categories (swinging, called, foul, etc.) should be on one half and balls in play on the other. Right now the organization seems to be -- to over-simplify it -- good stuff that the pitcher does versus bad stuff the pitcher does. That makes it harder to follow for me.

But it's a promising start to a good idea.

2007-02-08 21:11:59
10.   Ryan Armbrust
Fantastic suggestions, all around. Thanks.

And yes, this is the Loaiza complete game from last August.

I'm going to implement some of your suggestions, and I'll have a new version ready soon. I'll pick another game, and I'll try to make it a little more interesting.

Oh, and Padgett, thanks for mentioning Tufte. I strive to follow his theories when I can.

2007-02-08 23:58:26
11.   joejoejoe
8 'add totals to each row at the far left'

I meant 'far right'. Your eye follows the chart from left to right so it makes sense that the total would be at the end - on the right.

Comment status: comments have been closed. Baseball Toaster is now out of business.