Estimating Catcher Defensive Value: How Good Are Navarro and Shoppach?

Trying to evaluate catcher defense is really, really, really tough. It used to be that evaluating defense as a whole was really tough to do, but advances like the Dewan +/- system and Ultimate Zone Rating (UZR) have changed that. Nowadays, the only defensive frontier left to be tackled is catching, but it's quite the doozy. Tom Tango, Bill James, Matt Klaassen, David Gassko, and countless others have done some great work on this subject, but at the moment we're still not able to statistically assess a catcher's defensive contributions as precisely as we can for other positions. The big problem is simply that there are so many variables to control for when looking at catchers. How do you separate what's the pitcher and what's the catcher? How do you quantify "framing"? What about the umpire? How big of an impact do passed balls have? What about caught stealings? It's...well, a lot.

Over this weekend, I had a bit of a brainstorm and figured I'd try to tackle catcher defense from a slightly different angle. I'm no statistician, so I can't do all the crazy number tricks that the likes of Tango, Lichtman, and James can, but I think I figured out a simple way to estimate a catcher's defensive value. It's crude and there are plenty of holes in it, but...well, I had fun with it. If you want to read the long version, feel free to check out my FanPosts at Beyond the Boxscore, but I'll summarize things below for all those who want the simplified version.

Let me start with a disclaimer: before I started this analysis, I'd done essentially no research whatsoever on all the current advances in quantifying catcher defense.  Take it for what it is: a fun little experiment.

All right, so, big picture time.  Imagine if you could graph the exact defensive ability of every major league baseball player over every season for the last 50 years.  What would that graph look like?  Well, it'd probably look something like a normal distribution (AKA: normal curve, bell curve, Gaussian distribution), right?  The majority of seasons would fall close to the average - within one or two standard deviations from the mean - with a smaller number of outlier seasons on both sides.  Determining what the average would be is easy enough (0 UZR, or a neutral defensive season) and if we wanted, the furthest limits on both ends could be easily established by looking back through historical UZR data.
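For a rough sense of what "within one or two standard deviations" means under a normal curve, here is a minimal sketch. It assumes an idealized normal distribution, which real defensive data will only approximate, and the helper name `share_within` is just something made up for illustration.

```python
import math

def share_within(k):
    """Fraction of a normal distribution within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))

# Share of seasons expected within one and two standard deviations of average,
# assuming defensive ability really is normally distributed.
for k in (1, 2):
    print(f"within {k} SD: {share_within(k):.1%}")
```

Under that assumption, roughly 68% of seasons land within one standard deviation of average and roughly 95% within two.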

Within that normal distribution, though, we'd have seasons from first basemen, second basemen, shortstops, catchers, etc.  We all know about Bill James' defensive spectrum, where he ranks the positions in order of increasing defensive difficulty (for a refresher: 1B, LF, RF, 3B, CF, 2B, SS, C).  With this in mind, what would the graphs look like if we broke defensive ability down by position?  Would these subsets of the normal defensive distribution look like normal distributions themselves, or would they vary slightly from position to position?

Going into the research, I hypothesized that the positions would all have slightly different distributions, with positions higher on the defensive spectrum having a higher average UZR score since defense is highly valued at those positions. My research didn't bear that out, though; using UZR data for all regular defensive players (minimum 500 innings played) from 2002-2009, I found that the distribution of fielding ability is actually pretty uniform across the positions. All positions had means around zero (average across all positions = 0.36 UZR) and the standard deviations were also fairly similar from position to position (average standard deviation = 8.4 UZR).
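The position-by-position comparison can be sketched roughly like this. The rows are made-up numbers, not real UZR data, and the function name `position_summary` and the (position, innings, UZR) layout are assumptions for illustration; the only thing taken from the article is the 500-inning minimum.

```python
from collections import defaultdict
from statistics import mean, stdev

# Hypothetical (position, innings, UZR) rows; made-up numbers, not real data.
seasons = [
    ("SS", 1200.0, 6.1),
    ("SS", 900.0, -4.3),
    ("SS", 350.0, 12.0),   # dropped below: under the 500-inning minimum
    ("1B", 1100.0, -2.0),
    ("1B", 800.0, 3.5),
    ("1B", 650.0, -7.2),
]

MIN_INNINGS = 500  # the cutoff for "regular" defenders used in the article

def position_summary(rows, min_innings=MIN_INNINGS):
    """Return {position: (mean UZR, sample SD of UZR)} for qualifying seasons."""
    by_pos = defaultdict(list)
    for pos, innings, uzr in rows:
        if innings >= min_innings:
            by_pos[pos].append(uzr)
    # A standard deviation needs at least two seasons at a position.
    return {pos: (mean(v), stdev(v)) for pos, v in by_pos.items() if len(v) >= 2}

summary = position_summary(seasons)
for pos, (m, s) in sorted(summary.items()):
    print(f"{pos}: mean={m:.2f}, sd={s:.2f}")
```

With real data you would compare the per-position means and standard deviations side by side to see whether they cluster the way the article describes.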

Here's where my research takes a couple leaps of faith.  After concluding that all positions have similar defensive means and standard deviations, I decided that it was only logical to then assume that catching would follow the same rules.  Or at least, I figured that without any data to prove one way or the other that catching is inherently different than other positions*, the best estimate we can make is by using the information we already have from the other positions.  Also, playing the outfield requires much different skills than does playing the infield - different reaction time, different arm requirements, different footwork, different positioning, etc. - and all of those positions have comparable means and spreads in UZR scores.  So I am making an assumption here, but I think it's one that's not too far-fetched at least. 

* Well, I've now done some research on defensive catching data and it seems that researchers do believe the spread of defensive scores for catchers is different from that of the other positions. Most seem to think it's about half the spread of the other positions, so about 20 runs separating the worst and best catcher in the league. For now, though, let's continue with my assumption and I'll come back and address this point at the end.

Anyway, the rest of the work is really simple from this point on. Now that we have a UZR distribution with a mean and standard deviation decided upon for catchers (mean = 0.34; standard deviation = 8.4), all we need is a ranking of catchers to plug into the model. Tango's Fan Scouting Report does the job perfectly. Below you'll see the 40 catchers from last season with more than 500 innings behind the plate, as ranked by the Fan Scouting Report (the "Value" column). The colors correspond to how many standard deviations from the Fan Scouting Report mean each catcher falls, with green signifying above the mean and red below the mean. I then translated those values into UZR scores. In other words, if a player is one standard deviation above the Fan Scouting Report mean, I gave them a value of 8.4 UZR - one standard deviation above the UZR mean. Anyway, here are the results:

If you think the actual distribution of catching scores is smaller than this, then simply divide the UZR scores in half.  We can then express a player's defensive contributions as a range, like "Navarro was most likely a -3 to -6 fielder behind the plate last season" or "Shoppach was most likely a -4 to -8 fielder last year."
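Here is a minimal sketch of the translation step and the halved range, under a few stated assumptions. The catcher names and Fan Scouting Report values are made up, the function names `fsr_to_uzr` and `uzr_range` are hypothetical, the sample standard deviation is an arbitrary choice, and the UZR mean defaults to zero so that one standard deviation maps to exactly 8.4 as in the worked example above (pass `uzr_mean=0.34` to add the small offset).

```python
from statistics import mean, stdev

UZR_SD = 8.4  # spread taken from the article

def fsr_to_uzr(fsr_scores, uzr_mean=0.0, uzr_sd=UZR_SD):
    """Map each FSR score to a UZR estimate with the same z-score."""
    values = list(fsr_scores.values())
    mu, sigma = mean(values), stdev(values)
    return {name: uzr_mean + uzr_sd * (score - mu) / sigma
            for name, score in fsr_scores.items()}

def uzr_range(full_spread_uzr, shrink=0.5):
    """Return (low, high) bounded by the full-spread and shrunken estimates."""
    low, high = sorted((full_spread_uzr, full_spread_uzr * shrink))
    return low, high

# Made-up FSR values for three hypothetical catchers (not real data).
fsr = {"Catcher A": 60.0, "Catcher B": 50.0, "Catcher C": 40.0}
estimates = fsr_to_uzr(fsr)
for name, uzr in estimates.items():
    low, high = uzr_range(uzr)
    print(f"{name}: {uzr:+.1f} UZR full spread, range {low:+.1f} to {high:+.1f}")
```

A full-spread estimate of -6 turns into the range -6 to -3 this way, which is how the Navarro range above reads; the `shrink` factor of 0.5 is just the "about half the spread" figure and could be swapped for whatever spread you find more believable.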

Obviously this method has its weak points, but like I said, it was merely a fun exercise on my part and it should be used as a rough estimate at most. Don't use this to conclusively say that Navarro and Shoppach are horrible defensive catchers and should be run out of town; this is only one year of data, it relies on fan scouting reports, and it's very inexact. Take this information for what it is: fun data that doesn't necessarily mean much.

Comments

Nice work, Steve

I’d done similar stuff with the FSR before, but with catchers I seemed to get more unbelievable scores than with other positions - I remember for one season (I don’t remember which, and I’m too busy/lazy to get my spreadsheet up at the moment), Mauer was a +30 - and that was after I did some sort of fractional thing, I think. Maybe I’m remembering things wrongly, but all due respect to Joe Mauer…

We really need the FSR for catchers, especially. Well, we really need it for all fielders (I’d love it if Tango started doing it for pitching and hitting, too, but that’s another topic), given the current state of defensive statistics. One thing I would say is that the FSR might best be considered as a judge of “true talent” - not only something to regress our “objective” measures to in the absence of more data, but also to regress players to in order to project them. Tango’s version of similarity scores is helpful in that regard as well.

Anyway, just some random thoughts. Nice job.

I'm not a sabermetrician, but I do play one at FanGraphs.

Can't get enough of me? Check out my Twitter feed.

by Matt Klaassen on Mar 10, 2010 7:38 PM EST

Yeah, that definitely makes sense.

I think the logical next step for me after this article would be to look at the Fan Scouting Reports for the other positions, extrapolate UZR scores like above, and see how well they match up with actual UZR scores. I’ll have to get to it at some point soon. If you have links to some of your old articles, I’d love to check them out.

The FSR is pretty awesome. Besides voting myself, this is the first time I’ve spent any time looking at the data and it’s quite impressive stuff.

I love Casey Fossum. Now try and take me seriously.

by Steve Slowinski on Mar 10, 2010 7:59 PM EST

So I just ran the correlations through for 2B

It’s only one position, but there was a .52 correlation between the UZR scores predicted by the FSR and actual UZR scores. There were some big misses, but some that were right on. Not too bad overall, though, I’d say.

I love Casey Fossum. Now try and take me seriously.

by Steve Slowinski on Mar 10, 2010 10:11 PM EST

And so if we should view the FSR as an indicator of "true talent"

Then even if it disagrees with UZR over one season of data, it should be accurate over a longer scope. Me thinks I’m going to have something to keep me busy over the next week….

I love Casey Fossum. Now try and take me seriously.

by Steve Slowinski on Mar 11, 2010 1:15 AM EST

Rec'd. Awesome.

"Sure, because of the "cold weather" and rain." More bait and switch tactics by the New York owners of this team." --NikoHoullis, the lead blogger at Buc'em on racial and anti-semitic insensitivity.

by kericr on Mar 10, 2010 9:21 PM EST

Fun read Steve

Follow Me on Twitter @FreeZorilla

by FreeZorilla on Mar 10, 2010 9:13 PM EST

Good stuff!

Keep the catcher stuff coming!

from Cubs Stats and Twitter @BradleyWoodrum

by BWoodrum on Mar 10, 2010 9:35 PM EST

cERA & SIERA - Metric Homophones

Given the disdain shown towards both, can we acknowledge the stat community suffers from a case of homophonebia?

Follow Me on Twitter @FreeZorilla

by FreeZorilla on Mar 10, 2010 9:39 PM EST

I find this way funnier than it probably should be...

Rec’d

I love Casey Fossum. Now try and take me seriously.

by Steve Slowinski on Mar 10, 2010 10:12 PM EST

Definitely a big fan of this

It’s at least a step in the right direction, and as we get more data, or more quantifiable scouting reports at least, these projections could only become more accurate, in my opinion

by Matt Slowinski on Mar 11, 2010 12:08 AM EST

Where is the Z-man?

rzar.wordpress.com
draysbay.com
raysprospects.com

by RZ on Mar 11, 2010 10:39 AM EST

#20

Follow Me on Twitter @FreeZorilla

by FreeZorilla on Mar 11, 2010 10:43 AM EST

The fans underrate him

Most of those other catching metrics that use game events like wild pitches, passed balls, pitch f/x, etc., have him ranked pretty high.

rzar.wordpress.com
draysbay.com
raysprospects.com

by RZ on Mar 11, 2010 11:17 AM EST

Arm probably is overweighted in the fans' eyes

Could be another interesting study

Follow Me on Twitter @FreeZorilla

by FreeZorilla on Mar 11, 2010 11:21 AM EST

Henry Blanco FTMFW

"It's good to have a little cushion. But it's not going to be easy."

by Andy Hellicksonstine on Mar 11, 2010 11:55 AM EST

Steve, you should use the position-specific page here:
http://www.tangotiger.net/scout/index6.php?prim_fld_cd=2

I use different weights for each position. The main page you reference has one “neutral” set of weights.

by tangotiger on Mar 12, 2010 6:02 PM EST

Tracked down the weights:

http://www.insidethebook.com/ee/index.php/site/comments/2007_fans_scouting_report_results/

Comment 4 provides the weighting system used. This could lead to all sorts of fun analysis of moving players around, more so than WAR’s uniform positional adjustments

Follow Me on Twitter @FreeZorilla

by FreeZorilla on Mar 12, 2010 7:56 PM EST

Cool...thanks Tango

I’ll have to re-do it with the adjusted numbers and mess around a bit more with the data. What do you think, does the theory behind this idea make sense? If there’s anything else you think I should look into or try, I’d be willing to give it a shot.

I love Casey Fossum. Now try and take me seriously.

by Steve Slowinski on Mar 12, 2010 8:12 PM EST

Founded in 2005, DRaysBay is home to, "Progressive statistical analysis and reasoned argument."
