DRaysBay: An SB Nation Community

Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook

Estimating Catcher Defensive Value: How Good Are Navarro and Shoppach?

Trying to evaluate catcher defensive is really, really, really tough.  It used to be that evaluating defense as a whole was really tough to do, but advances like the Dewan +/- system and Ultimate Zone Rating (UZR) have changed that.  Nowadays, the only defensive frontier left to be tackled is catching, but it's quite the doozy.  Tom Tango, Bill James, Matt KlaassenDavid Gassko, and countless others have done some great work on this subject, but at the moment we're still not able to statistically assess a catcher's defensive contributions as specifically as we can other positions.  The big problem is simply that there are so many variables to control for when looking at catchers.  How do you separate what's the pitcher and what's the catcher?  How to you quantify "framing"?  What about the umpire?  How big of an impact do passed balls have?  What about caught stealings?  It's...well, a lot.

Over this weekend, I had a bit of a brainstorm and figured I'd try to tackle catcher defense from a slightly different angle.  I'm no statistician so I can't do all the crazy number tricks that the likes of Tango, Lichtmen, and James, but I think I figured out a simple way to estimate a catcher's defensive value.  It's crude and there are plenty of holes in it, but...well, I had fun with it.  If you want to read the long version, feel free to check out my FanPosts at Beyond the Boxscore, but I'll summarize things below for all those that want the simplified version.

Star-divide

Let me start with a disclaimer: before I started this analysis, I'd done essentially no research whatsoever on all the current advances in quantifying catcher defense.  Take it for what it is: a fun little experiment.

All right, so, big picture time.  Imagine if you could graph the exact defensive ability of every major league baseball player over every season for the last 50 years.  What would that graph look like?  Well, it'd probably look something like a normal distribution (AKA: normal curve, bell curve, Gaussian distribution), right?  The majority of seasons would fall close to the average - within one or two standard deviations from the mean - with a smaller number of outlier seasons on both sides.  Determining what the average would be is easy enough (0 UZR, or a neutral defensive season) and if we wanted, the furthest limits on both ends could be easily established by looking back through historical UZR data.

Within that normal distribution, though, we'd have seasons from first basemen, second basemen, shortstops, catchers, etc.  We all know about Bill James' defensive spectrum, where he ranks the positions in order of increasing defensive difficulty (for a refresher: 1B, LF, RF, 3B, CF, 2B, SS, C).  With this in mind, what would the graphs look like if we broke defensive ability down by position?  Would these subsets of the normal defensive distribution look like normal distributions themselves, or would they vary slightly from position to position?

Going into the research, I hypothesized that the positions would all have slightly different distributions, with positions higher on the defensive spectrum having a higher average UZR score since defense is highly valued at those positions.  My research didn't hold that out, though; using UZR data for all regular defensive players (minimum 500 innings played) from 2002-2009, I found that the distribution of fielding ability is actually pretty uniform across the positions.  All positions had means around zero (average across all positions = 0.36 UZR) and the standard deviations between positions was also fairly similar (average standard deviation = 8.4 UZR).

Here's where my research takes a couple leaps of faith.  After concluding that all positions have similar defensive means and standard deviations, I decided that it was only logical to then assume that catching would follow the same rules.  Or at least, I figured that without any data to prove one way or the other that catching is inherently different than other positions*, the best estimate we can make is by using the information we already have from the other positions.  Also, playing the outfield requires much different skills than does playing the infield - different reaction time, different arm requirements, different footwork, different positioning, etc. - and all of those positions have comparable means and spreads in UZR scores.  So I am making an assumption here, but I think it's one that's not too far-fetched at least. 

* Well, I've now done some research on defensive catching data and it seems that researchers do believe the spread of defensive scores is different from that of the other positions.  Most seem to think it's about half the spread as the other positions, so about 20 runs separating the worst and best catcher in the league.  For now, though, let's continue with my assumption and I'll come back and address this point at the end.

Anyway, the rest of the work is really simple from this point on.  Now that we have a UZR distribution with a mean and standard deviation decided upon for catchers (mean = 0.34; standard deviation = 8.4), all we need is a ranking of catchers to plug into the model.  Tango's Fan Scouting Report does the job perfectly.  Below you'll see the 40 catchers from last season with more than 500 innings behind the plate, as ranked by the Fan Scouting Report (the "Value" column).  The colors coordinate with how many standard deviations from the Fan Scouting Report mean they fall, with green signifying above the mean and red below the mean.  I then translated those values into UZR scores.  In other words, if a player is one standard deviation above the Fan Scouting Report mean, I then gave them a value of 8.4 UZR - one standard deviation above the UZR mean.  Anyway, here are the results:

If you think the actual distribution of catching scores is smaller than this, then simply divide the UZR scores in half.  We can then express a player's defensive contributions as a range, like "Navarro was most likely a -3 to -6 fielder behind the plate last season" or "Shoppach was most likely a -4 to -8 fielder last year."

Obviously this method has its failing points, but like I said, it was merely a fun exercise on my part and it should only be used at the most as a rough estimate.  Don't use this to conclusively say that Navarro and Shoppach are horrible defensive catchers and should be run out of town; this is only one year of data, it relies on fan scouting reports, and it's very inexact.  Take this information for what it is: fun data that doesn't necessarily mean much.

3 recs  |  Comment 25 comments |

Story-email Email Printer Print

Comments

Display:

Nice work, Steve

I’d done similar stuff with the FSR before, but with catchers, I was seemed to get more unbelievable scores than with other positions — I remember for one season (I don’t remember which, and I’m too busy/lazy to get my spreadsheet up at the moment), Mauer was a +30 — and that was after I did some sort of fractional thing, I think. Maybe I’m remembering things wrongly, but all due respect to Joe Mauer…

We really need the FSR for catchers, especially. Well, we really need if for all fielders (I’d love it if Tango started doing it for pitching and hitting, too, but that’s another topic), given current state of defensive statistics. One thing I would say is that the FSR might best be considered as a judge of “true talent” — not only something to regress our "objective’ measures to in absence of more data, but also to regress players to in order to project them. Tango’s version of similarity scores is helpful in that regard as well.

Anyway, just some random thoughts. Nice job.

I'm not a sabermetrician, but I do play one at FanGraphs.

Can't get enough of me? Check out my Twitter feed.

by Matt Klaassen on Mar 10, 2010 7:38 PM EST reply actions  

Yeah, that definitely makes sense.

I think the logical next step for me after this article would be to look at the Fan Scouting Reports for the other positions, extrapolate UZR scores like above, and see how well they match up with actual UZR scores. I’ll have to get to it at some point soon. If you have links to some of your old articles, I’d love to check them out.

The FSR is pretty awesome. Besides for voting myself, this is the first time I’ve spent looking at the data and it’s quite impressive stuff.

I love Casey Fossum. Now try and take me seriously.

by Steve Slowinski on Mar 10, 2010 7:59 PM EST up reply actions  

So I just ran the correlations through for 2B

It’s only one position, but there was a .52 correlation between the UZR scores predicted by the FSR and actual UZR scores. There were some big misses, but some that were right on. Not too bad overall, though, I’d say.

I love Casey Fossum. Now try and take me seriously.

by Steve Slowinski on Mar 10, 2010 10:11 PM EST up reply actions  

And so if we should view the FSR as an indicator of "true talent"

Then even if it disagrees with UZR over one season of data, it should be accurate over a longer scope. Me thinks I’m going to have something to keep me busy over the next week….

I love Casey Fossum. Now try and take me seriously.

by Steve Slowinski on Mar 11, 2010 1:15 AM EST up reply actions  

Rec'd. Awesome.

"Sure, because of the "cold weather" and rain." More bait and switch tactics by the New York owners of this team." --NikoHoullis, the lead blogger at Buc'em on racial and anti-semitic insensitivity.

by kericr on Mar 10, 2010 9:21 PM EST up reply actions  

Fun read Steve

Follow Me on Twitter @FreeZorilla

by FreeZorilla on Mar 10, 2010 9:13 PM EST reply actions  

Good stuff!

Keep the catcher stuff coming!

from Cubs Stats and Twitter @BradleyWoodrum

by B Ray on Mar 10, 2010 9:35 PM EST reply actions  

cERA & SIERA - Metric Homophones

Given the disdain shown towards both, can we acknowledge the stat community suffers from a case of homophonebia?

Follow Me on Twitter @FreeZorilla

by FreeZorilla on Mar 10, 2010 9:39 PM EST reply actions   1 recs

I find this way funnier than it probably should be...

Rec’d

I love Casey Fossum. Now try and take me seriously.

by Steve Slowinski on Mar 10, 2010 10:12 PM EST up reply actions  

Definitely a big fan of this

It’s at least a step in the right direction, and as we get more data, or more quantifiable scouting reports at least, these projections could only become more accurate, in my opinion

by mslowins on Mar 11, 2010 12:08 AM EST reply actions  

Where is the Z-man?

rzar.wordpress.com
draysbay.com
raysprospects.com

by RZ on Mar 11, 2010 10:39 AM EST reply actions  

#20

Follow Me on Twitter @FreeZorilla

by FreeZorilla on Mar 11, 2010 10:43 AM EST up reply actions  

The fans underrate him

Most of those other catching metrics that use game events like wild pitches, passed balls, pitch f/x, etc., have him ranked pretty high.

rzar.wordpress.com
draysbay.com
raysprospects.com

by RZ on Mar 11, 2010 11:17 AM EST up reply actions  

Arm probably is overweighted in the fans eye

Could be another interesting study

Follow Me on Twitter @FreeZorilla

by FreeZorilla on Mar 11, 2010 11:21 AM EST up reply actions  

Henry Blanco FTMFW

"It's good to have a little cushion. But it's not going to be easy."

by Andy Hellicksonstine on Mar 11, 2010 11:55 AM EST reply actions  

Steve, you should use the position-specific page here:
http://www.tangotiger.net/scout/index6.php?prim_fld_cd=2

I use different weights for each position. The main page you reference has one “neutral” set of weights.

by tangotiger on Mar 12, 2010 6:02 PM EST reply actions  

Tracked down the weights:

http://www.insidethebook.com/ee/index.php/site/comments/2007_fans_scouting_report_results/

Comment 4 provides the weighting system used. This could lead to all sorts of fun analysis of moving players around, more so than WAR’s uniform positional adjustments

Follow Me on Twitter @FreeZorilla

by FreeZorilla on Mar 12, 2010 7:56 PM EST up reply actions  

Cool...thanks Tango

I’ll have to re-do it with the adjusted numbers and mess around a bit more with the data. What do you think, does the theory behind this idea make sense? If there’s anything else you think I should look into or try, I’d be willing to give it a shot.

I love Casey Fossum. Now try and take me seriously.

by Steve Slowinski on Mar 12, 2010 8:12 PM EST up reply actions  

Comments For This Post Are Closed


User Tools

Founded in 2005. DRaysBay is home to "Progressive statistical analysis and reasoned argument."
Start posting about the Rays »

Join SB Nation and dive into communities focused on all your favorite teams.

Connect_with_facebook

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Converse5vp3_small
The Trade Deadline Thread
Zorilla_small
A Fanpost on the Clutch Stat

Recent FanPosts

Andy_samberg_jimp_small
Jeremy Hellickson to make his Major League debut on Monday versus the Twins
4287_559112511892_1101386_33047121_2807872_n_small
Hanley for Hellickson, Brignac, Moore, & Barnese?
Charzissou_small
OTTOTD 7/30/10: The Poor Matthew Hall Edition
Rays_small
Carlos Pena Breaks Rays' All-Time HBP Record
Mod_target_small
7/29/2010 OTTOTD We're All Suckers!
Small
TRADE TARGET: CRAIG BRESLOW
Jamesshields_small
OTTOTD: We are all Gentle Path A-Listers
Small
Manny Ramirez as Rays DH
Mod_target_small
7/27/10 OTTOTD: Shoop da woop

+ New FanPost All FanPosts >

FanShots

Quick hits of video, photos, quotes, chats, links and lists that you find around the web.

Recent FanShots

Rays acquire Chad Qualls for a PTBNL according to...
I mean... isn't this the Rays' problem right here? A guy cheering Matt Joyce's clutch home run in a freakin' Yankees T-shirt. YOU FAIL, SIR.
Adam Dunn said he would be comfortable becoming a DH for the rest of this...
Very Rich Guy to Buy Yankees
Red Sox sell-out streak revealed as scam
Garza: "Becoming the first Rays player ever to pitch a no-hitter has been a great experience.  I really couldn’t have done it without the support from my team.  To thank them for their nine innings of hard work, I decided to give them a personalized embroidered bag and bottle of Crown Royal Black.  That way, we can all celebrate together when enjoying the new whisky."

Classy.
Hellickson pulled after three scoreless innings
Nice outing by Matt Moore against the Yankees
Desmond Jennings now the latest Boras client
Selig: 2011 MLB season to begin sooner, end earlier

+ New FanShot All FanShots >

SBNation.com Recent Stories

ST. LOUIS - MAY 18:  Ryan Ludwick #47 of the St. Louis Cardinals rounds third base after hitting a game-winning homerun against the Washington Nationals at Busch Stadium on May 18, 2010 in St. Louis, Missouri.  The Cardinals beat the Nationals 3-2.  (Photo by Dilip Vishwanat/Getty Images) +3 updates

Padres, Cardinals, Indians Complete Three-Way Trade Involving Ryan Ludwick, Jake Westbrook

SEATTLE - JULY 08:  Alex Rodriguez #13 of the New York Yankees hits an RBI single in the ninth inning to give the Yankees a 3-1 lead against the Seattle Mariners at Safeco Field on July 8 2010 in Seattle Washington. (Photo by Otto Greule Jr/Getty Images) +15 updates

Yankees' 9th-Inning Win Completely Overshadowed By A-Rod's Ongoing Homer Drought

Colorado Rockies' Carlos Gonzalez connects on a triple against the Chicago Cubs in the third inning of a baseball game at Coors Field in Denver, Colo. on Saturday, July 31, 2010.  (AP Photo/ Matt McClain) link

Carlos Gonzalez Completes Cycle With Walkoff Homer; Rockies Beat Cubs, 6-5

More from SBNation.com >


Baseball Operations

Rays_small Steve Slowinski

Big_pun--300x300_small Tommy Rancel

Zorilla_small FreeZorilla

Price_small Erik Hahmann

Pro Scouting

P6090001_small mslowins

Player Development

52376727_small rglass44

Flying-car-m400_small RZ

Small PGP