RECENT UPDATES

Around the Bases

BASEBALL PROSPECTUS Author Page

BP Hit List

BP: Grumpy Old Men: JAWS Tackles the VC Ballot

BP: Hall of Fame Class of 2007 Infielders, Outfielders, Pitchers

BP: Chat w/ JJ

BP: The I [Heart] New York Matchup

BP: Being Tony La Russa

BP: NLCS Preview

NY SUN: Unraveling of the World Champions

ESPN Page 2: Schilling, Smoltz: In or Out?

___________ THE ROSTER

 

MORE
SPONSORED
LINKS
Your Ad Could Be Here!
All contents of this web site © Jay Jaffe, 2001-2007 except where indicated. Please contact me for any questions or comments regarding this site.

    A R O U N D   T H E   B A S E S

 
Published via Blogger • Comments via YACCS • Counting via

Weekly archives • Contact jay@futilityinfielder.comRSS Feed

AVG/OBP/SLG unless otherwise indicated • Advanced statistical glossary

Friday, January 16, 2004

Whitey Ford Sings the Blues 

On the heels of
my dustup with some hired goons, two of my loyal readers added some input regarding my Whitey Ford comments which is worth addressing. Studes, a.k.a Dave Studenmund of the fine Baseball Graphs site, mentioned some information in Bill James' Baseball Managers book in which the bearded one stated that Casey Stengel used Whitey more frequently against good opponents, while Johnny C mentioned a Rob Neyer comment about the AL's dearth of black players during the Yankees' heyday (1949-1964).

I'm a big fan of the James Managers book; I enjoy it more than the New Bill James Historical Abstract. The part to which Studes refers is on page 192; essentially it says that during the Stengel years (1953-1960 in Ford's case), Whitey had a lot more decisions against the good teams in the AL (usually Chicago and Cleveland) than the bad ones, and that his winning percentage during that time, already a stellar .681, should have been higher if the distribution of opponents was more proportional. He is almost certainly onto something there, even if he overemphasizes the pitcher's Won-Loss record rather than his outstanding ERAs (something James still tends to do).

In the Neyer piece to which Johnny C. refers, Rob is pointing out similar effects but in the opposite direction on the careers of Lefty Grove (who faced the Yankees less frequently than expected) and Warren Spahn (same, except it's the Dodgers). He doesn't refer to Ford; rather it's a more general quote about the Yankees: "I did know that when the Yankees were running roughshod over the American League from 1949 through 1964, a great majority of the great black players were in the National League, which perhaps should give us pause when considering the true greatness of those Yankees."

Recall that the Yanks didn't get a black player until Elston Howard in 1955, eight years after Jackie Robinson, and Howard was the first black AL MVP in 1963, by which time Jackie, Roy Campanella, Willie Mays, Don Newcombe, Hank Aaron, Ernie Banks, Frank Robinson and Maury Wills had combined to win eleven NL MVP awards. Exactly as Rob said it, a great majority of great black players were in the NL. For what it's worth, the two teams Ford faced most often had, in Minnie Minoso (White Sox) and Larry Doby (Indians), two of the top black stars in the league during the time. But I suspect that the two effects my readers pointed out, both involving the levels of competition Whitey faced, essentially cancel each other out.

Actually, there's a way we can look even more closely at this. The Davenport system upon which I based my Hall study contains three formulas for Wins Above Replacement. WARP1 is adjusted for a single season, while WARP3 is adjusted for all time, using the level of competition as one of its factors (park effects, league scoring levels and schedule length are others). A bit of what "comes out in the wash" historically can be seen in the way the two totals compare. For Whitey's career (1950, 1953-1967), he's got 98.7 WARP1 and 95.5 WARP3, meaning his contributions are devalued about 3.3 percent historically over the course of his career. A Hall contemporary whose career is essentially contiguous, Robin Roberts (1948-1966), spent all but three years in the NL; his WARP1 and WARP3 totals are 129.3 and 129.5, meaning his contributions are undervalued by 0.2 percent. If that's not convincing enough, I recalculated using only the seasons for which they exactly overlap and were in opposite leagues, and -- allowing for the fact that we may be losing a bit with rounding, since I'm just pulling numbers off of the page rather than going from Clay Davenport's database -- Ford is "overvalued" by 3.0 percent while Roberts is "undervalued" by 1.6 percent. Neither of these are uncommon; I suspect if you look at the Hall of Famers, you'd find the 20th century ones are mostly within 5% one way or another, with the 19th century ones and the ones from the more extreme eras of baseball history, scoring-wise -- the late '30s and early '60s -- changing the most.

I should stress that I am not, by my assertions, trying to take away from Ford's accomplishments. The man was a fantastic pitcher who accomplished a hell of a lot, and the Davenport system doesn't even attempt to account for some of those accomplishments -- the World Series stuff, his performance against key opponents, his hardware, etc. It's a macro system, not a micro system, and I'm trying to use it to value groups of seasons in a manner that Bill James does with Win Shares. The reasons I turned to this are that I think Win Shares makes a few pretty big E-6's, such as when it comes to the concept of replacement level or the reliance on pitcher W-L and S totals to measure their value.

Based on my system (which itself is based on Clay's system), Whitey is quantifiably a few hairs below the average Hall of Famer. Not enough that we should take up a petition to vote him out -- I can point to at least 15 guys who we should -- and it doesn't mean he wasn't a great ballplayer. But even if I had defined peak a bit differently -- say, for five best overall seasons rather than consecutive -- he would still probably come out below average, because he simply never achieved the single-season levels that a lot of other HOFers did. That said, were he a player currently on the ballot, I would see that he's close to average, consider his success in postseason play, and write him down without hesitation.

Looking at it from a more traditional angle, Whitey won 20 games or more only twice. I guarantee you THAT is below average for a Hall of Fame pitcher, especially one pitching in the era he did. Does it matter? Not much in the grand scheme of things, but by the "system" I just devised to suit this example, I can say, "He's below average for a Hall of Fame pitcher."

Well, that's a pretty crappy system right there, but I hope that the one I built to answer the questions I had about the 2004 ballot is at least a bit better -- certainly a lot more thought and time went into it. If one of the measurements it takes tells us Whitey comes up a little below average, does that mean the system is useless? I hope not. I've given a very qualified answer within a certain context, and the statement, taken away from its context, might look foolish. But within the context, it's a statistical fact, at least until Clay's system takes its next step forward and we can look again.

People are extremely resistant to accept new statistical systems when they tell us things which contradict what we think we know. Look at the case to be made against Derek Jeter's fielding via any one of several advanced measures, and at how resistant a certain segment of Yankee fans and mainstream baseball people are to accepting something which is by now as statistically obvious as the nose on Joe Torre's face. Look at the ridiculous scrums and bench-clearing brawls which plague baseball discussion sites across the Internet. People get very defensive about this stuff. I don't blame them, but I've come around to the other side on a lot of what I held dear, even with regards to the Hall of Fame ballot; in the past I've voted for Tommy John, Jack Morris, and Andre Dawson. Given the research I've done, I don't think I'd vote for any of them again, unless somebody can show me a better system which says I should do otherwise.

* * *

In more things Prospectus-related, I was interviewed on Wednesday by Will Carroll for Baseball Prospectus Radio, a syndicated program which probably isn't available in your market but which you can hear over the Internet. Will and I talked for about eight minutes, mostly on the subject of my Hall of Fame pieces and then a bit on the Yankees. The spot will air on Saturday, and though you'll probably be sleeping while it airs, rest assured that I'll link to it once it's added to the archives.

Still on the Prospectus topic, B-Pro's Joe Sheehan was interviewed by Rich Lederer last weekend, while over on Baseball Interactive (a site where I sometimes contribute), Gary Huckabay was interviewed by John Strubel. Both are lengthy interviews which do a good job of getting the Pros -- two of the top analysts in the field -- to talk a bit more about their company's philosophies, its history and its direction. Good stuff.

* * *

A few more random thoughts which might have become columns if I had fewer things on my plate:

* Yankee fans and media whining about Roger Clemens' departure or branding him a traitor should get a friggin' life. We all knew what we were getting when he arrived in the Bronx, and we shouldn't be surprised that a) he prefers living in Texas; and b) he still wants to pitch. The Rocket was a free agent, the circumstances for his return didn't exist at the time the Yanks had an option to persuade him to do so; and anyway, he'd accomplished everything he wanted to here. He owed the Yankees nothing, they owed him nothing, and the circumstances of his tenure in Houston are much different than they would have been here. To paraphrase Simpsons' bartender Moe Szyslak, I'm a well-wisher in that I don't wish him any specific harm.

* Pete Rose spent 14 years deceiving the American public, not to mention himself, when it came to gambling on the game of baseball. Here's a suggestion: he can only be reinstated after 14 years of penance. Being banned from the game while carrying out a lie DOES NOT count as penance. Maybe if he pledged to do something along those lines, writers and peers wouldn't be tripping over each other to back away from him more quickly. Chickens -> home -> roost.

* This story, which has nothing to do with baseball, is amazing and touching. This one, with photos, is hilariously bizarre. With friends like those...
--posted by Jay at 2:27 PM LINK

Thursday, January 15, 2004

The Dangeous World of Sabermetrics 

"Blame me if you must, but don't ever speak ill of the Program! The Program is rock solid! The Program is sound!"-- Homer J. Simpson

My pal and frequent sparring partner in baseball arguments -- I'll call him Nick because he hates Nicholas, Nikolai and Nickypoo -- took exception to one of my assertions in
yesterday's BP piece on Hall of Fame pitchers, and I thought our back-and-forth on the topic would provide an interesting glimpse into the idea behind what I've done.

With the Hall's honor sullied by several bad decisions, mostly at the hands of the now-reconstituted Veterans' Committee, the stated purpose of my mission was to produce a system by which we could evaluate the existing standards of the Hall using advanced performance metrics. We would then apply those standards to the candidates on the ballot, and those above average, the ones who would raise the standards of the Hall, would get the vote on our quasi-ballot. Over the course of two articles and some 10,000 words, I came up with four hitters and four pitchers that did so.

As I did for the hitters, I used BP's Davenport Cards to quantify the average Hall of Fame pitcher in the currencies the Davenport uses, Wins Above Replacement Player (WARP, in this case version 3, which is for use in cross-era comparisons), Pitching Runs Above Replacement (PRAR) and Pitching Runs Above Average (PRAA). I filled in a spreadsheet with each pitcher's career totals, and I also calculated their five-year WARP3 peak (PEAK) and came up with a weighted score (WPWT) that's just an average of the peak and career totals. Basically, we're crediting a player's best seasons twice so that the greatest ones float to the top, and the good ones who just stuck around forever are placed appropriately. What I came up with was a line which showed the average Hall of Fame pitcher's credentials in these advanced statistical categories:
           PRAA   PRAR   WARP3   PEAK   WPWT

AVG HOF SP 239 1002 97.0 44.9 70.9
Okay, that doesn't mean much sitting by itself, but it will soon. Iin a comment on Jack Morris, whose candidacy I once supported, I made the following comment: "Davenport-wise, Morris would be a below-average Hall of Famer, one who's in the same cluster as the elected [Whitey] Ford and [Jim] Bunning, as well as candidates [Dennis] Martinez and [Jimmy] Key."

Nick, a consummate Yankees fan, took issue with this perceived slight of ol' Whitey and wrote to ask whether I'd like a knuckle sandwich. He also asked, "How could a guy with a career ERA+ of 132 be in the same ballpark as a pitcher with a 106 career ERA+. Ford a below average Hall of Famer?"

That's a very good question, and to answer it, here's another chart showing the five pitchers' lines, along with that average again.
           PRAA   PRAR   WARP3   PEAK   WPWT

AVG HOF SP 239 1002 97.0 44.9 70.9
--------------------------------------------
Ford 238 994 95.5 36.5 66.0
Martinez 38 939 94.5 34.7 64.6
Morris 27 916 90.2 38.8 64.5
Bunning 161 1020 86.8 39.3 63.1
Key 195 817 87.4 37.7 62.6
In every category above, Ford is below the average -- microscopically so in the two run-total categories, and within a few whisker on career value, but substantially below in terms of his peak -- nearly two wins above replacement a year. I made a comment in the piece:

Whitey's usage patterns were greatly affected by pitching for the Yankees, who would often shut him down in September to keep him from racking up gaudy win totals (lest they have to pay him more) and to preserve his arm for the World Series, where he was the master, holding many important Series pitching records.

So how could Morris be so close to Ford despite an ERA+ that's 25 percent worse relative to the league? The system emphasizes wins above REPLACEMENT level, which means AVERAGE has some value, and lots of innings at average has a good amount of value. Morris pitched about 700 more innings than Ford, which boosts his career above replacement upward, though it's not quite as high as the Chairman.

This didn't satisfy Nick too much, and about an hour later, two burly men arrived at my door, bearing lead pipes and brass knuckles and offering to "show me how it all adds up." I managed to fend them off long enough to craft this answer:
The Davenport system is designed to isolate the player's individual contributions from his team, making all kinds of adjustments. It adjusts for the level of offensive support a player received (by discarding the pitcher's actual W-L record), the quality of defense (neutralizing the effects of balls in play behind him), his park (which in Whitey's case was built to favor lefties), his era (which for Ford was low scoring, with high totals of innings pitched commonplace), and the extra-curricular things which boost his credentials (World Series rings, Cy Young awards, etc.). Within that context, a slightly larger chunk of what Whitey accomplished was due to his pinstriped teammates, relative to the ways other Hall of Famers were helped by their teammates.

The system sees Whitey's best seasons as worth 7 to 9 Wins Above Replacement. That's good, but it's low for a Hall of Famer -- 10's are superstar seasons, and Whitey never had one of those (which is highly unusual for a Hall of Famer), let alone multiples within a five-year span -- which would have boosted his peak score.

I am not saying that my method is the only means of evaluating a Hall of Famer, or even the best method of ranking Hall of Famers, just that it's a very worthwhile one for leveling the playing field so as to more clearly debate the merits of potential Hall of Famers.
Rocco Knuckles and Bobby Leadfingers (they told me their names) were suitably impressed, letting me off with a few firm punches in the solar plexus while shouting stuff like, "That's for disparaging the all-time leader in winning percentage for a lefty!" and "That's for saying Whitey's below average!" and "Pinstripe this!" The last thing I remember before passing out was a blow to the face accompanied by the phrase, "Count da rings, baby!"

The moral of the story: It's a dangerous business trying to get people to look at baseball statistics a new way.
--posted by Jay at 3:48 PM LINK

Wednesday, January 14, 2004

Mo' Pro 

My
second piece for Baseball Prospectus is up, this one analyzing the pitchers on the 2004 Hall of Fame ballot. It's another epic -- I had 15 pitchers to evaluate, after all -- and this time it's part of the site's premium content, so you need to be a subscriber to read it. Which ought to be reason enough to shell out if you haven't already, right? Here's the intro:
The Baseball Writers of America's standards on what constitute a Hall of Fame pitcher are in a curious spot now, both when it comes to starters and relievers. Spoiled by a group of contemporaries who won 300 games from the mid-'60s to the mid-'80s (Tom Seaver, Steve Carlton, Gaylord Perry, Don Sutton, Nolan Ryan, Phil Niekro), the writers haven't elected a non-300-winning starter since Fergie Jenkins in 1991. That Perry, Sutton and Niekro took a combined 13 ballots to reach the Hall while Ryan waltzed in on his first ballot with the all-time highest percentage of votes is even more puzzling. Apparently what impresses the BBWAA can be summarized as "Just Wins, Baby" -- which is bad news for every active pitcher this side of Roger Clemens and Greg Maddux.

Of the 59 enshrined pitchers with major-league experience, only two of them -- Hoyt Wilhelm and Rollie Fingers -- are in Cooperstown for what they accomplished as relievers. While the standards for starters are somewhat easy to discern (if lately a bit unrealistic), the growing number of quality relievers on the ballot, the continuous evolution of the relief role, and the paucity of standards to measure them by present some interesting challenges to voters.

If there's an area in which performance analysis has struggled mightily against mainstream baseball thought, it's in hammering home the concept that the pitcher doesn't have as much control over the outcome of ballgames -- as reflected in his Won-Loss totals -- or even individual at-bats -- hits on balls
in play -- as he's generally given credit for. Good run support and good defense can make big winners of mediocre pitchers on good teams, and .500 pitchers of good hurlers on mediocre teams. As such, it's important to examine the things over which a pitcher has control and account for those he does not. Once again, the Davenport system rides to the rescue.
A hint: the system found four pitchers worthy of a vote -- and not the four you'd expect. No, no, no, not Bob Tewksbury. And alas, not my personal favorite, Fernando Valenzuela.

My jaw still aches from smiling over my BP debut last week, and I'm still answering emails related to it. Again, I'm thrilled to have my work in such hallowed company.
--posted by Jay at 11:59 AM LINK

Monday, January 12, 2004

Burn Seligula Burn 

An
L.A. Times article by Bill Shaikin about the Angels' surprising signing of Vlad Guerrero buried this quote fairly deep:
With major league owners scheduled to vote this month on whether to approve the bid of would-be Dodger buyer Frank McCourt, a source said Sunday that McCourt asked Commissioner Bud Selig whether some owners might vote against him if he spent freely to acquire Guerrero yet presented a financing package heavily dependent on loans. Selig offered no assurances, the source said, and McCourt sent word to General Manager Dan Evans to cease talks with Guerrero.
My man on the L.A. beat, Jon Weisman, has been pleading for weeks to put Guerrero in Dodger blue, and he writes that the deal casts aspersions on McCourt's ability to afford the team: "[W]e're supposed to believe or accept that, amid a market correction for player salaries, Vladimir Guerrero's is the one that would drive McCourt out of business?... If McCourt can't afford to sign Guerrero with the Dodger payroll already on the Atkins diet, he can't afford to buy the Dodgers. Period."

While I share Jon's concern that the team may be bought by an owner who won't spend the appropriate money to make the Dodgers competitive, I see a far more ominous cloud over this. Namely, that Selig effectively blackmailed McCourt out of pursuing Vlad in exchange for his blessing regarding the Dodger sale -- we all know that behind the scenes, Bud can orchestrate the other owners' yea or nay on this.

I want to see sombody investigate this as further evidence of collusion or at least a foul Seligulan shenanigan. As per our ongoing community-wide discussion on the growing influence of non-mainstream baseball writers, I think we can do our part in building a fire that will make Bud sweat this one.
--posted by Jay at 12:49 PM LINK

Sunday, January 11, 2004

Coming Up Roses 

I'm hard at work on my Hall stuff and having Internet connection problems today (my girlfriend made me a sign to post on my monitor that reads: "DANGER X WARNING! DO NOT WRITE IN BLOGGER," with the X a skull and crossbones), but I decided to take a moment to respond to a Rose-related comment to my previous piece left by a reader named Joe. He writes:
I understand the concept behind WARP/RCAP/Win Shares/random advanced metric.

However, if you ignore the fact that Pete has 4000+ hits, is he really a Hall of Famer?

I mean, all of his rate stats are pretty ordinary:
1. The guy spent most of his career as a corner outfielder/firstbaseman (2266 games compared to 1262 @ 2B or 3B), yet managed a SLG only 10 points higher than the adjusted league average.
2. His SLG is the lowest among all players with 3000 hits by 20 full points.
3. Rose and Brock are the only members of the 3000 Hit Club that have an OBP under .400 and a SLG under .425, and we all know Lou's claim to fame.
4. Of all members in the 3000-Hit-Club, Pete ranks only #15 in OBP. The seven men with worse OBPs? They have averaged 336 HR, compared to Rose's 160. Hank Aaron is tied with Rose's OBP, and has 755 HR (those 755 were not included in that 336 average).
5. Pete has fewer SB than every man in that group, other than Ripken. Those seven men have averaged 326 SB with a 74% success rate, compared to Pete's 198 and 57%.
6. We're all aware that his AVG is "only" .303 -- in the bottom half of the 3K Hit Club.
7. Despite routinely being among league leaders in games played, AB, and plate appearances, Pete had more than 100 walks in a season only once, and more than 90 walks in a season only once more.

Pete holds the career records of hits, games, and at bats. Those are records that he will likely hold for years to come. He deserves those records: anyone who plays for 24 seasons should get appropriate credit.

I've run him through the Keltner List, and I think you'd be surprised at how poorly he fared. You can run him yourself, or I can e-mail you my results.

Simply, Pete Rose is NOT one of the greatest players to ever play the game. And isn't that what the Hall of Fame is for?
Nitpicking Rose's qualifications within the 3,000 hit club won't convince me of anything. Nobody achieves that level without being very, very good. Who cares about stolen bases in that company? He beat the pants off of Eddie Murray there, but so what?

I think the larger issue is your underestimation of Pete's ability to get on base. He was 44 points above the adjusted league average for his epic career. In a 17-season span, he led the league in the number of times on base nine times, was second five other times, and the three other years finished fourth, sixth, and ninth. Put together, that has a HUGE amount of value. And despite the low slugging percentage, he was 19 points above adjusted league average for his career. To expand a bit more...

(By the way, I'm not sure what your source on league-adjusted figures is; mine's
Baseball-Reference, which I prefer because I can't use the SBE and because it's freely available.)

Pete was not a power hitter, but he was a FANTASTIC leadoff hitter. In the Retrosheet era, which covers his career from '69 and thus leaves out four great seasons of his (which would only enhance those numbers if he were leading off then, but I don't know for sure that he was) but includes all of his decline phase, he posted a .386 OBP/.424 SG. He did this despite playing many years in a time period which included some of the lowest levels of offense in modern baseball history. Who led the league in OBP in the Year of the Pitcher? Pete. Who was the leadoff hitter for the Big Red Machine, when they were scoring 25% more runs than the league average? It wasn't Joe Morgan.

Despite the period, his rate stats are NOT ordinary. To turn our attention away from the BP suite of numbers, check his OPS+. For his career he has an OPS+ of 118, and he had a 12-year prime ('65-'76) which was even higher: 116 or better every single year in that span, better than 125 in nine of those 12, better than 130 in six of those. There is no doubt he was a big benefit to his offenses during that run; they led the league in scoring five times and were second in three of those years. What's the object of baseball? Scoring runs. Check his place on the all-time runs scored list: fifth all time, behind Rickey, Ty, Hank and the Babe, and ahead of Willie, Cap, the Man, Barry, and the Iron Horse. That is select company, my friend, and you don't get there without being great.

And that's a hell of a long prime; I'd have to study it more deeply, but I'll bet you there aren't more than a dozen people in baseball history who maintained such a high level for so long. Nine wins above replacement a year for 12 years is a hell of a building block to have for a team. Is it any wonder the man was part of so many winning teams? Add to this a defensive versatility which allowed his managers to shift him around the diamond in order to accomodate the needs of his teams, and you've got a great asset to any ballclub.

Rose's career overlapped with a number of fantastic ballplayers whose careers may have overshadowed his; Willie Mays, Hank Aaron, Frank Robinson, Joe Morgan, and Mike Schmidt come to mind among NL players. And those guys were probably better than him. But I think if you examine Rose's standing using methods that incorporate a player's value including defense (WARP, Win Shares, any others?), you'll find he still comes out among the all-time greats, unequivocally good enough for the Hall of Fame. Smelling like a Rose, so to speak.
--posted by Jay at 5:30 PM LINK

THE CATCH

Quote of
the Day

"One thing I've been blessed with this year is run support and good defense."
-- David Wells
That's two things, but who's counting?

• • •

Line of
the Week

Royals pitcher Albie Lopez:
.2 IP, 6 H, 7 R, 7 ER, 2 BB, 0 SO
That's a game ERA of 94.50

• • •

The New
David Justice?

Ruben Sierra's hitting .429/.474/.714 and the Yanks are 9-4 since "The Village Idiot" rejoined the Yanks on June 7.

• • •

THE SHELF
my rec's via Amazon.com

Reading:


Game Time,
by Roger Angell

Rob Neyer's Big Book of Baseball Lineups,
by Rob Neyer

Listening:

Let's Do Rocksteady: The Story of Rocksteady 1966-68