Category Archives: Baseball Musings
If you’re a data scientist, and you happen to enjoy baseball, then MLB DFS (daily fantasy sports) is the game for you. If you like parting ways with your money, then it’s doubly enjoyable.
There is no shortage of advice and access to statistics to help you in your never ending quest for the perfect lineup. You can absorb knowledge about stacks, chalkiness, fading, and other esoteric terminology. Experts will try to convince you that you must build lineups differently for cash games versus GPP tournaments. Silly me, I just thought I had to build a lineup that’s better than everyone else’s, and then beat them all. Kind of like real baseball. I wonder if Tito Francona fields a different lineup if he thinks he’s in a cash game or a GPP? I kind of doubt it.
Some of the most entertaining DFS advice can be found on Rotogrinders (www.rotogrinders.com). In addition to statistical models, statistical probabilities, and statistical statistics, there are podcasts and–on today’s topic—videos featuring past big winners sharing their strategies for the perfect lineup.
One such video, appearing a couple of times a week, is Beer’s MLB Six-Pack. Chris “Beermakersfan” Prince runs through tonight’s slate of ball games, choosing 5 players who, according to past performance and gut instinct, should rack up those fantasy points tonight. Then a sixth player is highlighted, the so-called “Beast of the Night.” This player, saved for the grand finale of the video, is the equivalent of the “best bet” of all the players in all the lineups in all the games. Chris’s recommendations are based on sound, empirical logic (what some of us still quaintly refer to as “sabermetrics” because it makes us sound profoundly smart) using the most recent statistics about every single aspect of a plate appearance.
So how reliable is Beer’s Six-Pack when it comes to building your lineup? Well, like everything else in DFS, reliability can be quantified. For the non-technical reader, that means “counting.” Let’s see how Chris does.
I conducted some extensive research (i.e. searched YouTube) for this season’s Beer videos. I had, of course, seen most of them before, coincidentally on the same day I was building my MLB lineup. Imagine that. I analyzed 19 days’ worth of videos, 114 separate recommendations (19 of whom were Beasts), across 5,060 players who had at least one plate appearance on those 19 days between April 3 and June 20.
Each day was analyzed separately. The statistic of interest was this: how many FanDuel fantasy points did the Beer recommendation players produce relative to the rest of the league? Did Chris choose the best 6 producers that night?
My methodology was to produce an ordinal rank each of the 6 recommendations relative to all of the other players in MLB lineups that day. Since the number of available players differed based on the number of scheduled games, weather, and other occasional weirdness, I normalized the rankings be converting them to a percentile.
So, for example, Chris recommended the Red Sox’s left fielder Andrew Benintendi on April 3, and he produced 25.7 FanDuel points. He went 1 for 4, that one hit being a home run that yielded 3 RBI. He also struck out once.
Benintendi’s 35.7 points put him in the 89th percentile of all 210 batters that night. Not bad! Yasmani Grandal, the Dodgers’ catcher, delivered a whopping 40.9 points, putting him in the 97th percentile. Here’s the analysis for April 3:
All in all, a good night for Beer (aren’t they all?). But like everything else in baseball, the true measure of greatness is the full body of work. Here are the results of all 19 games. The center line represents the median score for that day for all players, and the blue bars show the results of Chris’s selections that outperformed the median. The red bars show underperformers. The BEAST scores are on the far right.
Overall, Chris’s picks outperformed the league average 51% of the time. The 19 Beasts of the Night did marginally better, performing above average 57% of the time.
As a group, the Beasts scored about the same number of points as the league average, while the other 5 picks scored below that mark. But this is one of those cases where averages can be misleading. For instance, say you’re standing in a bucket of ice water with your head sticking in a hot oven. On average, you feel fine. That’s a data scientist joke.
Now is the time you ask, “so what?” What do you do with this precious information? Here’s my completely non-scientific hypothesis:
- Chris “Beermakersfan” Prince probably knows more than you when it comes to building DFS lineups
- You don’t even know what wOBA stands for, let alone wRC+.
- You have a sneaky suspicion you want to put Edwin Encarnacion in your lineup tonight, and you watch the Six-Pack show, and sure enough, he’s the Beast of the Night! You came upon that roster spot independently! And Chris agrees with you! What an intuition! What a perfect lineup! You’re a genius!
- The Indians are rained out tonight. Thanks for playin’.
The MLB All-Star game is coming up soon, so I thought I’d toss a few random analyses your way to commemorate the occasion. Here’s one…
So you want to be an All-Star, do ya? Then change your name to Rodriguez or Robinson. Here are the surnames of the top 250 All-Stars, by number of All-Star game selections, going back to the dawn of the All-Star game, in 1933 Chicago. Unfortunately, notable baseball fan Al Capone was probably not in attendance, since he had other commitments at the time in the Big House. But I digress. The bigger and bolder the name, the more someone with that name appeared in an All-Star uniform. This fine graphic represents the intersection of baseball and big data. For example, Robinson refers to the Orioles’ immortal third baseman, Brooks Robinson (18 career All-Star games), Frank Robinson, player-manager for my beloved Tribe despite those gawd-awful red uniforms (14 career selections), Eddie Robinson, who represented the White Sox and Twins in 4 contests, and of course Jackie Robinson, with 6 games as a Brooklyn Dodger. Frank Robinson was an All-Star selection for 3 of the 4 teams he played for in his career–Cincinnati, Baltimore, and LA. He never made it to the All-Star game as an Indian. Of course. All of the Robinsons on this list are in Cooperstown. Rodriguez is attached to Alex, Ivan, Ellie, Francisco, and Henry.
The word cloud was created using the R wordcloud, tm, and rColorBrewer packages. The simple R script and data file can be found at https://github.com/dino-fire/allstar-analysis.
Like all of this and my upcoming All-Star analysis, a huge shout-out goes to the data geniuses at Baseball Reference. More baseball statistics than are fit for human consumption. This blog has been cross-posted to the most excellent R-bloggers site as well.
I have a proposition for you. It’s like one of my heroes said, it’s an offer you can’t refuse.
You stand next to a 17-inch wide rubber platter, holding a 36-inch long cylinder of ash or maple in your hands. Another, much more athletic individual stands precisely 60 feet and 6 inches away from you, and throws a baseball in your general direction as hard as he can. When it crosses that 17” plate, the baseball will be traveling between 85 and 100 miles per hour. The ball may or may not hit you. You will almost certainly not hit the ball. (A major league hitter can connect with it one time out of three if he’s very good; what makes you think you can do better than that?) Don’t worry, it will be over in about 2.3 seconds.
For your troubles, I will now award you $11,628. Before taxes, of course. What a deal! You get 11-large to stand there and get a baseball thrown at you. Once. And for every additional time you stand there and have that ball thrown at you, I’ll give you another $11,628. How long would you stand there? How many pitches would you confront for that money?
Well, if your name happens to be Miguel Cabrera, and my name happens to be the Detroit Tigers, you will come to the plate 675 times, and stand there look
ing at 2,500 pitches between now and October. At $11,628 per pitch, I will give you $29 million dollars. And to sweeten the deal, we will do this, you and I, for the next 10 years. Deal?
You’d smile too.
This is not a fantasy, except for the part that you are not the one collecting that cash, and I am not the one doling it out. This is simply one way to view the $292,000,000, 10-year contract the Tigers “inflicted” on Miguel Cabrera this week. Here are a few other fun ways to look at this princely sum.
- A typical ball game lasts 3 hours. Miggy shows up to work: he has played in an average of 157 of the 162 games in a season since he came to the American League from the Miami (nee Florida) Marlins in 2008. So he makes $61,571.13 per hour. The federal minimum wage is $7.25 per hour. You, on the other hand, would have to work 8,500 hours to earn what Miggy does in a single hour. That’s 4 years, chum.
- Cabrera is a third-baseman, mostly. He averages having 1,026 balls hit at him every year. Do the math: that’s $28,265 per fielding chance. He also makes 8 errors per year. He would probably refund the Tigers the $226,000 he got for those particular misplayed chances, but the union contract won’t let him. So you places yer bets and you takes yer chances.
- The Detroit Tigers drew 3,083,397 fans to Comerica Park in 2013 to see their Central Division winning team. And, Cabrera’s 2014 salary of $29,000,000 represents an $8,000,000 raise from 2013. So each fan needs to pony up another $2.59 to cover the additional labor cost. Put another way, the Tigers need to sell a million more $8.00 beers. Shouldn’t be too hard when you think of it that way.
“Professional baseball is on the wane. Salaries must come down or the interest
The more things change, the more they say the same.
I got a Fitbit device a few weeks ago (stop laughing). If you’re not familiar, these things keep track of your exercise (such that it is), and help you stay motivated to get in shape. It does so by tracking the number of steps you take each day, among other things.
One thing it does not do is track your heart rate. Which is a good thing, because it would have exploded last night in the 9th inning of the Indians game against Kansas City. Chris Perez, we all have to die of something, and my demise will be because of YOU.
Perez has the lofty vocation of “closer,” meaning he has one job–admittedly a difficult one–but it is one job. Get three outs, Specifically, the last 3 outs of a ball game, but only those games in which your team is already winning. He did so successfully 39 times last year, but has only 23 saves thus far in 2013. Interestingly, he actually has 5 wins this year, and 3 losses (he was 0-4 in 2012). How does a closer acquire 5 wins, you ask? By blowing saves, that’s how, at which point the Indians’ offense comes in and bails you out in extra innings. It’s not something to be proud of, CP. These are the facts, and they are not in dispute.
Perez blew 4 saves in 2012, and has blown 5 thus far in 2013. Not a big difference statistically, but boy did they seem to hurt this year. Most painful was the 6-5 loss to Boston at the end of May, the same game where he added injury to insult. Perez gave up a three-run 9th-inning lead in that game. Suspecting an injury, he was asked to throw a warm-up pitch, which he promptly launched into the general area where Kevin Costner and James Earl Jones sat in the Fenway Park scene in Field of Dreams. I couldn’t help but think it was intentional: “See, look how hurt I am! It wasn’t my fault!”
And what does Mr. Perez have to say for himself about that Boston game, last night’s Royals game, or any other game, for that matter? Nothing. Nada. Silencio. You see, CP has declared that he would not be talking to the media whatsoever this year, because “it is too much of a distraction to the team.” How convenient. Why did you walk a number 9 hitter batting 0.167 on four straight pitches? Have the consequences of having a bunch of weed shipped to your dog affected your performance? Or is it due to the substance itself? Crickets.
Talking to the media–and vicariously, to the fans that make your mediocrity possible–won’t necessarily make you Mariano Rivera overnight. But hiding behind a “no talk” clause is a poor way to absolve yourself of accountability. In case you haven’t noticed, CP, some people think the Indians actually have a chance at some October baseball this year. Last year, CP castigated fans for not coming out to Progressive Field in larger numbers, This year, you might be well advised to answer that question, at least in part, by looking in a mirror.