Welcome to Beyond the Basics!
My name is Zack Capozzi, and I run LacrosseReference.com, which focuses on developing and sharing new statistics and models for the sport.
The folks at USA Lacrosse Magazine offered me a chance to share some of my observations in a weekly column, and I jumped at the chance. Come back every Tuesday to go beyond the box score in both men’s and women’s lacrosse.
It all started with win probability. I was watching a Notre Dame-v-Denver men’s lacrosse tournament game, and as the game got down to crunch time, the question came up: I wonder what their win probability is right now? At this point, the concept of win probability was not especially new. ESPN’s game pages for baseball and football regularly showed the probability that either team would win whatever game was going on at the moment.
But as you might imagine, the Google search to determine the win probability in the lacrosse game we were watching turned up a grand total of zero results. Before the end of the game, LacrosseReference.com had been purchased from GoDaddy — and the rest is history.
As the earliest tool in the lacrosse stats toolbox, I sometimes forget about the novelty of win probabilities in lacrosse. But as we work to bring more statistical insight to lacrosse, I thought it would be worthwhile to go into a bit more detail on the metric and the process that gets it on to your screens each week.
DEFINITION
Let’s start with the basics. Win probability refers to the chance that a specific team will win a specific game. If you had two perfectly evenly matched teams playing a game on a neutral field, both teams would open the game with a 50 percent win probability. And of course, the win probability for the two teams should always add up to 100 percent.
Abstract concepts can be tough, so let’s put this in terms of what you know in your gut from watching years of lacrosse. Here are the win probabilities for home teams with the specific halftime leads:
-
Tied: 51.3%
-
Leading by one goal: 69.0%
-
Leading by two goals: 82.3%
-
Leading by three goals: 91.9%
So if your team is ever down on the road by a goal at halftime, know that there is still a 3-in-10 chance that they come back to win the game.
These are averages, and they obscure differences between teams. A team that plays at a quicker pace has a greater chance of coming back because there are more possessions and more of a chance to bridge the gap.
My favorite part of having a win probability model is being able to produce win probability charts throughout the course of a game. Here’s an example from a recent Ohio State victory over Louisville.
Can’t you just feel the excitement of the Louisville comeback to tie it at the end of regulation? And then a tense overtime period punctuated by the game-winner for the Buckeyes.
WIN PROBABILITY PRE-GAME
But how does one come to a win probability for a given matchup?
In short, you need two things: A) a numeric ranking to determine the relative strength between the two teams, and B) some formula for converting that into a probability. I use my Lax-Elo ratings to estimate the strength of the two teams. And there is a tried and true formula for converting Elo ratings into a win probability (which I wrote about at length earlier this year).
Take the recent game between Marquette and Villanova as an example.
Using the Elo ratings (1546 for Marquette and 1462 for the Wildcats) and after applying the home-field advantage bump (about four percentage points), we end up with Marquette as a slight favorite. (As it turns out, the Golden Eagles ended up winning 19-14). As you might have guessed, the in-game win probability chart like I showed for the Buckeyes always starts with the pregame win probability as the starting point.
But interpretation here matters quite a bit. And this is frustrating for some people, but that 61 percent should be interpreted as: “if these teams played 100 times, we would expect Marquette to win 61 of those games.” It definitely does not mean that the model is 61 percent confident that Marquette will win.
This is a bit odd, but this also means that if the Win Probability model gives Team A a 90% chance to beat Team B, there is nothing wrong with the model if Team B ends up winning the game. The issue would arise if, out of 100 90-percent win probability games, the favorite wasn’t winning around 90 of those games. When the model says 90 percent, you want it to mean 90 percent.