DVOA and other topics in football analytics

Super Nomario · Jan 25, 2014

Good opening post, Bellhorn.

I think it was Burke, but it might have been Dave Berri, who criticized DVOA for the following items:
1) It's a black box. We unworthies aren't given access to the methods, formulas, or weightings used to calculate DVOA. This makes it difficult to evaluate in the cases (like the Buffalo game or the Pats / Jets game a few years ago) where its results don't make intuitive sense. Can we even judge whether they're assigning too much credit for partial success when we don't even know how much they're assigning? Apart from this, it limits its value as an analytical tool. I can't take a split of the Pats' offensive DVOA with and without Gronkowski, for instance, or calculate their defensive DVOA on third down.
2) It's neither a wholly descriptive nor a wholly predictive tool. It makes some allowances for controlling for randomness (such as assigning half a turnover for fumbles no matter which team recovers them) but not others (the randomness of INT rates, for instance). It doesn't adjust for clutch but does weight red zone plays higher. And because it's a black box, we can't tease out the predictive and descriptive elements to try to make it more sound.
3) The number itself is meaningless. A -12% or +4% DVOA isn't a thing except in reference to other values of DVOA. Contrast to EPA or WPA, which represent their values in terms of points or wins, or even stats like Y/A or ANYA that relate back to yards.

Shelterdog · Jan 25, 2014

The biggest issue with DVOA is that it's not clear to me that it measures anything other than DVOA. On the FO website they still tout a bunch of correlations from the 2000-2005 seasons which suggest that DVOA is better at predicting next-year wins than simple stats and is reasonably good (although not as good as points scored/allowed or point differential) at correlating with same-year wins, but that's the extent of the evidence about why it works.

SMU_Sox · Jan 25, 2014

While they didn't have a good year against the spread in 2013 I made bank with them. Plus I used them to win an elimination pool.

They are usually really good with their top 5 picks against the spread, especially if you tease it. You have to know, though, that they are bad at adjusting for injured players. I'd say DVOA is 60% descriptive and 40% purely predictive, with the note that a lot of that descriptive part is predictive too.

Schatz has noted that when they predict games they don't just use DVOA. They actually use the spread too.

SeoulSoxFan · Jan 26, 2014

Besides the better known PFF and FO, I'd like to mention a couple of other stat sources:

Advanced Football Stats: http://www.advancednflstats.com/
NumberFire: https://www.numberfire.com

NF also has a nice (um, optimistic) read on 2013 season here: https://www.numberfire.com/nfl/news/1682/new-england-patriots-2013-team-review-failure-or-miracle

Needless to say, I really appreciate the thoughts being shared here.

bowiac · Jan 26, 2014

Football Outsiders has not done an especially good job explaining whether their system beats much simpler, much more transparent systems like SRS (available at football reference - just a strength of schedule adjusted margin of victory measurement).
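
For anyone who hasn't looked at SRS, here is a minimal sketch of that kind of rating, assuming the usual description: each team's rating is its average margin of victory plus the average rating of its opponents, solved by iteration. The function name, the damping step, and the three-team toy schedule are all made up for illustration; this is not Pro-Football-Reference's code.

```python
def simple_rating_system(games, iterations=200):
    """SRS-style ratings from a list of (team_a, team_b, points_a, points_b).

    Illustrative sketch only: rating = average margin + average opponent rating.
    """
    teams = sorted({team for game in games for team in game[:2]})
    margins = {t: [] for t in teams}
    opponents = {t: [] for t in teams}
    for a, b, pts_a, pts_b in games:
        margins[a].append(pts_a - pts_b)
        margins[b].append(pts_b - pts_a)
        opponents[a].append(b)
        opponents[b].append(a)

    avg_margin = {t: sum(m) / len(m) for t, m in margins.items()}
    ratings = dict(avg_margin)  # start from raw margin of victory

    for _ in range(iterations):
        updated = {}
        for t in teams:
            schedule = sum(ratings[o] for o in opponents[t]) / len(opponents[t])
            target = avg_margin[t] + schedule
            # Damping (an assumption of this sketch) keeps the iteration
            # from oscillating on awkward schedules.
            updated[t] = 0.5 * ratings[t] + 0.5 * target
        # Re-center so the league-average rating stays at zero.
        mean = sum(updated.values()) / len(updated)
        ratings = {t: r - mean for t, r in updated.items()}
    return ratings


# Hypothetical three-team round robin, not real results.
toy_games = [("A", "B", 27, 17), ("B", "C", 20, 17), ("A", "C", 24, 17)]
toy_ratings = simple_rating_system(toy_games)
```

The whole thing needs nothing but final scores, which is the point of using it as a baseline: any more granular system should have to show it beats this.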

With baseball projections, this was always the gold standard - show that PECOTA/ZiPS/etc. beat a very simple Marcel the Monkey type system. Granularity and complexity are great when they achieve something. It's never been clear to me that DVOA does.

coremiller · Jan 26, 2014

The black box issue is the biggest problem with DVOA. Without knowing all the weighting and how all the adjustments work it's impossible to evaluate. Every so often they leak a new piece of information, e.g., Aaron casually mentioned recently that the opponent adjustments take into account down and distance, so that they adjust a team's success on third downs based on the opponent's third down performance, and not the opponent's overall performance. Does this granularity increase the model's accuracy or just introduce more noise? No one knows.

The real problem, though, is not that their model is a black box, but that their dataset is proprietary. If there were open-source access to all the PBP data in a downloadable format, some smart people with free time on their hands could probably reverse engineer much of the model, or use the data to test out other models. But no one has the data.

Bellhorn, the standard FO response to your 5,3,0 vs 0,0,10 issue is that it's a predictive vs descriptive problem. A purely predictive model will prefer the 5,3,0 scenario because you had two successful plays out of three, which indicates you are more likely to have successful plays in the future than if you only had one. The model will hate the two zeros in the 0,0,10 scenario and penalize you for it.
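
To make the 5,3,0 vs 0,0,10 comparison concrete, here is a small sketch that counts "successful" plays in a series. The thresholds (45% of needed yards on first down, 60% on second, 100% on third) are the commonly cited success-rate cutoffs and are an assumption here, since the actual DVOA weightings aren't published; the function name is hypothetical.

```python
def count_successes(gains, to_go=10, thresholds_pct=(45, 60, 100)):
    """Count successful plays in one series of downs.

    gains: yards gained on each play, in order.
    thresholds_pct: assumed share of the remaining distance a play must
    gain to count as a success on 1st, 2nd, and 3rd (or later) down.
    """
    successes = 0
    for down, gain in enumerate(gains):
        pct = thresholds_pct[min(down, len(thresholds_pct) - 1)]
        # Integer arithmetic avoids float rounding at the exact threshold.
        if gain * 100 >= pct * to_go:
            successes += 1
        to_go -= gain
        if to_go <= 0:  # first down reached, the series is over
            break
    return successes


steady = [5, 3, 0]      # 8 yards, fails to convert
boom_bust = [0, 0, 10]  # 10 yards, converts on third down
steady_successes = count_successes(steady)
boom_bust_successes = count_successes(boom_bust)
```

Under those assumed cutoffs the 5,3,0 series gets two successes and the 0,0,10 series gets one, even though the second series gains more yards and is the only one that moves the chains. That gap is exactly the predictive vs descriptive tension.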

A separate problem with FO is that they continue to tout their individual player stats, which often produce bizarre results and which are frequently worse than useless. That says nothing about the validity of the underlying DVOA model for team success, but it hurts their credibility. Their generally thin skin, and inclusion of a number of newer writers who have a tendency to make obvious mistakes and say outlandishly silly things, do not help in this department.

FO is frustrating. It's clearly a big improvement over unadjusted yardage stats, but I'm always left feeling like it could be much better than it actually is.

bowiac · Jan 26, 2014

FWIW, as far as the "black box" complaints, I just ran a quick regression, and found that 97.2% of the variation in DVOA can be explained just through simple box score stats. Now maybe that last 2.8% is where the magic happens, but it's a pretty small amount of it happening, it seems.

bowiac · Jan 27, 2014

Out of curiosity, I threw in some PFF data (which is obviously super black box itself), and got that soup up to a 99% correlation for in-sample data.

SMU_Sox · Jan 27, 2014

Ok... I'm impressed. Do you have that bad boy available for download?

SMU_Sox · Jan 27, 2014

Can't edit on my mobile. What program did you use to run the regression?

bowiac · Jan 27, 2014

SMU_Sox said:
Ok... I'm impressed. Do you have that bad boy available for download?

PM me with your e-mail address - I'll send you a copy. It's at 99% for single-season in-sample data, and 97% for 3 seasons (2011-2013) of in-sample data. The three-season data is significantly more useful because it retains a 97% correlation for an out-of-sample season (2010). I haven't bothered t-stat-ing any of this to remove extraneous coefficients, so while I'm pulling 28 variables right now, probably only seven or eight are really doing anything.
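
For readers who want to try the same kind of check outside Excel, here is a rough sketch of the workflow described above: fit a linear regression on three seasons, then score it on a held-out season. Everything in it is an assumption for illustration. The data is randomly generated (three hypothetical box score predictors and a fake "DVOA" target), not the actual 28 variables or the spreadsheet, and the helper names are invented.

```python
import random


def fit_ols(X, y):
    """Least-squares coefficients (intercept first) via the normal equations."""
    rows = [[1.0] + list(x) for x in X]
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * t for r, t in zip(rows, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                f = A[r][col] / A[col][col]
                A[r] = [a - f * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]


def predict(coefs, X):
    return [coefs[0] + sum(c * v for c, v in zip(coefs[1:], x)) for x in X]


def r_squared(y, y_hat):
    """Share of the variation in y explained by the predictions."""
    mean = sum(y) / len(y)
    ss_res = sum((a - b) ** 2 for a, b in zip(y, y_hat))
    ss_tot = sum((a - mean) ** 2 for a in y)
    return 1 - ss_res / ss_tot


def make_fake_season(rng, n_teams=32):
    """Synthetic stand-in for one season of team stats (NOT real data)."""
    X = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(n_teams)]
    y = [0.10 * x[0] - 0.08 * x[1] + 0.05 * x[2] + rng.gauss(0, 0.02) for x in X]
    return X, y


rng = random.Random(0)
train_X, train_y = [], []
for _ in range(3):                      # three "in-sample" seasons
    X, y = make_fake_season(rng)
    train_X += X
    train_y += y
test_X, test_y = make_fake_season(rng)  # one "out-of-sample" season

coefs = fit_ols(train_X, train_y)
r2_in = r_squared(train_y, predict(coefs, train_X))
r2_out = r_squared(test_y, predict(coefs, test_X))
```

The out-of-sample number is the one that matters: with 28 predictors and 32 teams, a single-season in-sample fit can look great almost by construction, which is why the multi-season fit holding up on 2010 is the more convincing result.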

bowiac · Jan 27, 2014

SMU_Sox said:
Can't edit on my mobile. What program did you use to run the regression?

Excel. Any reason I shouldn't?

SMU_Sox · Jan 27, 2014

For not using Excel for a simple regression? No. But if you made a model using data from 2005 onward you might have done it in something else. You said simple regression and then said you added something. I'm just trying to see what you did. I'm a math/stats nerd. I apologize.
