Advanced NFL Stats Community: Best (And Worst) Post-Season Coaching Records Since 1950, via a Binomial View

by Jim Glass

Vince Lombardi's post-season coaching W-L record of 9-1, .900, is the best ever. Or maybe not, some say Joe Gibb's record of 17-7 is standard to beat - "only" .708, but over a run of 2.4 times as many games it is much harder to keep a big winning record. How can we compare them?

One way is to compute the binomial probability of a coach attaining a given won-loss record by random chance. For instance, in Herm Edwards' first year coaching my Jets, he went 10-6. I was happy. That .625 winning record was better than Tom Landry's .607 and our former coach Bill Parcells' .570 - better than a Hall of Famer's and sure future Hall of Famer's! It looked like we had a great coach. Except that Parcells earned his .570 over 303 games and Landry earned his .607 over 418, while Herm had earned his .625 over only 16.

The binomial calculation can give the probability of winning a given number of games out of any number played, and so providea a common standard to apply to the W-L performances of different coaches with different W-L percentages over differing numbers of games. It told me that, assuming game outcomes were random with a 50% chance of winning/losing, Herm's record of .625 (or better) had only a 22% probability of occurring by random chance, which was pretty encouraging - but Bill's .570 record had only a 1% chance, and Tom's .607 had only a 0.001% chance. So perhaps it was premature to declare Herm a better coach than Tuna and Tom (as indeed it turned out to be).

Now it's playoff time, so I wondered: what if we applied this common standard when looking at the playoff coaching records of Belichick, McCarthy, Dungy, and all the other coaches of all post-season games back to 1950? And suppose we set the random probability of winning each playoff game not at 50%, but at the actual probability of winning indicated by the relative strength of the competing teams and home field advantage?

Which coaches would we see to have beaten the odds, which not, and who would the surprises be? This is the exercise followed below.

Method
I used the PFR.com data base to compile a list of all NFL post-season games and coaches from 1950 through 2010, using teams' "Simple Rating System" numbers to gauge their relative strength. The SRS gives a team's strength-of-schedule adjusted average net points per game (points for minus points against) compared to the league average. To this I added 2.5 points to the rating of the home team (when there was one). From the resulting point differential the conventional Pythagorean formula provides a winning probability for each team in each game.

Taking all the games for each coach, the resulting "Unit Pythagorean" rating provides his expected W-L percentage.
Using this percentage instead of 50% as the probability of winning, the binomial calculation gives the probability of the coach's actual W-L record resulting from random chance.

Take Dick Vermeil as an example. His 6-5 winning record with a Super Bowl championship appears very credible on its face. But his teams averaged a full touchdown better than the opposition by SRS, were the higher rated in every single game, and on top of that had home field advantage seven times to only twice for the opponent. All this gives him a .689 expecting winning rate, the highest of any coach with three or more post-season games. Put that into the calculator and it says his actual 6-5, .545 record was better than only a meager 9% of those expected by chance - random chance would beat his playoff record 91% of the time.

At the other extreme, Blanton Collier as the successor to the great Paul Brown disappointed the Browns fans of his day by going only 3-4 in the post-season. But as he had only a .255 expected winning percentage, his .429 winning record was superior to .746 of those expected by chance, the highest binomial rating of any coach with a losing record.

More recently, Ken Whisenhunt has gone 4-2, .667, in the playoffs with only a .281 expected winning rate, giving him a winning record better than .944 of those expected by chance - the highest binomial rating among all the 128 coaches to take a team to the playoffs since 1950. (Lombardi is #2.)

The numbers, and interpreting them
I've limited the list below to the 70 coaches who have coached five or more playoff games since 1950.

Given for each is the number of playoff games coached, his W-L count, actual winning percentage, expected winning percentage, and the binomial calculation of the percentage of random results his record exceeds. Thus, a figure of .612 indicates the coach's record is better than 61.2% of the records produced by chance (given his expected winning percentage).

Personally, I do not consider these numbers to be very persuasive evidence of "how good" a coach is or was. The sample sizes for all are small to very small and vary greatly, and random results in just a few close games can make a very big difference in the binomial result.

But this data does show, objectively, how well each coach's actual record in the playoffs compares to reasonable expectation. Is Coach X's "great" record in the playoffs really as good as it looks? Is Coach Y's "poor" record actually better than it looks? Does Marty Schottenheimer really have the worst playoff coaching record of all time? (No, but close.)

For perspective on (and arguments about) such questions, I think this data is useful.

Rank	Name	Games	Won	Lost	Won Lost%	Expected WL%	Binomial Performance
1	Ken Whisenhunt	6	4	2	0.667	0.281	0.944
2	Vince Lombardi	10	9	1	0.900	0.662	0.901
3	Weeb Ewbank	5	4	1	0.800	0.448	0.871
4	Bum Phillips	7	4	3	0.571	0.308	0.863
5	Joe Gibbs	24	17	7	0.708	0.578	0.862
6	Jimmy Johnson	13	9	4	0.692	0.504	0.860
7	Tom Flores	11	8	3	0.727	0.521	0.857
8	Chuck Noll	24	16	8	0.667	0.545	0.839
9	Bill Belichick	21	15	6	0.714	0.601	0.797
10	Tom Coughlin	15	8	7	0.533	0.396	0.796

11	John Fox	8	5	3	0.625	0.422	0.790
12	Bill Walsh	14	10	4	0.714	0.574	0.783
13	Hank Stram	8	5	3	0.625	0.427	0.782
14	Don McCafferty	5	4	1	0.800	0.530	0.773
15	Blanton Collier	7	3	4	0.429	0.255	0.746
16	Rex Ryan	6	4	2	0.667	0.452	0.741
17	Raymond Berry	5	3	2	0.600	0.379	0.718
18	Bill Parcells	19	11	8	0.579	0.500	0.676
19	Jerry Burns	6	3	3	0.500	0.340	0.667
20	Marv Levy	19	11	8	0.579	0.504	0.663

21	Brian Billick	8	5	3	0.625	0.512	0.610
22	Barry Switzer	7	5	2	0.714	0.594	0.593
23	Dan Reeves	20	11	9	0.550	0.499	0.592
24	Jeff Fisher	11	5	6	0.455	0.379	0.591
25	John Madden	16	9	7	0.563	0.504	0.586
26	Mark Holmgren	24	13	11	0.542	0.503	0.569
27	Mike Shanahan	13	8	5	0.615	0.556	0.556
28	Bill Cowher	21	12	9	0.571	0.534	0.548
29	John Harbaugh	7	4	3	0.571	0.479	0.546
30	Ray Malavasi	6	3	3	0.500	0.402	0.540

31	Mike Tomlin	7	5	2	0.714	0.630	0.513
32	Sam Wyche	5	3	2	0.600	0.495	0.509
33	Mike McCarthy	7	5	2	0.714	0.634	0.504
34	John Robinson	10	4	6	0.400	0.355	0.500
35	Tom Landry	36	20	16	0.556	0.543	0.492
36	George Seifert	15	10	5	0.667	0.636	0.482
37	Norv Turner	8	4	4	0.500	0.458	0.458
38	Jerry Glanville	7	3	4	0.429	0.403	0.413
39	Andy Reid	19	10	9	0.526	0.526	0.409
40	John Gruden	9	5	4	0.556	0.541	0.400

41	Tony Dungy	19	9	10	0.474	0.479	0.393
42	Sean Payton	6	4	2	0.667	0.631	0.392
43	Dave Wannstedt	5	2	3	0.400	0.378	0.376
44	Ted Marchibroda	6	2	4	0.333	0.327	0.364
45	Pete Carroll	5	2	3	0.400	0.393	0.349
46	Art Shell	5	2	3	0.400	0.427	0.292
47	Herm Edwards	6	2	4	0.333	0.375	0.274
48	Dick Nolan	5	2	3	0.400	0.458	0.244
49	Bud Grant	22	10	12	0.455	0.514	0.220
50	Steve Mariucci	7	3	4	0.429	0.513	0.206

51	Mike Ditka	12	6	6	0.500	0.586	0.184
52	Bobby Ross	8	3	5	0.375	0.474	0.181
53	Jim Fassel	5	2	3	0.400	0.506	0.180
54	Chuck Knox	18	7	11	0.389	0.474	0.169
55	Don Shula	36	19	17	0.528	0.595	0.161
56	Red Miller	5	2	3	0.400	0.532	0.150
57	Lovie Smith	6	3	3	0.500	0.631	0.139
58	Mike Martz	7	3	4	0.429	0.587	0.109
59	Don Coryell	9	3	6	0.333	0.485	0.106
60	Dick Vermeil	11	6	5	0.545	0.689	0.091

61	Mike Sherman	6	2	4	0.250	0.531	0.083
62	Wayne Fontes	5	1	4	0.200	0.409	0.072
63	Dennis Green	12	4	8	0.333	0.519	0.056
64	George Allen	9	2	7	0.222	0.432	0.048
65	Paul Brown	12	4	8	0.333	0.541	0.041
66	Sid Gillman	6	1	5	0.167	0.420	0.038
67	Jack Pardee	6	1	5	0.167	0.552	0.008
68	Wade Phillips	6	1	5	0.167	0.560	0.007
69	Marty Schottenheimer	18	5	13	0.278	0.534	0.007
70	Jim Mora, Sr.	6	0	6	0.000	0.597	0.000

6 comments:

James said...: So should we call the coaches on the lower half "bad", "underperforming (in the playoffs)", or "unlucky"?

Same for the upper half of coaches.; January 13, 2012 at 1:27 PM
Jim Glass said...: James, I'd say: "mostly lucky / unlucky".

It's hard for me to believe that Paul Brown and George Allen were really very bad post-season coaches, much worse than say Bum Phillips. For that to be true they'd have to have changed the way they coached in the post-season from the way they did during the regular season, to a worse way, and why would they do that? Who reports they did? But with very small sample sizes, just a few random events could've made a big difference in their numbers.

Take Whisenhunt as an example, #1 on the entire list. His first playoff win in 2008 was a close, less-than-one-score win against the Falcons. Games that close are largely luck. If he'd lost that game his playoff record would be 1-2 instead of 4-2, and he'd be way down the list. In fact he wouldn't even be on the "five games or more list". So one turnover could've knocked him from being ranked #1 of all over 60 years to being just another also ran. That's how much effect random luck can have on these numbers, especially on the guys with small sample sizes, only 5, 7, 8 games.

But, OTOH, it is hard to look at Lombardi and say "just pure luck". And a lot of people say Schottenheimer got very conservative in the playoffs to cause his poor results. I don't know if that is true, or just a story. But it is plausible that in the big games of the playoffs some coaches might change their behavior -- maybe their risk aversion would come out -- so playoff-level coaching quality could vary. Maybe, as organizational leaders, some coaches just deal with big-game pressure better than others.

So it is entirely credible that some degree of difference in post-season coaching ability exists and is reflected in the numbers.

But I'd still say the differences in the final numbers are largely, mostly, luck. IMHO, FWIW.

Again, the purpose of this exercise *wasn't* to find evidence that this coach is/was "better" than that coach.

It was to obtain an objective measure of how coaches actually did in the playoffs, and to maybe compare that to popular conceptions of how they did.; January 13, 2012 at 4:05 PM
Jim A said...: I guess you are comparing how coaches performed in the playoffs relative to how they were expected to perform. But how they were expected to perform in the playoffs is largely determined by how they performed in the regular season (by using SRS and Pythagorean). So some might say this is a measure of "clutch" coaching. It's an interesting exercise, though I don't know how useful it is.; January 14, 2012 at 12:31 AM
Andrew Foland said...: A slightly different way to look at this is as an ensemble of predictions of playoff performance, given regular-season performance. If playoff performance is systematically identical to regular season performance, then in such an ensemble, the probability value in the far right column should be distributed uniformly between 0 and 1. With the exception of the pileup of four under 0.01, I'd say this does in fact look very uniform in probability density.

That is an indication that over an ensemble of coaches, regular season performance accurately predicts playoff performance, and that the variations are largely those expected solely due to chance. If there were special "playoff coaching ability", there would be significant bowing at 1 and at 0. There's a slight pileup at 0, but not a net bowing, so there's some weak evidence that about 5% of coaches completely choke, but other than that, playoffs are no different from the regular season.; January 15, 2012 at 10:20 AM
Anonymous said...: There could be something to George Allen's ranking in this list. As a Redskins famously said after the 1972 Super Bowl, Allen was so obsessed with distraction that he became a distraction.

-b; January 16, 2012 at 12:21 PM
Jerrell said...: Mike Smith bucking to get into the Jim Mora, the elder Club, "Playoffs?"; January 13, 2013 at 12:32 PM