Blog 12: Predicting the past

As the headline say, I will try and predict past results in this blog. That’s obviously pretty easy, but I will try and do it without using hindsight. The model I’m using is a fairly simple one based on LS-GAA. Before we get to predict team results, I will take one last look at the individual player level.

My model has a descriptive component (LS-GAA) and a predictive component (pLS-GAA). The descriptive LS-GAA should theoretically equal the player’s goal contribution above or below league average. If a player has a positive LS-GAA he contributes to the team making the playoff.

The projected-LS-GAA tries to predict future performances of a player. It’s based on weighted data from the previous 3 seasons.

So I have calculated the pLS-GAA for 2017/2018, 2018/2019 and 2019/2020 for every player and compared it with the actual LS-GAA for those seasons. It’s the same process as in blog 10. I have just added data from two more seasons. Here’s the result:

And if we remove the goaltenders, the correlation unsurprisingly gets better:

I can refine the data by valueing the components differently:

pLS-GAA adj. = 0.46*pGK + pEVO + 0.83*pEVD + 0.95*pPP + 0.9*pSH + 1.7*pPEN

This doesn’t mean that goaltending is less important than the the other components. Goaltending is just harder to predict, so the model works better if you expect some regression towards average.

Here’s the graph with the adjustments:

Here it is without the goalies:

With that introduction, let’s now look at the team projections. I will do team projections from all 3 seasons based on both the adjusted and unadjusted model.

The first step is to assign players to the teams, and it’s a prediction so I can’t use hindsight. So every player is assigned to the team where he started the season. I can’t predict which players get traded.

When every player has been assigned to a team you find the team p-LS-GAA by adding the individual numbers. Then we use the correlation between goal differential and standing points to convert team p-LS-GAA into projeted points. Now the projected points can be compared with the actual points – the 2019/2020 season is prorated to 82 games. I’m also comparing with Dom Luszczyszyn’s model (The Athletic).

2017/2018

Team	Points	pPoints (LS-GAA)	pPoints (LS-GAA adj.)	pPoints (Dom’s model)
NSH	117	95.8	96.0	95.8
WPG	114	90.1	91.3	95.0
T.B	113	95.7	97.9	95.9
BOS	112	97.9	97.1	96.8
VGK	109	83.5	82.2	83.3
TOR	105	89.5	90.2	95.0
WSH	105	109.6	103.0	100.5
ANA	101	91.1	87.3	93.7
MIN	101	100.2	101.7	95.7
PIT	100	105.2	106.0	103.0
S.J	100	103.4	103.3	93.7
L.A	98	92.4	91.1	92.8
PHI	98	88.7	88.3	90.5
CBJ	97	99.2	96.6	95.3
N.J	97	85.6	84.8	78.2
FLA	96	85.1	87.3	93.1
COL	95	77.7	81.6	84.3
STL	94	92.1	91.8	93.3
DAL	92	92.4	94.0	95.4
CGY	84	90.4	89.6	93.0
CAR	83	92.0	93.5	94.9
NYI	80	92.9	94.1	93.4
EDM	78	95.5	92.0	93.0
NYR	77	97.1	96.4	90.8
CHI	76	91.1	90.0	95.6
DET	73	81.1	84.2	81.0
VAN	73	76.4	77.3	80.8
MTL	71	96.1	92.7	98.4
ARI	70	85.5	85.8	84.9
OTT	67	88.5	89.8	89.1
BUF	62	76.5	80.9	85.5

2018/2019

Team	Points	pPoints (LS-GAA)	pPoints (LS-GAA adj.)	pPoints (Dom’s model)
T.B	128	96.8	100.5	104.6
BOS	107	96.5	96.2	101.0
CGY	107	88.5	88.0	93.2
WSH	104	97.8	95.4	94.4
NYI	103	78.8	82.7	84.9
S.J	101	100.2	98.6	98.6
NSH	100	103.2	101.2	104.8
PIT	100	98.6	98.1	100.2
TOR	100	97.9	98.9	102.9
CAR	99	92.6	94.6	91.8
STL	99	88.9	93.1	91.5
WPG	99	97.1	96.9	100.6
CBJ	98	98.7	96.2	101.1
MTL	96	82.5	84.5	84.2
DAL	93	89.7	88.0	90.8
VGK	93	97.1	95.3	93.5
COL	90	89.0	87.8	86.7
ARI	86	89.9	88.5	89.8
FLA	86	85.7	86.8	95.1
CHI	84	82.9	82.9	90.2
MIN	83	98.1	98.6	92.9
PHI	82	93.5	95.3	96.5
VAN	81	79.5	80.9	77.6
ANA	80	101.6	94.4	97.0
EDM	79	88.7	87.3	86.4
NYR	78	85.1	83.7	81.8
BUF	76	81.2	83.0	86.8
DET	74	78.0	78.1	74.6
N.J	72	89.1	89.6	85.3
L.A	71	92.4	89.2	92.0
OTT	64	73.4	78.7	77.4

2019/2020

Team	Points	pPoints (LS-GAA)	pPoints (LS-GAA adj.)	pPoints (Dom’s model)
BOS	117.1	98.3	96.5	103.7
STL	108.6	97.6	98.9	101.4
COL	107.8	92.5	92.3	93.9
T.B	107.8	111.1	108.9	108.3
WSH	107.0	104.3	102.0	96.7
PHI	105.8	90.6	92.3	91.5
PIT	102.2	93.8	94.9	99.7
VGK	99.3	100.5	99.1	100.1
CAR	97.7	96.2	97.3	100.4
DAL	97.4	97.2	93.3	95.9
NYI	96.5	84.8	85.0	89.2
EDM	95.9	88.1	88.9	83.3
CBJ	94.9	93.1	94.7	87.3
TOR	94.9	103.3	103.6	105.2
FLA	92.7	99.9	98.4	93.3
NSH	92.7	103.6	100.2	100.6
VAN	92.7	85.8	86.0	84.9
CGY	92.5	89.4	90.3	96.0
NYR	92.5	85.3	84.2	84.9
WPG	92.4	93.1	93.4	91.8
MIN	91.5	89.6	91.8	93.0
ARI	86.7	89.1	87.2	85.1
CHI	84.3	89.5	88.7	86.6
MTL	82.0	84.3	86.5	90.1
BUF	80.8	74.9	78.0	80.2
N.J	80.8	85.8	88.7	93.0
ANA	77.4	95.8	89.2	88.5
L.A	75.0	81.8	80.5	78.8
S.J	73.8	93.4	94.0	94.1
OTT	71.6	71.8	77.9	71.2
DET	45.0	71.3	73.4	73.7

The 2017/2018 was difficult to predict. We can compare the quality of the models by looking at the average difference between actual points and projected points:

Year	LS-GAA	LS-GAA adj.	Dom’s model
2017/2018	11.9	11.9	12.0
2018/2019	8.6	8.1	8.1
2019/2020	7.7	7.4	7.2
Total	18.0	18.1	16.2

These simple projection models based on LS-GAA are comparable to Dom’s model. I still think the model can be made a lot better, but you can never make a perfect NHL model. There’s great parity in the NHL and the nature of the game is unpredictable.

When we look at the total difference in points over the 3 years, Dom’s model is better than the LS-GAA based models. This indicates that the mistakes in my models accumulate more over time.

In the next blog I will try and better the goaltender projections, so I can put more weight on that component.

Conclusion:

This was the first draft of my projection model. There’s still a lot of tweaks and refinements to be made, but the first draft is pretty good.
My model is similar to Dom Luszczyszyn’s model. They are both based on player stats and they both have a descriptive component (LS-GAA and GSVA) and a predictive component (pLS-GAA and GS). My model uses totals where Dom’s model uses rates, so he has to estimate the time on ice for each player.
I will try and have a projection model ready for the playoffs (knock on wood), but I won’t post a model I don’t trust.

Stay safe and remember to be kind

All data from www.evolving-hockey.com and www.theathletic.com

Del dette:

Related

Leave a comment Cancel reply