Quote Originally Posted by MontanaMax:
Out of curiosity I have been trying to formulate something similar like this for the NHL
Just don't know how to put the parameters together but something is itching me that there could be some gold in them darn hills
Using some or all of the below:
Sagarin ratings
RPI
Foxfeed ratings
Or whatever else...........
Just show me the door.......
I'll walk through it...........
Ok, I'm not too familiar with how to break down a hockey box score since most of my work revolves around football but I'll lay out some things that should help.
For starters, say we have a matchup of Team A vs. Team B:
- Find a data set (either full season statistics, or last (x amount of games) statistics for Team A's Offense, Defense, Goaltending, Penalties, etc.. (whatever goes into a box score of hockey).
- Break all of this down into a per game average, so you come up with a box score for Team A.
- Do the same for Team B now.
You want to basically have 4 columns of data, Team A Offense & Defense, vs. Team B Offense & Defense, all per game statistics. Create another 2 columns separate, Team A & Team B.
Average Team A Offense & Team B Defense together and compile Team A's box score in 1 of those 2 columns. Do the same for Team B Offense vs. Team A Defense and compile Team B's box score in the 2nd column.
Take each teams Sagarin Rating, RPI Rating, w/e you choose to you use, add them together, divide them by 2. ie. Team A Rating 4.5, Team B Rating 5.5, for an average rating of 5. Divide Team A's rating (4.5) by the average rating (5) for a factor of 0.9. Do the same for Team B, (5.5/5) for a factor of 1.1.
Using the 2 columns of projected box score data, you multiply Team A's stats by their factor (0.9) and do the same for Team B (1.1).
This basically means, the better teams stats are going to be inflated, and the worse of the two deflated, but can hopefully project an outcome of the matchup at hand.
Now you can use any statistics with this, but the base is to multiply these statistics by their Rating, RPI, w/e factors, and compile a projected score doing this.
What I've done, was keep an archive of every weeks projected results, as well as every weeks actual results, and just filter the results until I find what wins the most. For instance, top tier teams vs. bottom tier teams, maybe the formula projects a top tier team to cover the puckline vs bottom tier teams most of the time, but the actual results are the bottom tier team covering the puckline more often, this would result in you fading the system, or staying away. Maybe it predicts most accurate when two similar teams are playing, with the favorite at home, or the road, etc..
Sorry for the long post, hope it helps, if you can't understand I can try and dip my fingers into the NHL to see what I can come up with for a visual sample.