My endeavor is to provide a free service, without any strings attached, motivated purely by curiosity. This project is still in its infancy, so all results should be taken with a grain of salt.
So let's build a model. To narrow the scope, I will be only concentrating on OVER/UNDER wagering. We begin by laying down some assumptions and observations:
- The bookmakers want a LINE that has about the same money on both sides. This guarantees that they will make money on commissions, regardless of the outcome. Thus, the resulting LINE may or may not be statistically indifferent. What I mean by that is simple: the line doesn't necessarily give you a 50/50 chance of winning. The biases of all the individuals of the betting pool factor into where the LINE moves.
- This model attempts to find the true LINE, the one free of biases. After which if it is found to be statistically significantly different from the biased line, then the appropriate betting recommendation is given.
- The model will only make a bet if it is favored to do so; therefore, it will not make a bet just for the sake of betting.
- The OVER/UNDER LINE represents the predicted sum of two teams. Therefore, points and points allowed should be analyzed.
- Teams generally play better at home than on the road. Thus, home scoring for the home team and road scoring for the away team should also be analyzed.
- The only data that are processed are the end scores. This is done so to maintain simplicity as there are infinite potential conditions that could be considered.
- Assuming a juice rate of 10%, the model has to be right at least 52.38% of the time. This also assumes that the same sized wager is made for the long run.
- This model will most likely have "bad streaks," just like how Shaq might have missed five free throws in a row. But even his career FT% of 52.7 would make money in the long run.