A machine learning practice project
By Josh Crowhurst
March 2023
Use arrow keys or spacebar to navigate slideshow
Data used:
Initial hypotheses:
EDA helped inform feature engineering and decide which features to include:
And inspired some additional features to include in future:
The tidymodels framework helps simplify and align the syntax of various popular packages for machine learning. It was useful here to train and evaluate multiple algorithms with minimal configuration
Competitive model performance through February 23, 2023 provided by @HockeySktte on Twitter
I used odds data from bettingdata.com to backtest a simple strategy: wagering $100 any time the model suggested a positive expected value on a bet during the season to date.
This strategy was not profitable, with a cumulative loss of -10.5% ($2538) on 241 bets.
A few next steps to make this usable for sports betting:
Get in touch: