This isn’t your typical office pool, nor the higher speed online bracket contest for the annual NCAA basketball playoff tournament. For the second year, Kaggle hosts the mother of bracketology contests – March Machine Learning Mania – with a purse of $15,000 sponsored by HP. Granted that’s not as much moolah as some people will acquire via old fashioned Las Vegas standards, but it does show a growing interest and relevance to the role of data in sports.
For the contest, Kaggle primes the data lake by providing 30 years of historical data on the teams to participants. Augment the legacy info with your own social media, or use whatever data sets you can wrangle.
The contest is divided into two stages: pre and post dance card.
Stage 1 – Model Building
Feb 2 – Mar 14, 2015 – competitors build and test models on historical data. During this phase, the leaderboard shows the model performance on historical tournament outcomes.
Stage 2 – 2015 Championship
Sunday, Mar 15 – Selection Sunday (68 teams announced)
Monday, Mar 16 – Kaggle begins to accept 2015 predictions. Release of up-to-date 2014-2015 season data.
Wednesday, Mar 18 – Final deadline to submit 2015 predictions (11:59PM UTC).
Mar 19 – Apr 6 – sit back, relax, and watch your predictions come true!
Winners and Losers
The prize is NOT determined in the traditional bracket results. Instead, a more geekily appropriate percentages for the likelihood of winning each possible matchup determines the winner.
For the doubters that sports and geeks go together (and you missed the movie Moneyball), MIT hosted its 8th high profile Sloan Sports Analytics Conference last month with players, coaches, correspondents and analysts exploring the growing industry of sports analytics.