Today is Selection Sunday. It is the most important day in college basketball and the day when we finally get a chance to look at the overall bracket and begin using the models that have been developed over the past couple of weeks to pick winners and losers. We are first going to go over the results of the predictions of the models and then describe a quick and dirty final model - a simple democratic composite model.
So, here are the results of all of the conference tournaments. I made projections for all 32 tourneys with the EDM and SPM models, with the MASPM and EDLM models coming in for the later tourneys since they were developed later.
Results of Projections:
Conference | EDM | SPM | MASPM | EDLM | Actual |
---|---|---|---|---|---|
Atlantic Sun | Liberty | ||||
Big South | Gardner-Webb | ||||
Patriot | Colgate (8-1) | Colgate | |||
Horizon | Northern Kentucky (6-1) | Northern Kentucky (7-0) | Northern Kentucky | ||
Northeast | Fairleigh Dickinson | ||||
Ohio Valley | Murray State | ||||
MAAC | Iona | ||||
Missouri Valley | Bradley | ||||
West Coast | Saint Mary's | ||||
Southern | Wofford (7-2) | Wofford (9-0) | Wofford (9-0) | Wofford | |
America East | Vermont (5-2) | Vermont (5-2) | Vermont (5-2) | Vermont | |
Colonial | Northeastern | ||||
Summit | NDSU | ||||
MEAC | NC Central | ||||
MAC | Buffalo (9-2) | Buffalo (6-5) | Buffalo (7-4) | Buffalo (8-3) | Buffalo |
ACC | Duke | ||||
Sun Belt | Georgia State (7-2) | Georgia State (7-2) | Georgia State (7-2) | Georgia State | |
SWAC | Prairie View A&M (6-1) | Prairie View A&M (6-1) | Prairie View A&M (6-1) | Prairie View A&M (6-1) | Prairie View A&M |
Big Sky | Montana (8-2) | Montana (7-3) | Montana (7-3) | Montana (7-3) | Montana |
Atlantic-10 | Saint Louis | ||||
Mountain West | Utah State | ||||
Pac-12 | Oregon | ||||
Southland | Abilene Christian (4-3) | Abilene Christian (4-3) | Abilene Christian (4-3) | Abilene Christian | |
Big East | Villanova (6-3) | Villanova (7-2) | Villanova (7-2) | Villanova (6-3) | Villanova |
SEC | Auburn | ||||
Big 12 | Iowa State | ||||
Conference USA | Old Dominion (9-2) | Old Dominion (10-1) | Old Dominion (9-2) | Old Dominion | |
Big West | UC Irvine (5-2) | UC Irvine (5-2) | UC Irvine (5-2) | UC Irvine (4-3) | UC Irvine |
WAC | New Mexico State (6-1) | New Mexico State (6-1) | New Mexico State (6-1) | New Mexico State (5-2) | New Mexico State |
Ivy | Yale (2-1) | Yale (3-0) | Yale (2-1) | Yale | |
American | Cincinnati | ||||
Big Ten | Michigan State (7-6) | Michigan State (9-4) | Michigan State (9-4) | Michigan State | |
Record | 200-89 (.692) | 190-99 (.657) | 159-81 (.663) | 119-63 (.654) | EDM Wins |
The first model that I built (EDM) ended up being the best predictor of individual games being the only model winning more than twice as much as it lost (.692). The SPM had the most perfect brackets with 3. The MASPM was the only other model to get a perfect bracket having 1. The important thing to note is that these perfect tournaments were typically smaller tournaments with very few upsets. Even though the SPM had the most perfect brackets, it also had the second worst record by percentage (.657). Our newest model, the EDLM ended up performing the worst (.654).
Now that we have these results, we can design our final model: A composite model. This model is the most simple of them all. We let the other teams vote on which team they have winning at a particular point in the bracket and the team with the most votes for a game progresses in the bracket. Pretty simple, but there are a couple of edge cases to consider.
The first is what occurs during a tie. We have four models that are voting members, so how do we break ties? We could select the higher seed as a safe pick. We could select the lower seed as an upside pick. We could use a fifth model as reference like the Elo Model I described earlier in this series. In our case, we will make the EDM model the priority model. It performed best during the conference tournaments so we'll give it a little more weight in the decision making process for our composite model.
The other edge case happens with the EDLM. This model doesn't generate scores but probabilities. With the other models, we'll simply pick the team with the higher rating. But with the EDLM we can't do that. We should pick the team of those available with the highest probability to that point in the tournament which may not match the selection of the pure EDLM since each round is selected by the composite model. We also need to fix the probabilities to the results of prior rounds when considering games in later rounds.
We could treat games as independent of each other to simplify the complex probability trees, but as mentioned in the last post, games are entirely dependent on the prior game in the single elimination setting. In either case, if the other three models are in agreement, we don't have to generate probabilities and if the EDM and one other model is in agreement, a calculation can be skipped since the EDM has the tie-breaker priority.
So, that is all for this post. A short one today. We are done with the development phase and are now on to the trial phase. The big tournament for prediction purposes starts Thursday afternoon. I'll try to post each model's predictions before then, maybe once a day. Tonight, we'll know the teams. I will also provide some potential upset picks by each model to help anyone look for a trendy pick to help them in their brackets. Happy picking!
Congratulations @statsplit! You have completed the following achievement on the Steem blockchain and have been rewarded with new badge(s) :
You can view your badges on your Steem Board and compare to others on the Steem Ranking
If you no longer want to receive notifications, reply to this comment with the word
STOP
To support your work, I also upvoted your post!
Do not miss the last post from @steemitboard:
Vote for @Steemitboard as a witness to get one more award and increased upvotes!
Downvoting a post can decrease pending rewards and make it less visible. Common reasons:
Submit