Parallel

Software and Statistics: Picking NCAA Final Four Winners

, April 04, 2008

Optimization, regression, and Markov Chain

And from the Dr. Dobb's Sports Department for all you NCAA Final Four basketball fans....

Three engineering professors at the Georgia Institute of Technology have created the LRMC computer ranking system that consistently predicts NCAA basketball rankings more accurately than the AP poll of sportswriters and the ESPN/USA Today poll of coaches, formulas (the Ratings Percentage Index), other computer models (the Massey ratings and the Sagarin ratings), and even the tournament seeds themselves.

After correctly picking all four of this year's finalists, the LRMC method has now identified 30 of the last 36 Final Four participants (83 percent accuracy over the past nine years of NCAA tournaments) as one of the top two teams in their region. Over the same nine-year stretch, the seedings and polls have correctly identified only 23, and the RPI indentified 21.

LRMC, short for "Logistic Regression Markov Chain," is a college basketball rankings system designed to use only basic scoreboard data, including which teams played, which team had home court advantage and the margin of victory. It was originally designed by Joel Sokol and Paul Kvam and has been maintained and improved by Sokol and George Nemhauser, all three optimization and statistics professors in the Stewart School of Industrial and Systems Engineering at Georgia Tech.

"As fans, we only get to see most tournament teams two or three times at most during the season, so our gut feelings about a team are really colored by how well or poorly they played the few times we've been watching," said Sokol. "On the other hand, our system objectively measures each team's performance in every game it plays, and mathematically balances all of those outcomes to determine an overall ranking."

LRMC seems to have a particular knack for predicting good bubble teams and identifying the top teams. In addition to correctly picking the Final Four, LRMC also correctly identified several over-rated and under-rated teams as potential upsets. First-round losers Drake (5-seed, LRMC #30), Vanderbilt (4-seed, LRMC #38), and Connecticut (4-seed, LRMC #26), as well as second-round loser Georgetown (2-seed, LRMC #12), were all picked by LRMC as significantly over-rated teams.

On the other hand, teams like West Virginia (7-seed, LRMC #17), which defeated second-seeded Duke, and Kansas State (11-seed, LRMC #19), which defeated sixth-seeded USC, were correctly identified by LRMC as under-rated teams that could pull off one or more upsets.

But LRMC isn't perfect -- it picked Clemson as under-rated (upset in the first round) and Davidson wasn't identified as under-rated by any major ranking method, including LRMC.

LRMC differs from other computer rankings systems in two important ways. When determining the value of home court advantage, LRMC considers how much playing at home helps a team win rather than how many points playing on a home court is worth.

Georgia Tech researchers have also been able to show that very close games are often "toss-ups," meaning the better team barely wins more than half the time. So, they determined that winning a close game shouldn't be worth as much as winning easily, and losing a close game shouldn't hurt a team's ranking as much as losing badly. LRMC's ranking methodology takes this into account.

Similar to other rankings systems, LRMC also uses the quality of each team's results and the strength of each team's schedule to rank teams.

So which team does LRMC favor for the top spot this year? It's chosen Kansas, despite UNC, UCLA and Memphis being the top three ranked teams by most systems. Rock chalk, Jayhawks!

More Insights

INFO-LINK


	To upload an avatar photo, first complete your Disqus profile. \| View the list of supported HTML tags you can use to style comments. \| Please read our commenting policy.

Parallel

Software and Statistics: Picking NCAA Final Four Winners

Related Reading

More Insights

Currently we allow the following HTML tags in comments:

Single tags

Matching tags

Parallel Recent Articles

Most Popular

This month's Dr. Dobb's Journal

Upcoming Events

Featured Reports

Featured Whitepapers

Most Recent Premium Content

Parallel

Software and Statistics: Picking NCAA Final Four Winners

Related Reading

News

Commentary

Slideshow

Video

Most Popular

More Insights

White Papers

Reports

Webcasts

Currently we allow the following HTML tags in comments:

Single tags

Matching tags

Parallel Recent Articles

Most Popular

This month's Dr. Dobb's Journal

Upcoming Events

Featured Reports

Featured Whitepapers

Most Recent Premium Content