Dr. Dobb's is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them. Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.


Channels ▼
RSS

Parallel

Software and Statistics: Picking NCAA Final Four Winners



And from the Dr. Dobb's Sports Department for all you NCAA Final Four basketball fans....

Three engineering professors at the Georgia Institute of Technology have created the LRMC computer ranking system that consistently predicts NCAA basketball rankings more accurately than the AP poll of sportswriters and the ESPN/USA Today poll of coaches, formulas (the Ratings Percentage Index), other computer models (the Massey ratings and the Sagarin ratings), and even the tournament seeds themselves.

After correctly picking all four of this year's finalists, the LRMC method has now identified 30 of the last 36 Final Four participants (83 percent accuracy over the past nine years of NCAA tournaments) as one of the top two teams in their region. Over the same nine-year stretch, the seedings and polls have correctly identified only 23, and the RPI indentified 21.

LRMC, short for "Logistic Regression Markov Chain," is a college basketball rankings system designed to use only basic scoreboard data, including which teams played, which team had home court advantage and the margin of victory. It was originally designed by Joel Sokol and Paul Kvam and has been maintained and improved by Sokol and George Nemhauser, all three optimization and statistics professors in the Stewart School of Industrial and Systems Engineering at Georgia Tech.

"As fans, we only get to see most tournament teams two or three times at most during the season, so our gut feelings about a team are really colored by how well or poorly they played the few times we've been watching," said Sokol. "On the other hand, our system objectively measures each team's performance in every game it plays, and mathematically balances all of those outcomes to determine an overall ranking."

LRMC seems to have a particular knack for predicting good bubble teams and identifying the top teams. In addition to correctly picking the Final Four, LRMC also correctly identified several over-rated and under-rated teams as potential upsets. First-round losers Drake (5-seed, LRMC #30), Vanderbilt (4-seed, LRMC #38), and Connecticut (4-seed, LRMC #26), as well as second-round loser Georgetown (2-seed, LRMC #12), were all picked by LRMC as significantly over-rated teams.

On the other hand, teams like West Virginia (7-seed, LRMC #17), which defeated second-seeded Duke, and Kansas State (11-seed, LRMC #19), which defeated sixth-seeded USC, were correctly identified by LRMC as under-rated teams that could pull off one or more upsets.

But LRMC isn't perfect -- it picked Clemson as under-rated (upset in the first round) and Davidson wasn't identified as under-rated by any major ranking method, including LRMC.

LRMC differs from other computer rankings systems in two important ways. When determining the value of home court advantage, LRMC considers how much playing at home helps a team win rather than how many points playing on a home court is worth.

Georgia Tech researchers have also been able to show that very close games are often "toss-ups," meaning the better team barely wins more than half the time. So, they determined that winning a close game shouldn't be worth as much as winning easily, and losing a close game shouldn't hurt a team's ranking as much as losing badly. LRMC's ranking methodology takes this into account.

Similar to other rankings systems, LRMC also uses the quality of each team's results and the strength of each team's schedule to rank teams.

So which team does LRMC favor for the top spot this year? It's chosen Kansas, despite UNC, UCLA and Memphis being the top three ranked teams by most systems. Rock chalk, Jayhawks!


Related Reading


More Insights






Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Dr. Dobb's encourages readers to engage in spirited, healthy debate, including taking us to task. However, Dr. Dobb's moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing or spam. Dr. Dobb's further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.