<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Math Goes Pop! &#187; Sports</title>
	<atom:link href="http://www.mathgoespop.com/category/sports/feed" rel="self" type="application/rss+xml" />
	<link>http://www.mathgoespop.com</link>
	<description>Ruminations on the Intersection Between Mathematics and Popular Culture</description>
	<lastBuildDate>Sat, 04 Feb 2012 02:21:45 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Are the 49ers skilled, or just lucky?</title>
		<link>http://www.mathgoespop.com/2012/02/are-the-49ers-skilled-or-just-lucky.html</link>
		<comments>http://www.mathgoespop.com/2012/02/are-the-49ers-skilled-or-just-lucky.html#comments</comments>
		<pubDate>Fri, 03 Feb 2012 20:46:32 +0000</pubDate>
		<dc:creator>Matt</dc:creator>
				<category><![CDATA[Sports]]></category>
		<category><![CDATA[49ers]]></category>
		<category><![CDATA[football]]></category>

		<guid isPermaLink="false">http://www.mathgoespop.com/?p=1647</guid>
		<description><![CDATA[<p>Fans of the two football teams who face off in the Super Bowl will no doubt spend the weekend filled with nervous anticipation &#8211; hopeful that their team will emerge victorious, but certain of the knowledge that there can only be one champion.  For the rest of us, we must hang our heads with relative degrees <span style="color:#777"> . . . &#8594; Read More: <a href="http://www.mathgoespop.com/2012/02/are-the-49ers-skilled-or-just-lucky.html">Are the 49ers skilled, or just lucky?</a></span>]]></description>
			<content:encoded><![CDATA[<p>Fans of the two football teams who face off in the Super Bowl will no doubt spend the weekend filled with nervous anticipation &#8211; hopeful that their team will emerge victorious, but certain of the knowledge that there can only be one champion.  For the rest of us, we must hang our heads with relative degrees of shame, and bide our time until the next season brings with it the promise of new opportunities for all 32 NFL teams.</p>
<p>For a San Francisco <a href="http://en.wikipedia.org/wiki/San_Francisco_49ers">49ers</a> fan like myself, most of the last decade has been spent in a fairly constant state of disappointment.  But after ten years without a playoff appearance, the team blossomed this season under the influence of new head coach <a href="http://en.wikipedia.org/wiki/Jim_Harbaugh">Jim Harbaugh</a>, and came within one game of their first Super Bowl appearance since 1995.</p>
<div id="attachment_1653" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.mathgoespop.com/wp-content/uploads/2012/02/49ers.jpg"><img class="size-full wp-image-1653" title="49ers" src="http://www.mathgoespop.com/wp-content/uploads/2012/02/49ers.jpg" alt="" width="500" height="670" /></a><p class="wp-caption-text">This poster hangs proudly in our apartment.</p></div>
<p>Despite a great season, in which the team won 13 of their 16 games for the first time since 1997, many people have still voiced doubts about whether or not this team is for real.  What if this season was just a fluke?  After all, quarterback <a href="http://en.wikipedia.org/wiki/Alex_Smith">Alex Smith</a>, long jeered by 49ers fans since being drafted in 2005, was the starting quarterback for every game this season.  And while his stats this year were his best ever, they don&#8217;t compare favorably to quarterbacks like <a href="http://en.wikipedia.org/wiki/Tom_Brady">Tom Brady</a>, <a href="http://en.wikipedia.org/wiki/Aaron_Rodgers">Aaron Rodgers</a>, or <a href="http://en.wikipedia.org/wiki/Drew_Brees">Drew Brees</a>.  While the offense might not be the flashiest, this year it was relatively error-free; combined with stellar defense and special teams units, the 49ers were able to grind out wins fairly consistently from week to week.</p>
<p>The question remains, though, no matter how much the fans may try to ignore it: how much of a role did simple luck play in the 49ers season?  To answer this question, of course, one needs a way to measure the team&#8217;s luck, and compare it to other teams.  Alternatively, one needs to look at some measures of skill, and show that in fact, the 49ers simply were a more skilled team this year, both compared to other teams in the league and compared to the 49ers of previous seasons.  Here I&#8217;ll talk a little bit about luck, and in a follow-up post I&#8217;ll talk more about skill.</p>
<p>In what sense could the 49ers be considered a lucky team during the 2011 season?  Well, one way to look for a team&#8217;s luck is to measure the strength of the opposing teams during the course of the season.  If your team plays all of its games against terrible opposing teams, then it&#8217;s more likely your team will have a favorable record, even if it&#8217;s only mediocre, simply by virtue of the fact that the teams it plays against aren&#8217;t very good.  Conversely, if your team plays all of its games against very strong teams, it will be much harder to get a winning record.  Bearing this in mind, we ask: how lucky were the 49ers in terms of the strength of the opponents they faced?</p>
<p>The answer, as many football fans could probably guess, is fairly lucky.  Part of this stems from the fact that the 49ers played six of their sixteen games against opponents from their own division, which is not known for its strength.  Here&#8217;s a list of all teams in the 2011 season, ranked by the strength of their schedules.</p>
<div id="attachment_1652" class="wp-caption aligncenter" style="width: 467px"><a href="http://www.mathgoespop.com/wp-content/uploads/2012/02/Screen-shot-2012-02-03-at-9.55.01-AM.png"><img class="size-full wp-image-1652" title="2011 SOS" src="http://www.mathgoespop.com/wp-content/uploads/2012/02/Screen-shot-2012-02-03-at-9.55.01-AM.png" alt="" width="457" height="579" /></a><p class="wp-caption-text">Table 1: 2011 strength of schedule. Green indicates teams that made it to the playoffs.</p></div>
<p>As you can see, the 49ers schedule is ranked near the very bottom, tied with New England and just above New Orleans.  So the schedule was relatively easy.  Having said that, I&#8217;ve heard many people attribute the success of the 49ers solely to the ease of their schedule, but have not heard the same criticism made for teams like New Orleans, New England, and Green Bay.  To remark on the ease of the 49ers schedule without pointing out the collective ease of all of these teams&#8217; schedules seems a bit unfair.</p>
<p>Let&#8217;s ask another question: how does the strength of the 2011 schedule compare to the strength of the 2010 schedule?  Here&#8217;s a new table, just like the one above, but with data from the 2010 season:</p>
<div id="attachment_1651" class="wp-caption aligncenter" style="width: 471px"><a href="http://www.mathgoespop.com/wp-content/uploads/2012/02/Screen-shot-2012-02-03-at-9.57.12-AM.png"><img class="size-full wp-image-1651" title="2010SOS" src="http://www.mathgoespop.com/wp-content/uploads/2012/02/Screen-shot-2012-02-03-at-9.57.12-AM.png" alt="" width="461" height="579" /></a><p class="wp-caption-text">Table 2: 2010 strength of schedule. Green indicates teams that made it to the playoffs.</p></div>
<p>In the 2011 season, the 49ers were much closer to the middle of the pack as far as strength of schedule is concerned.  Comparing year over year, based on this data, it would not seem unreasonable to attribute some of the 49ers&#8217; newfound success to their relatively easy schedule.</p>
<p>But wait! the careful reader may exclaim.  There&#8217;s a chicken-and-egg argument to be made here: did the 49ers improve in 2011 because they had an easier schedule, or did they have an easier schedule because they had improved?  Indeed, San Francisco&#8217;s 13 wins in 2011 correspond to an extra 13 losses for their opponents, as compared to only 6 wins for the team in 2010.  Any conclusions drawn from the above data must keep this in mind, since the opponents wins for any team are not independent of the team&#8217;s performance.</p>
<p>One way to try and correct this is to remove each team&#8217;s performance when considering the strength of the opposing teams.  So, for example, in 2011 the teams the 49ers faced won 115 games and lost 141, for a win percentage of just 0.449.  However, if we remove games played against the 49ers, who had a record of 13 wins and 3 losses, then among all remaining games, the teams the 49ers faced won 112 games, and lost 129, for a slightly higher win percentage of .467.</p>
<p>If we make this adjustment across all teams for the 2011 season, the following picture emerges.</p>
<div id="attachment_1657" class="wp-caption aligncenter" style="width: 461px"><a href="http://www.mathgoespop.com/wp-content/uploads/2012/02/Screen-shot-2012-02-03-at-12.03.23-PM.png"><img class="size-full wp-image-1657" title="2011AdjSOS" src="http://www.mathgoespop.com/wp-content/uploads/2012/02/Screen-shot-2012-02-03-at-12.03.23-PM.png" alt="" width="451" height="579" /></a><p class="wp-caption-text">Table 3: 2011 Adjusted 2011 strength of schedule.  Green indicates teams that made it to the playoffs.</p></div>
<p>While San Francisco&#8217;s schedule still ranks near the bottom after this adjustment, a few other teams have slipped just below it. How does this compare to the previous season?  To wrap things up, let&#8217;s look at the adjusted strength of schedule for 2010.</p>
<div id="attachment_1658" class="wp-caption aligncenter" style="width: 465px"><a href="http://www.mathgoespop.com/wp-content/uploads/2012/02/Screen-shot-2012-02-03-at-11.57.03-AM.png"><img class="size-full wp-image-1658" title="2010AdjSOS" src="http://www.mathgoespop.com/wp-content/uploads/2012/02/Screen-shot-2012-02-03-at-11.57.03-AM.png" alt="" width="455" height="579" /></a><p class="wp-caption-text">Table 2: Adjusted 2010 strength of schedule. Green indicates teams that made it to the playoffs.</p></div>
<p>Notice that in absolute terms, the difference in the 49ers&#8217; schedule is quite small comparing the adjusted 2010 season to the adjusted 2011 season.  In 2011, the number of adjusted games opponents won was 112, versus just 115 in 2010.  So from this standpoint, much of the 49ers&#8217; progress in 2011 appears to be due less to having a lucky schedule, and more do to with actual improvements of the team.  As promised above, next time I will dig a little deeper in an attempt to quantify the 49ers&#8217; improvement during this season.  For now though, I must pick out an appropriate outfit to wear this Sunday, in mourning of the absence of this team from the Super Bowl.</p>
<p>(A final note for the more statistically minded: In 2011, the correlation between a team&#8217;s winning percentage and the opponents&#8217; combined losing percentage is quite high &#8211; the <a href="http://en.wikipedia.org/wiki/Pearson_product-moment_correlation_coefficient">correlation coefficient</a> is roughly .766.  With the adjustment mentioned above, this drops to .529, which agrees with the notion that there should be a weaker correlation between one team&#8217;s record and the combined record of its opponents when the team&#8217;s record is removed from the combined record.  For 2010, the correlations are a bit weaker: .574 and .333, respectively).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mathgoespop.com/2012/02/are-the-49ers-skilled-or-just-lucky.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Playoff Probabilities</title>
		<link>http://www.mathgoespop.com/2011/10/playoff-probabilities.html</link>
		<comments>http://www.mathgoespop.com/2011/10/playoff-probabilities.html#comments</comments>
		<pubDate>Wed, 05 Oct 2011 19:33:02 +0000</pubDate>
		<dc:creator>Matt</dc:creator>
				<category><![CDATA[Sports]]></category>
		<category><![CDATA[baseball]]></category>
		<category><![CDATA[probability]]></category>

		<guid isPermaLink="false">http://www.mathgoespop.com/?p=1454</guid>
		<description><![CDATA[<p>Continuing with last week&#8217;s theme, and since we are in the midst of playoffs, I&#8217;d like to take a moment now to discuss another link between baseball and mathematics.  This link is particularly timely since the scuttlebutt on the internet suggests that next year the playoff rules for baseball will be changed: the number of teams <span style="color:#777"> . . . &#8594; Read More: <a href="http://www.mathgoespop.com/2011/10/playoff-probabilities.html">Playoff Probabilities</a></span>]]></description>
			<content:encoded><![CDATA[<p>Continuing with <a href="http://www.mathgoespop.com/2011/09/moneyball.html">last</a> week&#8217;s theme, and since we are in the midst of playoffs, I&#8217;d like to take a moment now to discuss another link between baseball and mathematics.  This link is particularly timely since the <a href="http://www.baseball-reference.com/blog/archives/10800">scuttlebutt</a> on the internet suggests that next year the playoff rules for baseball will be changed: the number of teams competing for the World Series will increase from 8 to 10, and because of that, another round of playoff games will be introduced.</p>
<p>Currently, the playoffs consist of three rounds.  The first round is the Division Series, in which eight teams compete in a best-of-five match-up (equivalently, a first-to-three match-up, i.e. the first team to win three games wins the series).  The second and third rounds, better known as the Championship Series and World Series, are composed of four and two teams, respectively, but are both best-of-seven (equivalently, first-to-four).  Because of these three rounds of several games each, the playoff season is already quite long; therefore, the new proposed playoff round, it has been suggested, would be composed of either a single game between competing teams, or a best-of-three (first-to-two) series between the two teams.</p>
<p>Many people take issue with such a short series on the grounds of fairness.  In a season where each team plays 162 games, they say, it&#8217;s not fair for a team&#8217;s World Series hopes to ride on a single game, or even a short series composed of at most three games.  There are even those who suggest that the Division Series is too short, and that all three of the current rounds should be a best-of-seven.  These are noble sentiments, but are they reasonable?  We can use mathematics to try and answer this question.</p>
<p>Suppose two teams are meeting for a playoff series, and the probability that one team (call it, I don&#8217;t know, the <a href="http://sanfrancisco.giants.mlb.com/index.jsp?c_id=sf">Giants</a>) will win a single game is <em>p</em> (this model is fairly simple, and does not take into account advantages associated with the starting pitcher, for example, but let&#8217;s keep things basic for now).  Then the probability that this team will win a one game series is again <em>p</em>, since the series consists of a single game.</p>
<p>What if the series is three games long?  In this case, the Giants will win if they win the first two games, or split the first two games and win the third game.  So there are three outcomes: WW, WLW, or LWW.  The probability of the first event is <img src='http://s.wordpress.com/latex.php?latex=p%5E2&#038;bg=T&#038;fg=000000&#038;s=0' alt='p^2' title='p^2' class='latex' />, while the probability of the second and third events are both <img src='http://s.wordpress.com/latex.php?latex=p%5E2%281-p%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='p^2(1-p)' title='p^2(1-p)' class='latex' /> (probability <em>p</em> of success is the same as probability 1-<em>p</em> of failure).  Adding these three probabilities gives the total probability that the team will win a best-of-three series:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=p%5E2%20%2B%202p%5E2%281-p%29%20%3D%20p%5E2%283-2p%29.&#038;bg=T&#038;fg=000000&#038;s=0' alt='p^2 + 2p^2(1-p) = p^2(3-2p).' title='p^2 + 2p^2(1-p) = p^2(3-2p).' class='latex' /></p>
<p style="text-align: left;">In a best-of-five series, the Giants will win if they win three in a row, two of the first three and the fourth, or two of the first four and the fifth.  Using <a href="http://en.wikipedia.org/wiki/Combination">combinations</a> to count the possibilities, we see that in this case, the probability of the Giants winning the series is equal to</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=p%5E3%20%2B%20%5Cbinom%7B3%7D%7B2%7Dp%5E3%281-p%29%2B%5Cbinom%7B4%7D%7B2%7Dp%5E3%281-p%29%5E2&#038;bg=T&#038;fg=000000&#038;s=0' alt='p^3 + \binom{3}{2}p^3(1-p)+\binom{4}{2}p^3(1-p)^2' title='p^3 + \binom{3}{2}p^3(1-p)+\binom{4}{2}p^3(1-p)^2' class='latex' /></p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%3D%20p%5E3%2810-15p%2B6p%5E2%29.&#038;bg=T&#038;fg=000000&#038;s=0' alt='= p^3(10-15p+6p^2).' title='= p^3(10-15p+6p^2).' class='latex' /></p>
<p style="text-align: left;">With the same type of argument you can calculate the probability that the Giants win a best-of-seven series.  I&#8217;ll spare you the details: the result is <img src='http://s.wordpress.com/latex.php?latex=p%5E4%2835-84p%2B70p%5E2-20p%5E3%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='p^4(35-84p+70p^2-20p^3)' title='p^4(35-84p+70p^2-20p^3)' class='latex' />.</p>
<p style="text-align: left;">In each case, the probability of winning the series is a <a href="http://en.wikipedia.org/wiki/Polynomial">polynomial</a> in <em>p</em>, the probability of winning a single game.  But how to these polynomials compare?  Let&#8217;s turn to technology to lead the way!</p>
<div id="attachment_1456" class="wp-caption aligncenter" style="width: 610px"><a href="http://www.mathgoespop.com/wp-content/uploads/2011/10/Picture-5.png"><img class="size-full wp-image-1456" title="Picture 5" src="http://www.mathgoespop.com/wp-content/uploads/2011/10/Picture-5.png" alt="" width="600" height="532" /></a><p class="wp-caption-text">Probabilities for a one, three, five, and seven game series.</p></div>
<p>Above is a graph of these four functions &#8211; the <em>x</em>-axis represents the probability <em>p</em>, while the <em>y</em>-axis represents the probability of winning the series.  The dark blue graph is for a single-game series (the function is <em>p</em>), the light blue graph is for a three-game series (the function is <img src='http://s.wordpress.com/latex.php?latex=p%5E2%283-2p%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='p^2(3-2p)' title='p^2(3-2p)' class='latex' />, the light green graph is for a five-game series (the function is <img src='http://s.wordpress.com/latex.php?latex=%3D%20p%5E3%281-15p%2B6p%5E2%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='= p^3(1-15p+6p^2)' title='= p^3(1-15p+6p^2)' class='latex' />), and the red graph is for a seven-game series (the function is <img src='http://s.wordpress.com/latex.php?latex=p%5E4%2835-84p%2B70p%5E2-20p%5E3%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='p^4(35-84p+70p^2-20p^3)' title='p^4(35-84p+70p^2-20p^3)' class='latex' />).  What can we deduce from the picture above?</p>
<p>First, note that a longer series benefits the stronger team more than the weaker team &#8211; this makes intuitive sense, if you think about it.  Also, for teams that are perfectly evenly matched (i.e. <em>p = </em>0.5), the length of the series doesn&#8217;t affect the probability of winning the series, which is also 50% in each case.</p>
<p>But what about teams with a slight, moderate, or strong advantage over their competition?  How does the length of the series affect the probability of winning the series?  Let&#8217;s look at a small table of values, in the cases <em>p</em> = .55, <em>p</em> = .6, and <em>p</em> = .7.</p>

<table id="wp-table-reloaded-id-4-no-1" class="wp-table-reloaded wp-table-reloaded-id-4">
<thead>
	<tr class="row-1 odd">
		<th class="column-1">p</th><th class="column-2">Best of One Odds of Success</th><th class="column-3">Best of Three Odds of Success</th><th class="column-4">Best of Five Odds of Success</th><th class="column-5">Best of Seven Odds of Success</th>
	</tr>
</thead>
<tbody>
	<tr class="row-2 even">
		<td class="column-1">.550</td><td class="column-2">.550</td><td class="column-3">.575</td><td class="column-4">.593</td><td class="column-5">.608</td>
	</tr>
	<tr class="row-3 odd">
		<td class="column-1">.600</td><td class="column-2">.600</td><td class="column-3">.648</td><td class="column-4">.683</td><td class="column-5">.710</td>
	</tr>
	<tr class="row-4 even">
		<td class="column-1">.700</td><td class="column-2">.700</td><td class="column-3">.784</td><td class="column-4">.837</td><td class="column-5">.874</td>
	</tr>
</tbody>
</table>

<p>As you can see from the table (or the graph), the more evenly matched the teams, the less of a difference the length of the series makes.  If your team has a 55% chance of winning a given game, the advantage in a seven game series is increased by a little less than 6 percentage points.  With a 60% chance of winning a given game, the advantage in a seven game series is increased by 11 percentage points, and with a 70% chance of winning a given game, the advantage in a seven game series is increased by over 17 percentage points.</p>
<p>Not also that the change from a best-of-five series to a best-of-seven series isn&#8217;t really very large.  Even if your team is heavily favored (70% probability of winning each game), the change from a best-of-five series to a best-of-seven series is less than four points.  With more evenly matched teams, the difference is even smaller, suggesting that expansion of the Division Series from a maximum of five to a maximum of seven games isn&#8217;t necessarily a great idea.</p>
<p>On the other hand, the largest change in probabilities is between the jump from a best-of-one series to a best-of-three series.  While the change isn&#8217;t so significant for evenly matched teams (and more evenly matched teams would be most likely to play each other in this round under the suggested rule changes), for match-ups in which one team is heavily favored, the difference can be more significant.</p>
<p>Whether or not one wants longer series or shorter series depends, I suppose, on one&#8217;s baseball philosophy.  It certainly seems like having everything ride on a single game after a season of more than 150 games is a little unbalanced, but from a mathematical standpoint, the stronger team will most likely gain only a small advantage by moving to a three game series.  Of course, this simplified model can only tell us so much, and it&#8217;s possible that the advantages of a longer series are being underrepresented here.  To err on the side of caution, I&#8217;d be more inclined to support a best-of-three series, though whether or not this is possible without stretching the season too long is something that the folks who are paid better than me to think about these matters will have to decide.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mathgoespop.com/2011/10/playoff-probabilities.html/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Moneyball</title>
		<link>http://www.mathgoespop.com/2011/09/moneyball.html</link>
		<comments>http://www.mathgoespop.com/2011/09/moneyball.html#comments</comments>
		<pubDate>Wed, 28 Sep 2011 18:07:53 +0000</pubDate>
		<dc:creator>Matt</dc:creator>
				<category><![CDATA[Math in Books]]></category>
		<category><![CDATA[Math in the Movies]]></category>
		<category><![CDATA[Sports]]></category>
		<category><![CDATA[baseball]]></category>
		<category><![CDATA[moneyball]]></category>
		<category><![CDATA[pythagorean expectation]]></category>

		<guid isPermaLink="false">http://www.mathgoespop.com/?p=1419</guid>
		<description><![CDATA[<p>This weekend, mathematics played a supporting role to Brad Pitt in one of fall&#8217;s first critical darlings, Moneyball.  Based on the Michael Lewis book of the same name, the film profiles the Oakland A&#8217;s during their 2002 bid for World Series glory.  What allegedly separates their story from the story of other teams during that <span style="color:#777"> . . . &#8594; Read More: <a href="http://www.mathgoespop.com/2011/09/moneyball.html">Moneyball</a></span>]]></description>
			<content:encoded><![CDATA[<p>This weekend, mathematics played a supporting role to Brad Pitt in one of fall&#8217;s first critical darlings, <a href="http://www.imdb.com/title/tt1210166/">Moneyball</a>.  Based on the Michael Lewis <a href="http://en.wikipedia.org/wiki/Moneyball">book of the same name</a>, the film profiles the Oakland A&#8217;s during their 2002 bid for World Series glory.  What allegedly separates their story from the story of other teams during that season is the way General Manager Billy Beane, played by Brad Pitt, deals with the budget constraints imposed on him by the team&#8217;s owners.</p>
<p style="text-align: center;"><object style="height: 390px; width: 640px;"><param name="movie" value="http://www.youtube.com/v/qxxO2sq0amY?version=3" /><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><embed type="application/x-shockwave-flash" width="640" height="360" src="http://www.youtube.com/v/qxxO2sq0amY?version=3" allowfullscreen="true" allowscriptaccess="always"></embed></object></p>
<p style="text-align: left;">With a payroll roughly a third the size of the Yankees&#8217;, Beane understood that the playing field was not a level one from an economic standpoint.  What&#8217;s more, at the end of the 2001 season, three of the A&#8217;s star players left Oakland for bigger paychecks.  To fill the void, the film (and book) show how Beane took a more analytic approach, and used statistical analysis to uncover players who were undervalued and could be purchased for much less than they were worth.  Beane, together with Paul DePodesta (Peter Brand in the film, and played by Jonah Hill), used a <a href="http://en.wikipedia.org/wiki/Sabermetrics">sabermetric</a> approach to lead the A&#8217;s to a league-leading 103 wins for the season. While their first-place ranking for number of wins that year was shared with the Yankees, they spent much less per win than their New York counterparts (the A&#8217;s spent the least per win, while the Yankees spent the third most).  Here&#8217;s a table comparing the teams; the payroll numbers are taken from <a href="http://content.usatoday.com/sportsdata/baseball/mlb/salaries/team/2002">here</a>, and differ slightly from the numbers that appear in the book.</p>
<p style="text-align: left;">
<table id="wp-table-reloaded-id-3-no-1" class="wp-table-reloaded wp-table-reloaded-id-3">
<thead>
	<tr class="row-1 odd">
		<th class="column-1">Team</th><th class="column-2">Wins</th><th class="column-3">Losses</th><th class="column-4">Payroll</th><th class="column-5">Cost Per Win (millions)</th>
	</tr>
</thead>
<tbody>
	<tr class="row-2 even">
		<td class="column-1">Oakland Athletics</td><td class="column-2">103</td><td class="column-3">59</td><td class="column-4">$40,004,167</td><td class="column-5">$0.388</td>
	</tr>
	<tr class="row-3 odd">
		<td class="column-1">Minnesota Twins</td><td class="column-2">94</td><td class="column-3">67</td><td class="column-4">$40,225,000</td><td class="column-5">$0.428</td>
	</tr>
	<tr class="row-4 even">
		<td class="column-1">Montreal Expos</td><td class="column-2">83</td><td class="column-3">79</td><td class="column-4">$38,670,500</td><td class="column-5">$0.466</td>
	</tr>
	<tr class="row-5 odd">
		<td class="column-1">Florida Marlins</td><td class="column-2">79</td><td class="column-3">83</td><td class="column-4">$41,979,917</td><td class="column-5">$0.531</td>
	</tr>
	<tr class="row-6 even">
		<td class="column-1">Cincinnati Reds</td><td class="column-2">78</td><td class="column-3">84</td><td class="column-4">$45,050,390</td><td class="column-5">$0.578</td>
	</tr>
	<tr class="row-7 odd">
		<td class="column-1">Pittsburgh Pirates</td><td class="column-2">72</td><td class="column-3">89</td><td class="column-4">$42,323,599</td><td class="column-5">$0.588</td>
	</tr>
	<tr class="row-8 even">
		<td class="column-1">Los Angeles Angels</td><td class="column-2">99</td><td class="column-3">63</td><td class="column-4">$61,721,667</td><td class="column-5">$0.624</td>
	</tr>
	<tr class="row-9 odd">
		<td class="column-1">Tampa Bay Rays</td><td class="column-2">55</td><td class="column-3">106</td><td class="column-4">$34,380,000</td><td class="column-5">$0.625</td>
	</tr>
	<tr class="row-10 even">
		<td class="column-1">San Diego Padres</td><td class="column-2">66</td><td class="column-3">96</td><td class="column-4">$41,425,000</td><td class="column-5">$0.628</td>
	</tr>
	<tr class="row-11 odd">
		<td class="column-1">Chicago White Sox</td><td class="column-2">81</td><td class="column-3">81</td><td class="column-4">$57,052,833</td><td class="column-5">$0.704</td>
	</tr>
	<tr class="row-12 even">
		<td class="column-1">Philadelphia Phillies</td><td class="column-2">80</td><td class="column-3">81</td><td class="column-4">$57,954,999</td><td class="column-5">$0.724</td>
	</tr>
	<tr class="row-13 odd">
		<td class="column-1">Houston Astros</td><td class="column-2">84</td><td class="column-3">78</td><td class="column-4">$63,448,417</td><td class="column-5">$0.755</td>
	</tr>
	<tr class="row-14 even">
		<td class="column-1">Kansas City Royals</td><td class="column-2">62</td><td class="column-3">100</td><td class="column-4">$47,257,000</td><td class="column-5">$0.762</td>
	</tr>
	<tr class="row-15 odd">
		<td class="column-1">St. Louis Cardinals</td><td class="column-2">97</td><td class="column-3">65</td><td class="column-4">$74,660,875</td><td class="column-5">$0.770</td>
	</tr>
	<tr class="row-16 even">
		<td class="column-1">Colorado Rockies</td><td class="column-2">73</td><td class="column-3">89</td><td class="column-4">$56,851,043</td><td class="column-5">$0.779</td>
	</tr>
	<tr class="row-17 odd">
		<td class="column-1">San Francisco Giants</td><td class="column-2">95</td><td class="column-3">66</td><td class="column-4">$78,299,835</td><td class="column-5">$0.824</td>
	</tr>
	<tr class="row-18 even">
		<td class="column-1">Seattle Mariners</td><td class="column-2">93</td><td class="column-3">69</td><td class="column-4">$80,282,668</td><td class="column-5">$0.863</td>
	</tr>
	<tr class="row-19 odd">
		<td class="column-1">Milwaukee Brewers</td><td class="column-2">56</td><td class="column-3">106</td><td class="column-4">$50,287,833</td><td class="column-5">$0.898</td>
	</tr>
	<tr class="row-20 even">
		<td class="column-1">Baltimore Orioles</td><td class="column-2">67</td><td class="column-3">95</td><td class="column-4">$60,493,487</td><td class="column-5">$0.903</td>
	</tr>
	<tr class="row-21 odd">
		<td class="column-1">Atlanta Braves</td><td class="column-2">101</td><td class="column-3">59</td><td class="column-4">$93,470,367</td><td class="column-5">$0.925</td>
	</tr>
	<tr class="row-22 even">
		<td class="column-1">Toronto Blue Jays</td><td class="column-2">78</td><td class="column-3">84</td><td class="column-4">$76,864,333</td><td class="column-5">$0.985</td>
	</tr>
	<tr class="row-23 odd">
		<td class="column-1">Detroit Tigers</td><td class="column-2">55</td><td class="column-3">106</td><td class="column-4">$55,048,000</td><td class="column-5">$1.001</td>
	</tr>
	<tr class="row-24 even">
		<td class="column-1">Los Angeles Dodgers</td><td class="column-2">92</td><td class="column-3">70</td><td class="column-4">$94,850,953</td><td class="column-5">$1.031</td>
	</tr>
	<tr class="row-25 odd">
		<td class="column-1">Arizona Diamondbacks</td><td class="column-2">98</td><td class="column-3">64</td><td class="column-4">$102,819,999</td><td class="column-5">$1.049</td>
	</tr>
	<tr class="row-26 even">
		<td class="column-1">Cleveland Indians</td><td class="column-2">74<br />
</td><td class="column-3">88</td><td class="column-4">$78,909,449</td><td class="column-5">$1.066</td>
	</tr>
	<tr class="row-27 odd">
		<td class="column-1">Chicago Cubs</td><td class="column-2">67</td><td class="column-3">95</td><td class="column-4">$75,690,833</td><td class="column-5">$1.130</td>
	</tr>
	<tr class="row-28 even">
		<td class="column-1">Boston Red Sox</td><td class="column-2">93</td><td class="column-3">69</td><td class="column-4">$108,366,060</td><td class="column-5">$1.165</td>
	</tr>
	<tr class="row-29 odd">
		<td class="column-1">New York Yankees</td><td class="column-2">103</td><td class="column-3">58</td><td class="column-4">$125,928,583</td><td class="column-5">$1.223</td>
	</tr>
	<tr class="row-30 even">
		<td class="column-1">New York Mets</td><td class="column-2">75</td><td class="column-3">86</td><td class="column-4">$94,633,593</td><td class="column-5">$1.262</td>
	</tr>
	<tr class="row-31 odd">
		<td class="column-1">Texas Rangers</td><td class="column-2">72</td><td class="column-3">90</td><td class="column-4">$105,726,122</td><td class="column-5">$1.468</td>
	</tr>
</tbody>
</table>
</p>
<p>&nbsp;</p>
<p>Their new approach threw out many pieces of conventional baseball wisdom: stealing bases and bunting were strict no-no&#8217;s, for example.  Naturally, these changes brought about some tension, and it&#8217;s this tension that makes for the dramatic thrust of the film.  In particular, mathematics takes a backseat, though there are some little cameos for those who are paying attention.</p>
<p>The most significant piece of mathematics making an appearance in the film is the <a href="http://en.wikipedia.org/wiki/Pythagorean_expectation">Pythagorean Expectation</a>, a formula discovered by <a href="http://en.wikipedia.org/wiki/Bill_James">Bill James</a> that estimates a team&#8217;s win percentage in terms of its runs scored and runs allowed.  More specifically, the formula asserts that a team&#8217;s win percentage is approximately equal to</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cfrac%7B%5Ctextup%7Bruns%20scored%7D%5E2%7D%7B%5Ctextup%7Bruns%20scored%7D%5E2%2B%5Ctextup%7Bruns%20allowed%7D%5E2%7D%3D%5Cfrac%7B1%7D%7B1%2B%5Ctextup%7B%28runs%20scored%2Fruns%20allowed%29%7D%5E2%7D.&#038;bg=T&#038;fg=000000&#038;s=0' alt='\frac{\textup{runs scored}^2}{\textup{runs scored}^2+\textup{runs allowed}^2}=\frac{1}{1+\textup{(runs scored/runs allowed)}^2}.' title='\frac{\textup{runs scored}^2}{\textup{runs scored}^2+\textup{runs allowed}^2}=\frac{1}{1+\textup{(runs scored/runs allowed)}^2}.' class='latex' /></p>
<p>For example, the 2002 A&#8217;s scored a total of 800 runs, and allowed a total of 654 runs, for a Pythagorean Expectation of</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cfrac%7B800%5E2%7D%7B800%5E2%2B654%5E2%7D%20%5Capprox%200.599.&#038;bg=T&#038;fg=000000&#038;s=0' alt='\frac{800^2}{800^2+654^2} \approx 0.599.' title='\frac{800^2}{800^2+654^2} \approx 0.599.' class='latex' /></p>
<p>(relevant stats can be found <a href="http://espn.go.com/mlb/stats/team/_/stat/batting/year/2002">here</a>). This compares to the team&#8217;s actual win percentage of 103/162, which is around 0.636.</p>
<p>In the film, Peter Brand applies this formula in order to estimate the number of runs the team needs to score, along with the maximum number of runs it can allow, in order to secure a playoff spot.  In one scene, he tells Billy Beane that he thinks the A&#8217;s will need to win at least 99 games to guarantee a playoff spot.  In a 162 game season, this equates to a win percentage of around 0.611.  In order to ensure that the Pythagorean Expectation is at least this large, we set</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cfrac%7B1%7D%7B1%2B%5Ctextup%7B%28runs%20scored%2Fruns%20allowed%29%7D%5E2%7D%20%3E%200.611.&#038;bg=T&#038;fg=000000&#038;s=0' alt='\frac{1}{1+\textup{(runs scored/runs allowed)}^2} &gt; 0.611.' title='\frac{1}{1+\textup{(runs scored/runs allowed)}^2} &gt; 0.611.' class='latex' /></p>
<p style="text-align: left;">With a little algebra, this is the same as</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Ctextup%7Bruns%20allowed%2Fruns%20scored%7D%20%3C%20%5Csqrt%7B%5Cfrac%7B1%7D%7B.611%7D-1%7D%5Capprox%200.798.&#038;bg=T&#038;fg=000000&#038;s=0' alt='\textup{runs allowed/runs scored} &lt; \sqrt{\frac{1}{.611}-1}\approx 0.798.' title='\textup{runs allowed/runs scored} &lt; \sqrt{\frac{1}{.611}-1}\approx 0.798.' class='latex' /></p>
<p>Brand then informs Beane that in order for this to happen, the team needs to score at least 814 runs, and can allow no more than 645 runs.  This gives a runs allowed to runs scored ratio of 645/814, or around 0.793 &lt; 0.798 (though, if I were being anal, I would point out that with 814 runs scored, the team could allow as many as 649 runs and still have a runs scored to runs allowed to runs scored ratio that is less than 0.798).</p>
<p>While the math formulas on display in the film are accurate, I would be remiss if I did not briefly discuss Hill&#8217;s portrayal of Peter Brand.  Overall, Hill does a good job; though Brand is clearly a nerd, Hill&#8217;s portrayal usually avoids caricature.</p>
<p style="text-align: center;"><object style="height: 390px; width: 640px;"><param name="movie" value="http://www.youtube.com/v/cVXLuX6Y5zs?version=3" /><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><embed type="application/x-shockwave-flash" width="640" height="360" src="http://www.youtube.com/v/cVXLuX6Y5zs?version=3" allowfullscreen="true" allowscriptaccess="always"></embed></object></p>
<p>Like every other film featuring characters who are good at math, though, Moneyball can&#8217;t help itself from including a scene where we see how good Brand is at math because he can do mental calculations quickly.  This particular scene takes place when Brand is sitting in his first meeting with Beane and the rest of the baseball scouts, and though it serves to highlight the tension that exists between Brand&#8217;s new school of thought and more traditional baseball thinking, I think the scene could have been just as effective without the clichéd math exercise.</p>
<p>Also, in the interest of full disclosure, I should point out that there are some who feel the story told in Moneyball (both the film and the book) is an exaggeration.  More specifically, as <a href="http://www.slate.com/id/2304262/">this</a> Slate article discusses, many people believe that the reason for the A&#8217;s success during the early aughts had less to do with sabermetrics, and more to do with the fact that they had awesome pitchers in <a href="http://en.wikipedia.org/wiki/Tim_Hudson">Tim Hudson</a>, <a href="http://en.wikipedia.org/wiki/Mark_Mulder">Mark Mulder</a>, and <a href="http://en.wikipedia.org/wiki/Barry_Zito">Barry Zito</a>, none of whom feature prominently in the book or film.  While I don&#8217;t feel knowledgeable enough to weigh in decisively on this issue, the role of the defense certainly appears to be underrepresented here.</p>
<p>To try and convince you of this, recall that the A&#8217;s made it to the playoffs in four consecutive years, from 2000-2003.  Here is some data on how many runs they scored and how many runs they allowed during each of those years, and in 2004, when they did not make the playoffs:</p>
<p><a href="http://www.mathgoespop.com/wp-content/uploads/2011/09/Picture-3.png"><img class="aligncenter size-full wp-image-1449" title="Runs Data" src="http://www.mathgoespop.com/wp-content/uploads/2011/09/Picture-3.png" alt="" width="640" height="403" /></a>Observe that especially from 2001-2003, while the A&#8217;s offense declined, their defense remained consistent in allowing relatively few runs.  Of course, this should not be viewed in a vacuum, but rather in relation to how baseball as a whole performed.  Therefore, it is better to consider not runs scored and runs allowed, but runs scored and runs allowed as a proportion of runs scored and runs allowed in the American League.  With this slight adjustment, we get the following picture:</p>
<p><a href="http://www.mathgoespop.com/wp-content/uploads/2011/09/Picture-4.png"><img class="aligncenter size-full wp-image-1450" title="Picture 4" src="http://www.mathgoespop.com/wp-content/uploads/2011/09/Picture-4.png" alt="" width="640" height="403" /></a>Note in the above that a proportion of 1 means that the A&#8217;s were performing at an average rate, while a proportion greater than 1 indicates above-average performance, and a proportion less than 1 indicates below-average performance.  As we can see from the data, in 2001-2003, the A&#8217;s defense was allowing runs at a rate well below the average; in other words, the defense was relatively strong.  On the other hand, during the same period, the offense consistently weakened year-over-year, so that the number of runs the A&#8217;s scored was below the league average in 2003-2004.  In particular, during the 2002 season profiled in Moneyball, the number of runs scored took a sharp downturn relative to the league average, while the number of runs allowed still remained well below average.  This indicates, to me at least, that the role of the defense was certainly an important factor in the A&#8217;s playoff runs during the 2002 and 2003 seasons.  Note also that in the 2004 season the number of runs allowed rose sharply relative to the league average; without a corresponding uptick in runs scored, the A&#8217;s didn&#8217;t make it to the playoffs.</p>
<p>Nevertheless, I don&#8217;t think the issue is binary; excellent pitching and a sabermetric approach probably combined to help the A&#8217;s.  Even though Moneyball only explores one of these issues, it&#8217;s still a film well worth seeing.  If you&#8217;re no fan of mathematics, don&#8217;t worry, there isn&#8217;t much on display.  And if you&#8217;re no fan of baseball, surprisingly, I think you might enjoy the movie anyway.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mathgoespop.com/2011/09/moneyball.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Scoreboard Stats</title>
		<link>http://www.mathgoespop.com/2011/05/scoreboard-stats.html</link>
		<comments>http://www.mathgoespop.com/2011/05/scoreboard-stats.html#comments</comments>
		<pubDate>Thu, 26 May 2011 21:28:19 +0000</pubDate>
		<dc:creator>Matt</dc:creator>
				<category><![CDATA[Math in the News]]></category>
		<category><![CDATA[Sports]]></category>
		<category><![CDATA[baseball]]></category>
		<category><![CDATA[e]]></category>
		<category><![CDATA[poisson distribution]]></category>
		<category><![CDATA[probability]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.mathgoespop.com/?p=1247</guid>
		<description><![CDATA[<p>A couple of weeks ago I noticed this article on the Yahoo Sports page, which highlighted a statistically rare event that occurred in the American League on Sunday, May 8th.  On that day, 7 baseball games were played on the AL schedule, and in all of those games one team scored exactly 5 runs.  The post <span style="color:#777"> . . . &#8594; Read More: <a href="http://www.mathgoespop.com/2011/05/scoreboard-stats.html">Scoreboard Stats</a></span>]]></description>
			<content:encoded><![CDATA[<p>A couple of weeks ago I noticed <a href="http://sports.yahoo.com/mlb/blog/big_league_stew/post/Gimme-Five-American-League-scoreboard-features-?urn=mlb-wp5759">this</a> article on the Yahoo Sports page, which highlighted a statistically rare event that occurred in the American League on Sunday, May 8th.  On that day, 7 baseball games were played on the AL schedule, and in all of those games one team scored exactly 5 runs.  The post then links to <a href="http://news.yahoo.com/s/ap/20110509/ap_on_sp_ba_ne/bba5_alive">this</a> article from the AP, which gives this rare event the following context:</p>
<blockquote><p>It was the first time in 18 years that such a quirky thing happened with a full schedule. On Aug. 10, 1993, all seven NL games featured one team scoring precisely two runs, STATS LLC said.</p>
<p>The last time it occurred with five or more runs was July 20, 1955, when all four AL games had at least one team score exactly six, STATS LLC said.</p></blockquote>
<p>When I read this article, some questions immediately came to mind: exactly how rare is it for one team in a collection of 7 baseball games to have a common score of 5?  Also, if 7 teams in 7 games have the same score, which score are they most likely to share?  Are the 7 games with a common score 0f 2 more or less likely to occur than the 7 games with a common score of 5?</p>
<p>We can answer these questions with some (relatively) simple probability models, given some caveats.  I&#8217;d like to estimate these probabilities using only one parameter: the average number of runs a team scores during a game.  Of course, that average will vary from team to team, and also from year to year (in particular, runs per game have declined from the heyday of steroid-mania that gripped baseball at the turn of the millennium).  Due to different rules, there may also be variation between the American and National Leagues.  Let me ignore this, though, and consider only an average number of runs per game overall &#8211; what we lose in precision we will more than make up for in clarity.</p>
<div id="attachment_1249" class="wp-caption aligncenter" style="width: 320px"><a href="http://www.mathgoespop.com/wp-content/uploads/2011/05/dingers.jpg"><img class="size-full wp-image-1249" title="dingers" src="http://www.mathgoespop.com/wp-content/uploads/2011/05/dingers.jpg" alt="" width="310" height="230" /></a><p class="wp-caption-text">Ahh, the late 90&#39;s, when it was easier to sock a few dingers.</p></div>
<p>The question remains: how many runs are scored on average in a baseball game?  I found some data online which is somewhat outdated, but I&#8217;ll stick to it for convenience (and, more importantly, out of laziness) &#8211; any alteration in this number is easy to propagate throughout the following discussion.  In <a href="http://www.hardballtimes.com/main/article/runs-per-game/">this</a> article from 2005, the author tabulated the average number of runs per game in MLB over a 5 year span from 2000-2004 (that&#8217;s over 12,000 games!).  He has a nice looking graph of the distribution of scores as well:</p>
<p><a href="http://www.hardballtimes.com/main/article/runs-per-game/"><img class="aligncenter size-full wp-image-1250" title="runspergame" src="http://www.mathgoespop.com/wp-content/uploads/2011/05/runspergame.gif" alt="" width="439" height="369" /></a>A savvy probability student might see the long tail of this probability distribution and liken it to the <a href="http://en.wikipedia.org/wiki/Poisson_distribution">Poisson distribution</a>, a distribution encountered in many probability courses, and which is frequently motivated by a desire to model &#8220;rare events.&#8221;  I put the term in quotations since what constitutes &#8220;rare&#8221; is frequently left undefined, and in any event, is not really pertinent to this discussion.</p>
<p>Let us suppose, then, that the number of runs scored per game by each team follows a Poisson distribution.  French aside, this means that the probability a team will score <em>n</em> runs is equal to</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=e%5E%7B-A%7D%5Cfrac%7BA%5En%7D%7Bn%21%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='e^{-A}\frac{A^n}{n!}' title='e^{-A}\frac{A^n}{n!}' class='latex' />,</p>
<p style="text-align: left;">where A is the average number of runs scored per game &#8211; in this case, 4.82, and <em>e</em> is the unsung hero sometimes known as <a href="http://www.mathgoespop.com/2010/01/e-day.html">Euler&#8217;s number</a>.  Don&#8217;t worry too much about this formula; if you prefer, the graph of the function <img src='http://s.wordpress.com/latex.php?latex=e%5E%7B-4.82%7D%5Cfrac%7B4.82%5En%7D%7Bn%21%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='e^{-4.82}\frac{4.82^n}{n!}' title='e^{-4.82}\frac{4.82^n}{n!}' class='latex' /> looks like this (courtesy of <a href="http://www.wolframalpha.com/">Wolfram Alpha</a>):</p>
<p style="text-align: left;"><a href="http://www.mathgoespop.com/wp-content/uploads/2011/05/Picture-2.png"><img class="aligncenter size-full wp-image-1252" title="Poisson482" src="http://www.mathgoespop.com/wp-content/uploads/2011/05/Picture-2.png" alt="" width="320" height="193" /></a>Note that the fit isn&#8217;t perfect &#8211; this graph starts much lower at 0 than the graph of the actual data pictured above, for example &#8211; but there is precedence for using the Poisson distrubtion to model runs in a baseball game (<a href="http://www.jstor.org/pss/2684837">this</a> article provides one such example, but a subscription is required to view it in its entirety).  More careful analysis is possible, and can be found in resources like <a href="http://books.google.com/books?id=1mNZfyil2ecC&amp;lpg=PA168&amp;ots=oXZDh_q7X5&amp;dq=probability%20distribution%20of%20runs%20scored%20in%20a%20baseball%20game&amp;pg=PP1#v=onepage&amp;q=probability%20distribution%20of%20runs%20scored%20in%20a%20baseball%20game&amp;f=false">this</a> one, but again, I want to keep things relatively simple.</p>
<p style="text-align: left;">So, let us suppose that the probability that a team scores <em>n</em> runs is <img src='http://s.wordpress.com/latex.php?latex=e%5E%7B-4.82%7D%5Cfrac%7B4.82%5En%7D%7Bn%21%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='e^{-4.82}\frac{4.82^n}{n!}' title='e^{-4.82}\frac{4.82^n}{n!}' class='latex' />.  What then, is the probability than in a baseball game, one of the teams will score <em>n</em> runs?  Either team A can score <em>n</em> runs or team <em>B</em> can score <em>n</em> runs, but they can&#8217;t both score <em>n</em> runs since baseball games can&#8217;t end in a tie.  This means that the probability of A or B scoring <em>n</em> runs is simply the probability that A scores <em>n</em> runs plus the probability that <em>B</em> scores <em>n</em> runs, or <img src='http://s.wordpress.com/latex.php?latex=2e%5E%7B-4.82%7D%5Cfrac%7B4.82%5En%7D%7Bn%21%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='2e^{-4.82}\frac{4.82^n}{n!}' title='2e^{-4.82}\frac{4.82^n}{n!}' class='latex' /></p>
<p style="text-align: left;">For the odds that this happens 7 times, we then multiply this number by itself 7 times (lurking under this is the assumption that runs scored in different games are <a href="http://en.wikipedia.org/wiki/Independence_%28probability_theory%29">independent</a>, which seems like an entirely reasonable assumption to make).  To summarize, we estimate the probability that one team in each of 7 games scores n runs is</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%282e%5E%7B-4.82%7D%5Cfrac%7B4.82%5En%7D%7Bn%21%7D%29%5E7.&#038;bg=T&#038;fg=000000&#038;s=0' alt='(2e^{-4.82}\frac{4.82^n}{n!})^7.' title='(2e^{-4.82}\frac{4.82^n}{n!})^7.' class='latex' /></p>
<p style="text-align: left;">If <em>n</em> = 5 (as it did earlier this month), the probability is roughly .064%.  In other words, if 7 AL games were played every day, you would expect this outcome once every 1,560 days or so.  Having said that, with more careful analysis it&#8217;s possible to show that in fact, if 7 games will have teams scoring the same number of runs, 5 is the most likely number.  For comparison, when <em>n</em> = 2 the probability is only a paltry 0.00812%, making what happened on May 8th over 75 times more likely than what happened on August 10, 1993.  Of course, it&#8217;s not fair to compare these records to the 6 run record in 1955, since in that case only 4 games were played, rather than 7.  Nevertheless, it&#8217;s not difficult to adjust this model from 7 games to 4 games (or an arbitrary number of games).</p>
<p style="text-align: left;">So, rather than some murky intuition telling us this event should be unlikely, with a little more effort we can attempt to quantify exactly how unlikely this event should be.  More sophisticated models for runs could be used, but perhaps that is a topic I will save for another day.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mathgoespop.com/2011/05/scoreboard-stats.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Optimal Base Running</title>
		<link>http://www.mathgoespop.com/2010/10/optimal-base-running.html</link>
		<comments>http://www.mathgoespop.com/2010/10/optimal-base-running.html#comments</comments>
		<pubDate>Thu, 28 Oct 2010 23:38:02 +0000</pubDate>
		<dc:creator>Matt</dc:creator>
				<category><![CDATA[Sports]]></category>
		<category><![CDATA[baseball]]></category>
		<category><![CDATA[brian wilson]]></category>
		<category><![CDATA[giants]]></category>

		<guid isPermaLink="false">http://www.mathgoespop.com/?p=910</guid>
		<description><![CDATA[<p>Now that the World Series is upon is, I thought I might take a moment to discuss the latest results in the field of optimal base running.  On the face of it, this may seem like a non-issue; after all, as any decent student of geometry will tell you, the shortest distance between any two plates <span style="color:#777"> . . . &#8594; Read More: <a href="http://www.mathgoespop.com/2010/10/optimal-base-running.html">Optimal Base Running</a></span>]]></description>
			<content:encoded><![CDATA[<p>Now that the World Series is upon is, I thought I might take a moment to discuss the latest results in the field of optimal base running.  On the face of it, this may seem like a non-issue; after all, as any decent student of geometry will tell you, the shortest distance between any two plates is a straight line.</p>
<p>In a game of baseball, however, it&#8217;s more important to minimize time, not distance.  Given this, running a path that consists of four straight lines connecting each base is not optimal, because the runner must slow down to make the sharp turns at each base.  Of course, baseball players already know this, which is why they often swing out in their path before crossing first when they are confident that they can reach second or more.  But still, the question remains: are these trajectories optimal?</p>
<p>According to a trio of mathematicians from <a href="http://www.williams.edu/">Williams College</a>, maybe not.  According to an article originally posted in <a href="http://www.livescience.com/culture/baseball-running-bases-curved-path-100530.html">May</a>, but which only now seems to be gaining some <a href="http://www.sciencenews.org/index/generic/activity/view/id/64589/title/Math_Trek__Winning_the_World_Series_with_math">traction</a>, recent graduate Davide Carozza wondered what would happen if runners began to curve immediately when leaving the batter&#8217;s box.  His conclusion may be somewhat surprising: using a circular path around the bases rather than a straight line path resulted in a 25% faster loop around the pitcher&#8217;s mound.</p>
<div id="attachment_916" class="wp-caption aligncenter" style="width: 609px"><a href="http://www.mathgoespop.com/wp-content/uploads/2010/10/bwilson.jpg"><img class="size-full wp-image-916" title="bwilson" src="http://www.mathgoespop.com/wp-content/uploads/2010/10/bwilson.jpg" alt="" width="599" height="290" /></a><p class="wp-caption-text">Brian Wilson likes those odds.</p></div>
<p>But of course, a circular loop doesn&#8217;t seem like the most efficient route either &#8211; after all, once you return to home you don&#8217;t need to run again, so it always makes sense to run as straight as possible from 3rd to home.  Upon further analysis, Professor Stewart Johnson found that the fastest loop indeed one that begins like a circle, but comes in closer to the bases on the last leg of the journey.</p>
<div id="attachment_914" class="wp-caption aligncenter" style="width: 310px"><a href="http://www.sciencenews.org/view/access/id/64592/name/MathTrek_oct2010_prepped.jpg"><img class="size-medium wp-image-914" title="basepath" src="http://www.mathgoespop.com/wp-content/uploads/2010/10/basepath-300x214.jpg" alt="" width="300" height="214" /></a><p class="wp-caption-text">Here is the proposed optimal path.</p></div>
<p>As with any mathematical model, some caveats are in order.  The most significant one concerns acceleration &#8211; the red lines in the image above denote the acceleration, and you can see that all the lengths are the same, meaning that acceleration is assumed to be constant.  But of course, this isn&#8217;t really true when a batter is running the bases &#8211; as the latter article points out, &#8220;a runner may not be able to speed up as quickly while running along a  curved path as he can along a straight one. In that case, some form of a  banana path might make sense, allowing a runner to go straight for the  first few critical seconds.&#8221;  Also, when the runner knows he hasn&#8217;t hit more than a single, it obviously makes little sense to run in a curved path.  Instead, it should be obvious that the wisest move is for the runner to get to first as quickly as possible.</p>
<p>Even bearing these warnings in mind, though, there still might be a good idea here worth exploiting.  If the ball is hit between two outfielders and the runner is fairly certain of a double (or better), how much of a curved trajectory should he take?  These are questions that could be answered during practices, and in a game where a fraction of a second can be the difference between a safe and an out, the answers might certainly have some applicability.</p>
<p>I would especially like to consider the San Francisco Giants to take these ideas to heart.  They need every run they can muster, and even though this team looks like a strong contender to bring San Francisco its first World Series title, a little mathematical advantage couldn&#8217;t hurt. After all, striking beard fear into the heart of your enemies can only take you so far.</p>
<div id="attachment_917" class="wp-caption aligncenter" style="width: 410px"><a href="http://www.mathgoespop.com/wp-content/uploads/2010/10/Picture-1.png"><img class="size-full wp-image-917" title="bwilson2" src="http://www.mathgoespop.com/wp-content/uploads/2010/10/Picture-1.png" alt="" width="400" height="524" /></a><p class="wp-caption-text">My betrothed, sporting a most fearsome beard.</p></div>
]]></content:encoded>
			<wfw:commentRss>http://www.mathgoespop.com/2010/10/optimal-base-running.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Stand Up to Questionable Odds</title>
		<link>http://www.mathgoespop.com/2010/09/standup.html</link>
		<comments>http://www.mathgoespop.com/2010/09/standup.html#comments</comments>
		<pubDate>Wed, 15 Sep 2010 15:00:31 +0000</pubDate>
		<dc:creator>Matt</dc:creator>
				<category><![CDATA[Math in the News]]></category>
		<category><![CDATA[Math on TV]]></category>
		<category><![CDATA[Sports]]></category>
		<category><![CDATA[bowling]]></category>
		<category><![CDATA[cancer]]></category>
		<category><![CDATA[jon stewart]]></category>
		<category><![CDATA[probability]]></category>

		<guid isPermaLink="false">http://www.mathgoespop.com/?p=717</guid>
		<description><![CDATA[<p style="text-align: left;">If you went to the movies in Los Angeles this summer, you may have seen the following ad from Stand Up to Cancer, a charitable program whose telethon aired last Friday night.  A clear homage to MasterCard&#8216;s long-running Priceless campaign, this ad swaps out prices for odds, ending with the sobering fact that 1 <span style="color:#777"> . . . &#8594; Read More: <a href="http://www.mathgoespop.com/2010/09/standup.html">Stand Up to Questionable Odds</a></span>]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;">If you went to the movies in Los Angeles this summer, you may have seen the following ad from <a href="https://www.standup2cancer.org/Default.aspx">Stand Up to Cancer</a>, a charitable program whose telethon aired last Friday night.  A clear homage to <a href="http://en.wikipedia.org/wiki/MasterCard">MasterCard</a>&#8216;s long-running Priceless campaign, this ad swaps out prices for odds, ending with the sobering fact that 1 in 2 men and 1 in 3 women will be diagnosed with some type of cancer in their lifetime.</p>
<p style="text-align: center;"><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="640" height="385" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/rwC87ZKF1dQ?fs=1&amp;hl=en_US" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="640" height="385" src="http://www.youtube.com/v/rwC87ZKF1dQ?fs=1&amp;hl=en_US" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p style="text-align: left;">Presumably, those cancer odds are taken from The American Cancer society, which has the relevant stats posted <a href="http://www.cancer.org/Cancer/CancerBasics/lifetime-probability-of-developing-or-dying-from-cancer">here</a>.  When it comes to some of the other claims in the ad, though, I couldn&#8217;t help but be skeptical.</p>
<p style="text-align: left;">Take the bowling claim, for instance.  This ad would have you believe that your odds of bowling a perfect game are 1 in 11,500.  This seems quite high, even when I consider the fact that I am not a bowling master.</p>
<p style="text-align: left;">Let&#8217;s try to reverse-engineer this statistic.  To score a perfect game in bowling, one must bowl 12 strikes in a row.  Let us suppose that your probability of bowling a strike on any given frame is some number <em>p</em>.  Furthermore, let&#8217;s suppose that your performance in any frame is independent of your performance in any other frame, so that you have a probability <em>p</em> of bowling a strike each time it&#8217;s your turn.  Of course, whether or not these probabilities are independent is up for debate.  On the one hand, bowling many strikes in a row may make you more anxious about keeping your streak going, which may in turn decrease your probability of another strike; but on the other hand, if you are an adrenaline junkie who thrives in the limelight that only a bowling alley can provide, perhaps such a chain would make it more likely for your streak to continue.  In any event, these are questions better suited to a psychologist rather than a mathematician, so for simplicity let us ignore them here.</p>
<p style="text-align: left;">If you have a probability <em>p</em> of bowling a strike, and a perfect game requires 12 strikes, then the probability you will score a perfect game is the product of 12 copies of <em>p</em> (one for each strike), or <em>p</em><sup>12</sup>.  If the above ad is to be believed, this probability must equal 1/11500.  In other words,</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=p%5E%7B12%7D%20%3D%20%5Cfrac%7B1%7D%7B11500%7D.&#038;bg=T&#038;fg=000000&#038;s=0' alt='p^{12} = \frac{1}{11500}.' title='p^{12} = \frac{1}{11500}.' class='latex' /></p>
<p style="text-align: left;">Taking the twelfth root of each side, we can then conclude that</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=p%20%3D%20%5Csqrt%5B12%5D%7B%5Cfrac%7B1%7D%7B11500%7D%7D%20%5Capprox%20.4588.&#038;bg=T&#038;fg=000000&#038;s=0' alt='p = \sqrt[12]{\frac{1}{11500}} \approx .4588.' title='p = \sqrt[12]{\frac{1}{11500}} \approx .4588.' class='latex' /></p>
<p style="text-align: left;">In other words, your odds of bowling a perfect game are 1 in 11,500 if and only if the probability that you&#8217;ll bowl a strike is around 45.88%.  This seems like an extremely generous probability to give to the population at large.  After all, who among you or your circle of friends bowls a strike, on average, every other frame?  Perhaps I bowl exclusively with people who are not very good (myself included), but I would think a fairer probability for the entire population would be closer to 20% or 30% (maybe even this is too generous).</p>
<p style="text-align: left;">What to these two alternatives yield for the odds of bowling a perfect game?  Well, if <em>p</em> = .3, then <em>p</em><sup>12</sup> is approximately 1 in 1,881,676; for p = .2, the odds plummet to 1 in 244,140,625.  Both of these are significantly lower than the odds cited in the ad (roughly 164 and 21,230 times lower, respectively).</p>
<p style="text-align: left;">It may be that the odds of witnessing a perfect game are around 11,500.  For example, when you go to a bowling alley, there may be experienced players practicing.  Moreover, there are many games occurring simultaneously at a bowling alley, thus increasing the odds that at least one of them will be a perfect game.  But saying &#8220;you have a 1 in 11,500 chance of seeing someone else bowl a perfect game&#8221; doesn&#8217;t sound as sexy as &#8220;you have a 1 in 11,500 chance of bowling a perfect game,&#8221; I suppose.</p>
<p style="text-align: left;">Some of the other odds are questionable as well.  For example, the National Weather Service has some data <a href="http://www.lightningsafety.noaa.gov/medical.htm">here</a> that suggests the odds of being struck by lightning in a given year are about 1 in 500,000, not too far off from the ad&#8217;s claim of 1 in 576,000.  This isn&#8217;t an apples to apples comparison, though, because the ad does not specify that these are the odds you will be struck by lightning <em>in a given year</em>.  If we take into account the average lifespan in the United States (<a href="http://www.google.com/publicdata?ds=wb-wdi&amp;met=sp_dyn_le00_in&amp;idim=country:USA&amp;dl=en&amp;hl=en&amp;q=average+lifespan+in+us">approximately</a> 78 years), then the probability of being struck by lightning (in one&#8217;s lifetime, not in one particular year) is closer to 1 &#8211; (499,999/500,000)<sup>78</sup>, which is around 1 in 6,411.  Much higher, you&#8217;ll note, than the odds of bowling a perfect game.  (Once again, of course, we are assuming that the odds of being struck by lightning don&#8217;t vary from year to year, and that the odds of being struck in one year are independent of the odds in any other year.)</p>
<p style="text-align: left;">If the point of the ad is to give us an intuitive understanding of how likely it is for us to develop cancer, then it seems important to give benchmarks that are accurate and relatable.  Most people have bowled, but few people will have a good intuitive understanding of what it means to face odds that are 1 in 1.8 million (the odds of bowling a perfect game if you get a strike 30% of the time).  I think the point of the ad is understood regardless, but it&#8217;s a shame that the claims leading up to this point weren&#8217;t checked more thoroughly.  Indeed, many of the odds quoted in the ad can be found <a href="http://www.funny2.com/odds.htm">here</a>, a humor website that offers no sources for any of its statistics.</p>
<p style="text-align: left;">Perhaps Jon Stewart was in charge of fact-checking.  Given his lack of understanding about the nature of this program, this is perhaps the most reasonable explanation.</p>
<p style="text-align: center;"><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="640" height="385" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/txmezfuV0P4?fs=1&amp;hl=en_US" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="640" height="385" src="http://www.youtube.com/v/txmezfuV0P4?fs=1&amp;hl=en_US" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p style="text-align: left;">
]]></content:encoded>
			<wfw:commentRss>http://www.mathgoespop.com/2010/09/standup.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Let’s Make a Deal with Paul the Octopus</title>
		<link>http://www.mathgoespop.com/2010/07/lets-make-a-deal-with-paul-the-octopus.html</link>
		<comments>http://www.mathgoespop.com/2010/07/lets-make-a-deal-with-paul-the-octopus.html#comments</comments>
		<pubDate>Wed, 14 Jul 2010 15:00:40 +0000</pubDate>
		<dc:creator>Matt</dc:creator>
				<category><![CDATA[Animal Math]]></category>
		<category><![CDATA[Math in the News]]></category>
		<category><![CDATA[Sports]]></category>
		<category><![CDATA[monty hall]]></category>
		<category><![CDATA[probability]]></category>
		<category><![CDATA[soccer]]></category>

		<guid isPermaLink="false">http://www.mathgoespop.com/?p=486</guid>
		<description><![CDATA[<p>As summer reaches its midpoint, we come to the end of another rousing year of World Cup soccer.  As with any international sporting event, fans all over the world have undoubtedly had their share of ups and downs.  Of all the countries in this year&#8217;s tournament, however, I think Germany may be receiving the most attention, <span style="color:#777"> . . . &#8594; Read More: <a href="http://www.mathgoespop.com/2010/07/lets-make-a-deal-with-paul-the-octopus.html">Let’s Make a Deal with Paul the Octopus</a></span>]]></description>
			<content:encoded><![CDATA[<p>As summer reaches its midpoint, we come to the end of another rousing year of World Cup soccer.  As with any international sporting event, fans all over the world have undoubtedly had their share of ups and downs.  Of all the countries in this year&#8217;s tournament, however, I think Germany may be receiving the most attention, for even though they didn&#8217;t make it into the finals, the Germans have one thing no other country has: a precognitive octopus.</p>
<p>At least, that is what the media would have us believe.  For the past several weeks, <a href="http://en.wikipedia.org/wiki/Paul_the_Octopus">Paul the Octopus</a> has captured the hearts, minds, and stomachs of people around the world.  He&#8217;s a charming octopus, to be sure, but it isn&#8217;t his good looks that have gotten him this far.  Instead, it&#8217;s his seeming ability to correctly predict the outcome of soccer matches.  As time has gone on and Paul&#8217;s predictions have continued to prove themselves accurate, the amount of press he has received has only increased.  Articles about him are everywhere on the internet: <a href="http://g.sports.yahoo.com/soccer/world-cup/news/dinner--fbintl_dw-octopus070810.html">here</a>&#8216;s one discussing public outrage after he correctly predicted Spain to defeat Germany in the semifinals, and <a href="http://g.sports.yahoo.com/soccer/world-cup/news/dinner--fbintl_dw-octopus070810.html">here</a>&#8216;s an article discussing his preference for Spain over the Netherlands in the finals.  Search for &#8220;Paul the Octopus&#8221; in Google News and you will find thousands of results.</p>
<div class="wp-caption aligncenter" style="width: 349px"><a href="http://upload.wikimedia.org/wikipedia/commons/thumb/b/b9/Oktopus-Orakel_Paul_mit_Schuh.JPG/450px-Oktopus-Orakel_Paul_mit_Schuh.JPG"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/b/b9/Oktopus-Orakel_Paul_mit_Schuh.JPG/450px-Oktopus-Orakel_Paul_mit_Schuh.JPG" alt="" width="339" height="452" /></a><p class="wp-caption-text">This is one popular mollusk.</p></div>
<p>Of course, I suppose it&#8217;s possible that Paul really can see into the future, at least as far as soccer is concerned.  After all, he did correctly predict the winner of every game asked of him; an impressive feat, seeing as how his advice was requested a total of 8 times.  However, <a href="http://en.wikipedia.org/wiki/Occam%27s_razor">Occam&#8217;s Razor</a> suggests that we should look for a simpler explanation.</p>
<p>A natural first choice is to guess that Paul was simply guessing randomly, and is very lucky.  The odds of this happening are small &#8211; assuming he has a 50% chance of picking correctly, the odds of him being right each one of the 8 times he predicted a winner in this World cup would be 1/2<sup>8</sup>, which is only approximately .39%.  Very low odds indeed.</p>
<p>This analysis ignores biases that may be present &#8211; in particular, Paul&#8217;s octopus vision may bias him towards certain flag designs (which, given the fact that he frequently chooses Germany, seems plausible).  The Wikipedia article I linked to discusses other sorts of possible bias.  However, these biases would only influence which box he selected &#8211; they wouldn&#8217;t necessarily affect the odds that his selection would correspond to the winning team (although it is possible that he is being persuaded to throw his chips in for the favored team, which would increase the likelihood of his success).  Either way, questions concerning how Paul makes his selection are more interesting to me, so let me focus for the moment on that.</p>
<p>First of all, one could easily argue that unlike flipping a coin, the trials here (i.e. Paul&#8217;s selections) are NOT independent.  Indeed, what&#8217;s going on here may be very similar to an article in the New York Times a couple of years ago that discussed the lurking presence of the Monty Hall Problem in a classic experiment from psychology (which I discussed <a href="http://www.mathgoespop.com/2008/08/math-in-the-news-monty-hall-strikes-again.html">here</a>).</p>
<p>The idea is quite simple.  Paul chooses between two opponents in the World Cup by selecting a piece of food from one of two boxes.  Each box is labeled with a country&#8217;s flag, and this is the most obvious distinction between the boxes.  Suppose, for the sake of argument, that Paul has in his mind a ranking of his preferences for the flags, starting with the one he likes the most, and ending with the one he likes the least.  Assuming between any pair of flags Paul prefers one to another, one could theoretically determine his preferences by giving him sufficiently many pairings of different flags.  Moreover, assuming he does have preferences, the game selections are no longer independent, because each game gives us some information about his preferences.</p>
<p>Let&#8217;s dig deeper and look at his selections throughout the World Cup.  Paul gave predictions for 8 games, and those 8 games involved 9 separate teams (only one game did not involve Germany).  In the first game, Germany versus Australia, Paul selected Germany.  Let us note that as: <a href="http://www.mathgoespop.com/wp-content/uploads/2010/07/geraus.jpg"><img class="aligncenter size-full wp-image-525" title="geraus" src="http://www.mathgoespop.com/wp-content/uploads/2010/07/geraus.jpg" alt="" width="425" height="91" /></a>Now, already this gives us information about Paul&#8217;s preferences.  There are 9 x 8 x 7 x 6 x 5 x 4 x 3 x 2 x 1 = 9! = 362,880 possible ways to order these 9 teams, since you can choose any one of the 9 teams to be your favorite, any one of the 8 remaining to be your second favorite, and so on.  But given the above information, we already know that any choice with Australia ranked higher than Germany can not match Paul&#8217;s preferences.  This eliminates a surprising number of possible outcomes &#8211; half, in fact, since for every list in which Germany is ranked higher than Australia, we can flip these two countries to obtain a ranking in which Australia is higher than Germany.</p>
<p>This then affects the probability of all subsequent pairings!  Let&#8217;s take a look at the next game.  In the second game, Paul correctly predicted Serbia to defeat Germany.  We can write this as: <a href="http://www.mathgoespop.com/wp-content/uploads/2010/07/serbger.jpg"><img class="aligncenter size-full wp-image-527" title="serbger" src="http://www.mathgoespop.com/wp-content/uploads/2010/07/serbger.jpg" alt="" width="425" height="91" /></a>This gives us even more information!  Now, not only do we know that Paul prefers Germany to Australia, we also know he prefers Serbia to Germany.</p>
<p>Moreover, suppose (as seems reasonable) we assume the probability that he prefers Germany to Australia is 50%.  Now, after the first match, we know he prefers Germany to Australia &#8211; the right followup question to ask is, what is the probability he prefers Serbia to Germany GIVEN that he prefers Germany to Australia?  In particular, we have a conditional probability question here &#8211; these two facts are not independent!  For example, the fact that Paul prefers Germany to Australia means that Germany cannot be last in his rankings &#8211; this should <em>decrease</em> the probability that he prefers Serbia to Germany.  And indeed it does: supposing that each of the 362,880 rankings is equally likely, the probability that someone will prefer Serbia to Germany given that they prefer Germany to Australia is not 50%, but is only 33%!</p>
<p>To see why, consider an arbitrary ranking of the 9 teams.  If we only consider the relative placement of Serbia (S), Germany (Ge), and Australia (A), there are 6 possible rankings among these three:</p>
<ol>
<li> S &gt; Ge &gt; A </li>
<li>S &gt; A &gt; Ge </li>
<li>Ge &gt; S &gt; A </li>
<li>Ge &gt; A &gt; S </li>
<li>A &gt; Ge &gt; S </li>
<li>A &gt; S &gt; Ge.</li>
</ol>
<p>However, if we also know that Ge &gt; A, then rankings 2, 5, and 6 are eliminated, leaving us only with</p>
<ol>
<li>S &gt; Ge &gt; A</li>
<li>Ge &gt; S &gt; A</li>
<li>Ge &gt; A &gt; S.</li>
</ol>
<p>In particular, of these three remaining, Serbia is ranked higher than Germany only once.  Therefore, the probability that S &gt; G GIVEN that G &gt; A is only 1/3, or 33%.  (If you prefer, you can use a counting argument to show this as well.)</p>
<p>Of course, just as the first match gave us information, so did the second.  Therefore, when it comes to the third match, we have even more information at our disposal.  The third match was between Germany and Ghana, and Paul correctly identified Germany.  In other words: <a href="http://www.mathgoespop.com/wp-content/uploads/2010/07/gergha.jpg"><img class="aligncenter size-full wp-image-530" title="gergha" src="http://www.mathgoespop.com/wp-content/uploads/2010/07/gergha.jpg" alt="" width="425" height="91" /></a>Now the appropriate question to ask, of course, is: what is the probability that Paul prefers Germany to Ghana, given that he prefers Serbia to Germany and Germany to Australia?  Well, we know that Germany can&#8217;t be his first or his last choice, because it must be preceded by Serbia and followed by Australia.  Therefore, among these four countries, Germany must rank second or third.</p>
<p>If Germany ranks second, Serbia must be first, but we are free to make Australia third or fourth.  Similarly, if Germany is third, then Australia must be last, and we are free to make Serbia first or second.  In other words, we have four outcomes:</p>
<ol>
<li>S &gt; Ge &gt; Gh &gt; A</li>
<li>S &gt; Ge &gt; A &gt; Gh</li>
<li>S &gt; Gh &gt; Ge &gt; A</li>
<li>Gh &gt; S &gt; Ge &gt; A.</li>
</ol>
<p>Among these four choices, only 2 have Germany preferred over Ghana.  Thus the conditional probability that one would prefer Germany to Ghana is again 50%.</p>
<p>The interested reader can easily continue on in this fashion.  If you&#8217;re impatient, however, you can calculate the probability that Paul would have made the selections he did more directly.  All we need to know is who Paul selected in each match.  I&#8217;ll tell you that in the subsequent matches, Paul picked Germany over England, Germany over Argentina, Spain over Germany, Germany over Uruguay, and Spain over the Netherlands.</p>
<p>Given this information, suppose you want to know the probability that Paul selects Germany over Australia, Argentina, Uruguay, Ghana, and England, while he selected Spain over Germany and the Netherlands and Serbia over Germany.  As stated before, there are 9! possible lists of preferences.  In this case, it&#8217;s not hard to determine how many would lead to the behavior seen in this year&#8217;s World Cup.  Since Paul picked Germany over 5 teams, but behind 2 teams, we know that Germany can only be ranked 3rd or 4th (any higher and there wouldn&#8217;t be room for Spain and Serbia above, any lower and there wouldn&#8217;t be room for the 5 teams below).</p>
<p>If Germany is ranked 3rd, then we can choose to put either Spain or Serbia in 1st place.  Whichever one we don&#8217;t put in first place will then need to go in 2nd place, so that both countries are ranked higher than Germany.  After that, we are free to order the remaining countries however we like.  In other words, we see that there are 2 x 6! = 1,440 possible lists of preferences if Germany is ranked 3rd:</p>
<ol>
<li>2 choices</li>
<li>1 choice</li>
<li>Germany</li>
<li>6 choices</li>
<li>5 choices</li>
<li>4 choices</li>
<li>3 choices</li>
<li>2 choices</li>
<li>1 choice.</li>
</ol>
<p>Meanwhile, if Germany is in 4th place, we need to figure out how many ways there are to choose the three teams above it.  Notice that since Australia, Argentina, Uruguay, Ghana and England must all be ranked lower than Germany, the three countries ranked above it must be Spain, Serbia, and the Netherlands.  However, since we also need the Netherlands to be ranked below Spain, this only gives us three possibilities for the ranking of the first three teams: Spain, Serbia, Netherlands; Spain, Netherlands, Serbia; and Serbia, Spain, Netherlands.  Once we have made that selection, however, we are free to choose the 5 teams below Germany however we please.  In other words, if Germany is 4th, there are 3 x 5! = 360 possible lists of preferences.</p>
<p>Combining these, we see that there are 1,800 possible lists of preferences that would lead to the behavior shown by Paul.  Since the total number of outcomes is 9!, this gives a probability of only 1,800/9!, or roughly .49%.  This is the same value you will get if you calculate the remaining conditional probabilities and multiply them together.</p>
<p>Of course, if one wants a more impressive number, one can always try to correct for bias in Paul&#8217;s selection.  For example, suppose we assume that Paul will never choose England or Australia if given the option of a different flag &#8211; this seems reasonable, given experiments on how his species sees (the other flags have more contrast and are more focused on horizontal shapes, which apparently his species is drawn to).  If we make this assumption, the number of potential preference lists drops to 7! x 2 = 10,080, in which case the probability that Paul would choose as he did jumps up to 17.9%!</p>
<p>There are valid concerns about this model, though.  For instance, given the choice between two flags, why should we assume that Paul will always choose the same one over the other?  Equivalently, why should we believe that Paul&#8217;s decisions follow a prescribed preference list?  Indeed, when Paul was used to make predictions in 2008, he selected Germany over Spain, unlike his selection of Spain over Germany in 2010.  In 2008, it was Germany who was the victor, and so Paul guessed incorrectly &#8211; perhaps he learns from his mistakes, after all.</p>
<p>Whatever the case, I doubt this octopus has any special ability.  And even if he does, I don&#8217;t know that that would necessarily be a good thing.  For we all know that when 8 limbs are combined with super powers, nothing good can come of it.</p>
<div class="wp-caption aligncenter" style="width: 290px"><a href="http://upload.wikimedia.org/wikipedia/en/b/b6/2826_DocOck.jpg"><img src="http://upload.wikimedia.org/wikipedia/en/b/b6/2826_DocOck.jpg" alt="" width="280" height="425" /></a><p class="wp-caption-text">Is this the future that Paul&#39;s powers portend?  I believe it is.</p></div>
]]></content:encoded>
			<wfw:commentRss>http://www.mathgoespop.com/2010/07/lets-make-a-deal-with-paul-the-octopus.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mathematics Awareness Month 2010</title>
		<link>http://www.mathgoespop.com/2010/04/mathematics-awareness-month-2010.html</link>
		<comments>http://www.mathgoespop.com/2010/04/mathematics-awareness-month-2010.html#comments</comments>
		<pubDate>Sat, 01 May 2010 05:58:52 +0000</pubDate>
		<dc:creator>Matt</dc:creator>
				<category><![CDATA[Et cetera]]></category>
		<category><![CDATA[Sports]]></category>
		<category><![CDATA[baseball]]></category>
		<category><![CDATA[basketball]]></category>
		<category><![CDATA[football]]></category>
		<category><![CDATA[golf]]></category>
		<category><![CDATA[math awareness month]]></category>

		<guid isPermaLink="false">http://www.mathgoespop.com/?p=306</guid>
		<description><![CDATA[<p style="text-align: left;">As April comes and goes, so too does Mathematics Awareness Month.  Every year, the Joint Policy Board for Mathematics swirls mathematics with a different delightful discipline: last year it was climate, and the year before was voting.</p>
<p style="text-align: left;">This year&#8217;s theme is mathematics and sports, a topic which has inspired a number of articles <span style="color:#777"> . . . &#8594; Read More: <a href="http://www.mathgoespop.com/2010/04/mathematics-awareness-month-2010.html">Mathematics Awareness Month 2010</a></span>]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;"><a href="http://www.mathaware.org/mam/2010/images/Large-mam_poster_golf.jpg"></a>As April comes and goes, so too does Mathematics Awareness Month.  Every year, the <a href="http://www.mathaware.org/about.jpbm.html">Joint Policy Board for Mathematics</a> swirls mathematics with a different delightful discipline: last year it was <a href="http://www.mathaware.org/mam/09/">climate</a>, and the year before was <a href="http://www.mathaware.org/mam/08/">voting</a>.</p>
<p style="text-align: left;">This year&#8217;s theme is <a href="http://www.mathaware.org/index.html">mathematics and sports</a>, a topic which has inspired a number of <a href="http://www.mathgoespop.com/category/sports">articles</a> here on this site.  As usual, there are a number of <a href="http://www.mathaware.org/mam/2010/essays/">essays</a> that discuss this theme from various perspectives; while usual suspects such as football and baseball play a central role in many of the essays, other sports get to mingle with mathematics as well, including track, golf, and tennis (also NASCAR, if you consider that a sport).</p>
<p><br class="spacer_" /></p>
<div class="wp-caption aligncenter" style="width: 323px"><a href="http://www.mathaware.org/mam/2010/images/Large-mam_poster_golf.jpg"><img src="http://www.mathaware.org/mam/2010/images/Large-mam_poster_golf.jpg" alt="" width="313" height="482" /></a><p class="wp-caption-text">This dude always thinks about math when he is golfing.</p></div>
<p><br class="spacer_" /></p>
<p>There are too many articles to discuss, so I&#8217;d encourage you to go take a look and see if anything strikes your fancy.  However, here are a few highlights:</p>
<p>If football is your game, Chris Jones of St. Mary&#8217;s College of California has written an <a href="http://www.mathaware.org/mam/2010/essays/JonesOvertime.pdf">article</a> about NFL overtime rules and offers a mathematical model for determining the winner in overtime based on the winner of the coin toss at the beginning of overtime.  Since overtime ends after any team scores, one would naturally expect that winning the coin toss carries with it a significant advantage, and this is born out in the data.  Jones offers an alternative rule scheme whereby the winning team is the first one to score six points, but in this case the team which wins the coin toss still has an advantage, and it is more likely that the game will end in a tie.</p>
<p>Given that the NFL recently <a href="http://www.nytimes.com/2010/03/24/sports/football/24nfl.html">changed</a> their overtime rules for playoff games, it&#8217;s too bad that Jones did not include this scheme into his analysis.  Perhaps, gentle reader, this would be a good exercise for you.</p>
<p>If your sports preferences are more varied, you may prefer the <a href="http://www.mathaware.org/mam/2010/essays/ClearyStreaks.pdf">article</a> by Rick Cleary of Bentley University, which discusses the probability of rare events in the contexts of football, baseball, and basketball.  My favorite example deals with the complaints many people have with regards to playoffs in Major League Baseball.  More specifically, the first round in MLB playoffs pits teams into a best-of-5 series, while the remaining rounds of the playoffs use a best-of-7 series.  Critics claim that the shorter first round series puts the stronger teams at a disadvantage, but in fact, a 7-series round is only slightly more advantageous for the stronger team.  In effect, Cleary argues that it&#8217;s almost incompatible to say that a best-of-5 series is unfair without also arguing that a best-of-7 series is also unfair.  The article is also well suited for a general audience.</p>
<p>Then again, maybe you are more interested in the intricate links between math and golf.  If that&#8217;s the case, you may want to peruse this <a href="http://www.mathaware.org/mam/2010/essays/BerryTiger.pdf">article</a> by Scott M. Berry, in which he analyzes the question: is Tiger Woods a winner?  In other words, does his ability to win transcend his skill level?  Does he have a mental game that helps push him to the top because of the influence he has on other players?</p>
<p>Berry modeled Tiger Woods&#8217; performance with the affectionately named RoboTiger, and concluded that in fact, Woods&#8217; record does not prove him to be a &#8220;winner&#8221; &#8211; he&#8217;s just a very skilled golfer.  The jury is still out, however, on the mathematical significance of any &#8220;winning&#8221; label for Tiger woods in the bedroom.</p>
<p style="text-align: center;">
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="480" height="400" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="bgcolor" value="#000000" /><param name="flashvars" value="autoPlay=false&amp;dist=www.southparkstudios.com&amp;orig=" /><param name="src" value="http://media.mtvnservices.com/mgid:cms:item:southparkstudios.com:267340" /><param name="wmode" value="window" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="480" height="400" src="http://media.mtvnservices.com/mgid:cms:item:southparkstudios.com:267340" allowfullscreen="true" wmode="window" flashvars="autoPlay=false&amp;dist=www.southparkstudios.com&amp;orig=" bgcolor="#000000"></embed></object>
</p>
<p>Finally, if you&#8217;re interested in turning mathematics into cash, you may be interested in this <a href="http://www.mathaware.org/mam/2010/essays/ChartierBracketology.pdf">article </a> by Tim Chartier, Erich Kreutzer, Amy Langville, and Kathryn Pedings, which discusses different methods of predicting winners in the annual NCAA Men&#8217;s Basketball Tournament.  While I&#8217;ve discussed this topic <a href="http://www.mathgoespop.com/2009/03/the-math-of-march-madness.html">before</a>, this article gives more detail on a variety of methods, which, if carefully applied, will make your bracket a sure fire winner.  Just make sure no one else in your local pool is so mathematically inclined.</p>
<p>There are plenty of other examples illustrating the intersection of math and sports, so don&#8217;t let the magic stop here.  If you&#8217;ve ever wanted to learn how to bend it like Beckham, or if you&#8217;ve ever dreamed of somehow connecting math to NASCAR, click through to the Mathematics Awareness Month website and read on.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mathgoespop.com/2010/04/mathematics-awareness-month-2010.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lying with Statistics in Football</title>
		<link>http://www.mathgoespop.com/2010/02/lying-with-statistics.html</link>
		<comments>http://www.mathgoespop.com/2010/02/lying-with-statistics.html#comments</comments>
		<pubDate>Mon, 08 Feb 2010 17:33:00 +0000</pubDate>
		<dc:creator>Matt</dc:creator>
				<category><![CDATA[Math in the News]]></category>
		<category><![CDATA[Sports]]></category>
		<category><![CDATA[football]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.mathgoespop.com/?p=158</guid>
		<description><![CDATA[In the aftermath of the Super Bowl, some of you fans may be dreading the next six months.  To kick off this football drought, I&#8217;d like to highlight this article, which was featured on Yahoo yesterday.  The article says that Saints quarterback Drew Brees should hope to lose the coin toss at the start of the <span style="color:#777"> . . . &#8594; Read More: <a href="http://www.mathgoespop.com/2010/02/lying-with-statistics.html">Lying with Statistics in Football</a></span>]]></description>
			<content:encoded><![CDATA[<div style="text-align: justify;">In the aftermath of the Super Bowl, some of you fans may be dreading the next six months.  To kick off this football drought, I&#8217;d like to highlight <a href=" http://sports.yahoo.com/nfl/blog/shutdown_corner/post/If-Saints-win-coin-toss-Super-Bowl-could-be-ove?urn=nfl,217725">this article</a>, which was featured on Yahoo yesterday.  The article says that Saints quarterback Drew Brees should hope to lose the coin toss at the start of the game, because in the past 43 Super Bowls, the team that won the coin toss had only won 20 times.</div>
<div id="attachment_163" class="wp-caption aligncenter" style="width: 385px"><a href="http://www.mathgoespop.com/wp-content/uploads/2010/02/xlivcoin.jpg"><img class="size-full wp-image-163" title="xlivcoin" src="http://www.mathgoespop.com/wp-content/uploads/2010/02/xlivcoin.jpg" alt="" width="375" height="375" /></a><p class="wp-caption-text">An unlucky coin?  Unlikely.</p></div>
<p>Um&#8230;what?  Who cares?  While 20/43 is slightly less than the expected 50%, this difference is not even close to being statistically significant.  Actually, the fact that this ratio is only 1 1/2 games shy of the mean is pretty good.  Matt Springer has posted an <a href="http://scienceblogs.com/builtonfacts/2010/02/super_bowl_coin_toss_mathemati.php">article</a> that discusses why we shouldn&#8217;t really care about this difference.</p>
<div style="text-align: justify;">
<p>Of course, the sample size is naturally restricted by the small number of Super Bowls, but if the author (Mark Pesavento) had really been interested in the question of whether or not the coin toss is correlated with the winner in a football game, he could&#8217;ve easily collected data over a couple of seasons and obtained an answer to the question.  At the very least, he could&#8217;ve owned up to the fact that his analysis is worthless, but instead, to the critics he offers only the following rebuttal: &#8220;because of the small sample size, some statisticians argue that the win-loss record of coin-toss winners is statistically insignificant.&#8221;</p>
<p>This is completely disingenuous, because it suggests that there would be a debate among statisticians about the significance in the data Pesavento uses, when no such debate exists.  Anyone with even a rudimentary background in statistics would understand that the sample size here would be too small to draw the conclusion he draws.</p>
<p>Moreover, Pesavento falls for one of the most common traps in statistics: mistaking correlation for causation.  Even if the data was much stronger in indicating that the coin toss winner is at a disadvantage, this would not imply that Brees should hope to lose the toss.  A correlation between these two effects does not imply a causal relationship between the two.  I feel like I&#8217;ve discussed this before, but just in case, here&#8217;s a thorough <a href="http://en.wikipedia.org/wiki/Correlation_does_not_imply_causation">discussion</a> of this misconception.</p>
<p>Here this point is moot, since we don&#8217;t even have a correlation.  I thought no one would need to point out that &#8220;No correlation does not imply causation,&#8221; but apparently we do.</p>
<p>Thankfully, most of the comments on Pesavento&#8217;s post are scathing in regards to his methods.  But that&#8217;s cold comfort in light of the fact that the article was deemed fit for posting on the front page of Yahoo.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mathgoespop.com/2010/02/lying-with-statistics.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Football Pools, Part 3</title>
		<link>http://www.mathgoespop.com/2010/01/football-pools-part-3.html</link>
		<comments>http://www.mathgoespop.com/2010/01/football-pools-part-3.html#comments</comments>
		<pubDate>Sun, 31 Jan 2010 16:00:06 +0000</pubDate>
		<dc:creator>Matt</dc:creator>
				<category><![CDATA[Sports]]></category>
		<category><![CDATA[betting]]></category>
		<category><![CDATA[digital root]]></category>
		<category><![CDATA[football]]></category>
		<category><![CDATA[pool]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.mathgoespop.com/?p=125</guid>
		<description><![CDATA[
<p>This is the third in a series of posts about pools used for betting on the outcome of football games (part one can be found here, and part two here).  Let me briefly recall the setting, which is probably familiar to anyone who has been to a Super Bowl party.  Typically, one bets on the outcome <span style="color:#777"> . . . &#8594; Read More: <a href="http://www.mathgoespop.com/2010/01/football-pools-part-3.html">Football Pools, Part 3</a></span>]]></description>
			<content:encoded><![CDATA[<div style="text-align: justify;">
<p>This is the third in a series of posts about pools used for betting on the outcome of football games (part one can be found <a href="http://www.mathgoespop.com/2009/02/a-variant-of-the-traditional-football-pool.html">here</a>, and part two <a href="http://www.mathgoespop.com/2009/10/more-on-football-pools.html">here</a>).  Let me briefly recall the setting, which is probably familiar to anyone who has been to a Super Bowl party.  Typically, one bets on the outcome of a football game using a 10 x 10 grid.  People can buy any number of the 100 squares on the grid, and when all the squares have been purchased, each row and each column is assigned a random digit from 0 to 9.</p>
<p>Suppose, for example, that you buy four squares, and after the rows and columns have been labeled, you find that you own square 3-7, square 2-5, square 9-0, and square 6-6.  You will win money if, at the end of any one of the four quarters, the last digit in each team&#8217;s score matches your pair.  For example, if the score after the 3rd quarter is 13-27, you will win some money, since the last two digits are 3 and 7, and you own square 3-7.  There are variants of this: some pools only pay out every half, not every quarter, and usually the payouts vary by quarter, so that having the right square at the end of the game wins you more money than having the right square at the end of the first quarter.</p>
</div>
<div id="attachment_133" class="wp-caption aligncenter" style="width: 468px"><a href="http://www.mathgoespop.com/wp-content/uploads/2010/01/fpool3.jpg"><img class="size-full wp-image-133" title="fpool3" src="http://www.mathgoespop.com/wp-content/uploads/2010/01/fpool3.jpg" alt="" width="458" height="227" /></a><p class="wp-caption-text">Here&#39;s an example of a football pool which has been tagged in the four squares mentioned above.</p></div>
<div style="text-align: justify;">
<p>In the first part of this discussion, we introduced a new way to conduct the pool: rather than looking at the last digit of a team&#8217;s score, we looked instead at the digital root of the team&#8217;s score.  Recall that the digital root of a team&#8217;s score is obtained by adding the digits in their score.  If that sum is between 1 and 9, we stop &#8211; if it is larger than 9, we compute the digital root again, until we get a digit between 1 and 9.  For example, the digital root of 14 is 1 + 4 = 5, while the digital root of 38 is 2, since 3 + 8 = 11, and 1 + 1 = 2.  We then analyzed the distribution of scores, and found that the digital root of a team&#8217;s score is more evenly distributed between 1 and 9 than the last digit of a team&#8217;s score is evenly distributed between 0 and 9 (this is subject to the convention that we assign 0 a digital root of 9, since 0 is the only number with digital root equal to 0).</p>
<p>In the second part of the discussion, we tackled questions of independence.  Namely, we asked whether the last digit in one team&#8217;s score is independent of the last digit of the other team&#8217;s score, and similarly we asked whether the digital root in one team&#8217;s score is independent of the digital root of the other team&#8217;s score.  In both cases, we found the answer to be negative.</p>
<p>The subject of this article is based on the following observation: when you have wagered in a traditional football pool, it&#8217;s not uncommon for a small number of squares to be hit with high frequency during the course of a game.  For example, suppose you watch a game in which one team scores 7 points, then 3, then 7, then 3, while the opposing team never scores.  This means that the game&#8217;s score will go from 0-0, to 7-0, to 10-0, and then to 17-0.  So, while there are four unique scores in the game, with the usual football pool, only two squares will be hit: the 0-0 square, and the 7-0 square.  However, with the digital root pool, four squares will be hit: again using the convention that we assign 0 a digital root of 9, the squares will be 9-9, 7-9, 1-9, and 8-9.</p>
<p>The reason the digital root pool hits more squares in this case is because whenever one team increases its score by 10, the last digit of their score will return to a previous value.  However, with the digital root method, if a team increases its score by 10, the digital root increases by 1.  Because a score increase of 10 is a relatively common occurrence in football (all one needs is a touchdown, extra point, and field goal), one may therefore guess that using the digital root pool, more squares should be hit during the course of the game.</p>
<p>Whether one would want more squares to be hit or not is up for debate, but I see certain benefits.  For example, if more squares are hit during the game, then more people will have something invested in the game as it airs.  If you are sitting on the square that represents the current score, you want the score to remain the same through the end of the quarter so that you can reap the rewards &#8211; but if the winning squares keep bouncing around between a small number of people, there may be fewer people actively interested in the score as the game progresses.  This is especially true in Super Bowl parties, when many of the attendees are less interested in the game than they should be.</p>
<p>In other words, I&#8217;m of the belief that if more squares are hit, it&#8217;s a good thing.  It therefore becomes natural to ask whether or not the digital root pool actually does hit more unique squares than the traditional pool.  Thankfully, we have a wealth of data which we can use to answer this question.</p>
<p>I looked at all the games from this current season, and counted the number of boxes that would have been hit in each game using the traditional pool and the digital root pool.  Averaged over 331 games (this includes preseason and postseason), the number of squares hit using the traditional pool is approximately 6.84.  By comparison, the number of squares hit using the digital root pool is 8.43 &#8211; an increase of 1.59 boxes, or an increase of about 23%.  This effect is amplified when one considers the fact that the digital root pool uses only 81 squares, as opposed to the traditional pool&#8217;s 100.  This means that as a proportion of the total number of squares, the traditional pool hits about 6.84% of its squares, while the digital root pool hits 10.4% &#8211; here we have an increase of over 50%!</p>
<p>This is strong evidence that the digital root pool hits more squares than the tradition pool.  In fact, the data shows that an average game will have a change in the score approximately 8.73 times, which is only a bit higher than the average number of boxes hit by the digital root pool.  This makes sense when we slice the data another way: of the 331 games analyzed, in 252 of them the number of squares hit with the digital root pool was equal to the number of changes of score, meaning that no square got hit more than once.  The same cannot be said of the traditional pool &#8211; in this case, the number of games in which no square got hit more than once was only 62.</p>
<p>The data has convinced me that the digital root pool may be better suited for festive gathering, where wagering on football will be but one of many activities designed to induce merriment.  At the very least, it&#8217;s hard to argue that the traditional pool will hit as many squares as the digital root pool. Some may balk at a break from football pool tradition, but that&#8217;s ok.  I won&#8217;t watch football games with them anyway.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mathgoespop.com/2010/01/football-pools-part-3.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

