Statitudes

Statitudes

Share this post

Statitudes
Statitudes
Pythagoras of the Hardwood

Pythagoras of the Hardwood

Using points scored and points allowed to predict team wins.

Justin Kubatko's avatar
Justin Kubatko
Jan 27, 2023
∙ Paid
2

Share this post

Statitudes
Statitudes
Pythagoras of the Hardwood
1
Share

As with many things in sports analytics, it started with Bill James.

James was looking for a way to translate a baseball team’s runs scored and runs allowed into wins and losses. He noticed that the relationship could be expressed as follows:

\(\frac{W}{L} = \frac{RS^2}{RA^2}\)

The above could then be used to express a team’s winning percentage as a function of runs scored and runs allowed:

\(W\% = \frac{RS^2}{RS^2 + RA^2}\)

Because of the squared terms in the equation above, James chose to dub this the Pythagorean formula. James found that when he used this formula to predict a team’s win total given its runs scored and runs allowed, he would usually get within plus or minus four wins of the team’s actual total.

As it turns out, James’ formula is actually a logit model in disguise:

\(log\left(\frac{W\%}{1 - W\%}\right) = \beta \times log\left(\frac{RS}{RA}\right)\)

Solving the expression above for winning percentage yields:

\(W\% = \frac{RS^\beta}{RS^\beta + RA^\beta}\)

Where, in the case of baseball, the coefficient beta has a value of approximately 2.

Keep reading with a 7-day free trial

Subscribe to Statitudes to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2025 Justin Kubatko
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share