Q: What do the symbols mean in the Coefficient of Determination formula?

$r^2$ ranges from 0 to 1. $\text{SS}_{\text{total}}$ = total sum of squares. $\text{SS}_{\text{residual}}$ = residual sum of squares.

Question 1

What is the Coefficient of Determination formula?

Accepted Answer

The proportion of the total variation in the response variable $y$ that is explained by the linear relationship with the explanatory variable $x$. It equals the square of the correlation coefficient: $r^2$.

Question 2

How do you use the Coefficient of Determination formula?

Accepted Answer

Total variation in $y$ has two parts: what the regression line explains and what's left over (residual variation). If $r^2 = 0.85$, the regression line accounts for $85\%$ of why $y$ values differ from each other, and $15\%$ is unexplained. Think of $r^2$ as a report card for how well $x$ predicts $y$.

Question 3

What do the symbols mean in the Coefficient of Determination formula?

Accepted Answer

$r^2$ ranges from 0 to 1. $	ext{SS}_{	ext{total}}$ = total sum of squares. $	ext{SS}_{	ext{residual}}$ = residual sum of squares.

Question 4

Why is the Coefficient of Determination formula important in Math?

Accepted Answer

$r^2$ is the standard one-number report card for a regression's predictive usefulness, and squaring $r$ exposes how much weaker a 'decent' correlation really is ($r=0.7$ explains only 49%). Mixing it up with $r$ or with causation is what leads people to overstate how much a model actually tells them. Recognizing it by "Am I reporting the fraction of $y$'s variation explained by the linear model (a 0-to-1 number), not the slope or the correlation's sign?" — rather than by familiar numbers — is what lets a student tell it apart from correlation $r$ and slope $b$ and residual variation in a mixed problem set.

Question 5

What do students get wrong about Coefficient of Determination?

Accepted Answer

The procedure for coefficient of determination is the easy part; the trap is reporting $r$ when the question asks for $r^2$. Asking "Am I reporting the fraction of $y$'s variation explained by the linear model (a 0-to-1 number), not the slope or the correlation's sign?" first is what keeps a correct-looking calculation from being attached to the wrong concept.

Question 6

What should I learn before the Coefficient of Determination formula?

Accepted Answer

Before studying the Coefficient of Determination formula, you should understand: correlation, linear regression lsrl, residuals.

Coefficient of Determination Formula

The Formula

Quick Example

Notation

What This Formula Means

Formal View

Worked Examples

Example 1

First step

Example 2

Example 3

Common Mistakes

Why This Formula Matters

Frequently Asked Questions