Coefficient of Determination Examples in Math

Start with the recap, study the fully worked examples, then use the practice problems to check your understanding of Coefficient of Determination.

This page combines explanation, solved examples, and follow-up practice so you can move from recognition to confident problem-solving in Math.

Concept Recap

The proportion of the total variation in the response variable yy that is explained by the linear relationship with the explanatory variable xx. It equals the square of the correlation coefficient: r2r^2.

Total variation in yy has two parts: what the regression line explains and what's left over (residual variation). If r2=0.85r^2 = 0.85, the regression line accounts for 85%85\% of why yy values differ from each other, and 15%15\% is unexplained. Think of r2r^2 as a report card for how well xx predicts yy.

Read the full concept explanation โ†’

How to Use These Examples

  • Read the first worked example with the solution open so the structure is clear.
  • Try the practice problems before revealing each solution.
  • Use the related concepts and background knowledge badges if you feel stuck.

What to Focus On

Core idea: r2r^2 is the proportion of variation in yy accounted for by the linear relationship with xx โ€” the square of the correlation.

Common stuck point: The procedure for coefficient of determination is the easy part; the trap is reporting rr when the question asks for r2r^2. Asking "Am I reporting the fraction of yy's variation explained by the linear model (a 0-to-1 number), not the slope or the correlation's sign?" first is what keeps a correct-looking calculation from being attached to the wrong concept.

Sense of Study hint: Ask: Am I reporting the fraction of yy's variation explained by the linear model (a 0-to-1 number), not the slope or the correlation's sign?

Worked Examples

Example 1

medium
A regression model has SST=500SST = 500 (total variation) and SSE=125SSE = 125 (unexplained variation). Calculate R2R^2 and interpret its meaning.

Answer

R2=0.75R^2 = 0.75. The model explains 75% of variation in y.

First step

1
R2=1โˆ’SSESST=1โˆ’125500=1โˆ’0.25=0.75R^2 = 1 - \frac{SSE}{SST} = 1 - \frac{125}{500} = 1 - 0.25 = 0.75

See the full worked solution + why-it-works coaching

SetupKey insightWhy it worksCommon pitfallConnection

Unlock answer keys One Family plan โ€” every worked solution, all subjects

Example 2

hard
Two models predict house prices: Model 1 (size only): R2=0.60R^2=0.60. Model 2 (size + neighborhood + age): R2=0.85R^2=0.85. Explain what the increase in R2R^2 means and what caution should be applied with multi-variable R2R^2.

Example 3

medium
A model has SST=1200SST=1200 and r2=0.7r^2=0.7. Find the explained sum of squares SSRSSR and the residual SSESSE.

Example 4

medium
A model on n=20n=20 points has SST=500SST=500 and r2=0.84r^2=0.84. Compute SSESSE and the residual standard deviation s=SSE/(nโˆ’2)s=\sqrt{SSE/(n-2)}.

Example 5

hard
Five yy-values are 4,5,7,8,114,5,7,8,11 with yห‰=7\bar y=7. The residuals from a fitted line are 1,โˆ’1,0,โˆ’1,11,-1,0,-1,1. Compute r2r^2.

Example 6

challenge
A simple linear regression has r2=0.64r^2=0.64 with n=10n=10 points. Compute the test statistic t=r(nโˆ’2)/(1โˆ’r2)t=r\sqrt{(n-2)/(1-r^2)} for testing H0:ฯ=0H_0:\rho=0 assuming positive slope.

Practice Problems

Try these problems on your own first, then open the solution to compare your method.

Example 1

easy
The correlation between study hours and test score is r=0.8r=0.8. Calculate R2R^2 and interpret it.

Example 2

hard
A model has R2=0.95R^2=0.95. A researcher concludes 'the model is perfect and ready for deployment.' Identify two potential problems with this conclusion.

Example 3

easy
The correlation is r=0.7r = 0.7. Compute r2r^2.

Example 4

easy
r2=0.64r^2 = 0.64 and the slope is positive. Find rr.

Example 5

easy
Interpret r2=0.85r^2 = 0.85 in words.

Example 6

easy
What is the range of possible values for r2r^2?

Example 7

easy
If r2=1r^2 = 1, what does that say about the data and the line?

Example 8

easy
If r2=0r^2 = 0, how much of the variation in yy does the line explain?

Example 9

easy
True or false: a high r2r^2 guarantees the linear model is the correct model.

Example 10

easy
r2=0.36r^2 = 0.36 for a regression with a negative slope. Find rr.

Example 11

medium
Total variation in yy (SST) is 200200 and residual variation (SSE) is 5050. Compute r2r^2.

Example 12

medium
Explained variation is 180180 out of a total variation of 240240. Compute r2r^2.

Example 13

medium
A report states r=0.5r = 0.5. A student claims '50%50\% of the variation is explained.' Correct them.

Example 14

medium
r2=0.81r^2 = 0.81. A student says '81%81\% of the data points fall on the line.' What is wrong, and what is correct?

Example 15

medium
Model A has r2=0.9r^2 = 0.9 but a strongly curved residual plot. Model B has r2=0.8r^2 = 0.8 with random residuals. Which has the better-justified linear fit?

Example 16

medium
A regression has SST =500= 500 and r2=0.6r^2 = 0.6. Find the residual sum of squares SSE.

Example 17

medium
If r2r^2 rises from 0.490.49 to 0.640.64, how does the magnitude of rr change (slope stays positive)?

Example 18

medium
A linear model gives r2=0.95r^2 = 0.95, but extrapolating it predicts a negative weight for a real object. What does this reveal?

Example 19

medium
A regression has explained variation SSR=90SSR = 90 and residual variation SSE=60SSE = 60. Compute r2r^2.

Example 20

challenge
Total variation SST =1000= 1000. Adding a predictor reduces residual variation from SSE =400= 400 to SSE =250= 250. By how much does r2r^2 increase?

Example 21

challenge
Two variables have r2=0r^2 = 0 yet are perfectly related by y=x2y = x^2 over [โˆ’2,2][-2,2]. Explain how both can be true.

Example 22

challenge
A model on n=10n=10 points has SST =360= 360 and r2=0.75r^2 = 0.75. Find SSE and then the residual standard deviation s=SSEnโˆ’2s = \sqrt{\frac{SSE}{n-2}}.

Example 23

easy
Given r=0.6r=0.6, compute r2r^2.

Example 24

easy
If r2=0.49r^2=0.49 and the slope is positive, find rr.

Example 25

easy
SST=400SST=400, SSE=80SSE=80. Compute r2r^2.

Example 26

easy
For a regression on a dataset, r=โˆ’0.9r=-0.9. Compute r2r^2.

Example 27

easy
A student says r=0.4r=0.4 means '40% of the variation is explained.' Correct them.

Example 28

medium
A regression gives r2=0.81r^2=0.81 and a negative slope. Find rr.

Example 29

medium
Two datasets have r2=0.64r^2=0.64 and r2=0.49r^2=0.49. Which has the stronger linear association?

Example 30

medium
SSE=120SSE=120 and r2=0.4r^2=0.4. Find SSTSST.

Example 31

medium
Adding a predictor changes SSESSE from 300300 to 180180 with SST=600SST=600. How much does r2r^2 rise?

Example 32

medium
The residuals from a line on yy are exactly half the residuals from using yห‰\bar y. Find r2r^2.

Example 33

medium
r2=0.04r^2=0.04. A student calls the relationship 'strong.' Correct them.

Example 34

medium
A least-squares line passes through every data point. State r2r^2 and what that implies about SSESSE.

Example 35

medium
Data with strong curvature gives a regression line with r2=0.82r^2=0.82. Is the linear model appropriate? Justify.

Example 36

hard
SST=900SST=900. The fitted line gives residual SS SSE=144SSE=144. Compute r2r^2 to two decimals.

Example 37

hard
For a regression with n=12n=12 points and r2=0.49r^2=0.49, find SSESSE when SST=600SST=600 and the typical residual s=SSE/(nโˆ’2)s=\sqrt{SSE/(n-2)}.

Example 38

hard
Explain why r2r^2 is identical whether the regression line is yy on xx or xx on yy.

Example 39

hard
In a regression of yy on xx, doubling every yy-value (no change to xx) โ€” what happens to r2r^2?

Example 40

challenge
In a multiple regression with k=3k=3 predictors and n=20n=20, R2=0.7R^2=0.7. Compute the adjusted R2=1โˆ’(1โˆ’R2)nโˆ’1nโˆ’kโˆ’1R^2 = 1 - (1-R^2)\frac{n-1}{n-k-1}.

Background Knowledge

These ideas may be useful before you work through the harder examples.

correlationlinear regression lsrlresiduals