Residuals Formula
The Formula
\text{residual} = y - \hat{y}
When to use: Compute a residual to measure how far a regression model's prediction falls from one observed value, and plot all the residuals to judge whether the model is appropriate.
Quick Example
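Notation
y is the observed value, \hat{y} is the value predicted by the regression model, and e (or e_i for the i-th observation) is the residual.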
What This Formula Means
The difference between an observed value and its predicted value from a regression model: \text{residual} = y - \hat{y} (observed minus predicted).
A residual is how much the model got wrong for a specific data point. Positive residual means the actual value was higher than predicted; negative means it was lower. If you plot all residuals, the pattern (or lack thereof) tells you whether the model is appropriate.
Formal View
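One way to state the definition formally is sketched below. The symbols a (intercept), b (slope), and n (number of observations) are notation assumed here for illustration; the only facts used are the definition of a residual as observed minus predicted and the least-squares criterion.

```latex
\begin{align*}
\hat{y}_i &= a + b x_i, \qquad e_i = y_i - \hat{y}_i = y_i - (a + b x_i), \qquad i = 1, \dots, n \\
(a, b) &= \arg\min_{a,\,b} \sum_{i=1}^{n} e_i^2 \\
\frac{\partial}{\partial a} \sum_{i=1}^{n} (y_i - a - b x_i)^2 &= -2 \sum_{i=1}^{n} e_i = 0
\quad\Longrightarrow\quad \sum_{i=1}^{n} e_i = 0
\end{align*}
```

The last line shows why the residuals of a least-squares line with an intercept sum to zero: it is the condition for the intercept to minimize the squared error.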
Worked Examples
Example 1
easy
Solution
- Step 1: Calculate the predicted value: \hat{y} = 2 + 3(4) = 2 + 12 = 14
- Step 2: Calculate the residual: e = y - \hat{y} = 15 - 14 = 1
- Step 3: The residual is positive: the actual value (15) is above the predicted value (14)
- Step 4: Interpretation: the model under-predicts by 1 unit for this observation
Answer
The residual is e = 1: the model under-predicts this observation by 1 unit.
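The arithmetic in Example 1 can be checked with a short Python sketch. The function names `predict` and `residual` are illustrative choices, not part of any library; the line \hat{y} = 2 + 3x and the point (4, 15) are taken from the solution steps.

```python
def predict(x, intercept=2.0, slope=3.0):
    """Predicted value from the line y_hat = intercept + slope * x."""
    return intercept + slope * x


def residual(observed, predicted):
    """Residual by the usual convention: observed minus predicted."""
    return observed - predicted


# Values from Example 1: x = 4, observed y = 15.
x, y = 4, 15
y_hat = predict(x)
e = residual(y, y_hat)

# The sign of the residual says which way the model missed.
if e > 0:
    direction = "under-predicts"
elif e < 0:
    direction = "over-predicts"
else:
    direction = "matches"

print(f"y_hat = {y_hat}, residual = {e}: the model {direction}")
```

Swapping the arguments to `residual` would flip the sign and reverse the interpretation, which is why the order observed minus predicted matters.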
Example 2
medium
Common Mistakes
- Computing residuals as \hat{y} - y instead of y - \hat{y}. The convention is observed minus predicted, so reversing the order flips every sign.
- Ignoring the residual plot and looking only at r^2. A high r^2 can still come from a poorly chosen model if the relationship is curved.
- Expecting all residuals to be close to zero. Some large residuals are normal; look for patterns, not individual values.
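The second mistake can be illustrated with a minimal sketch in plain Python. The data here are made up for the illustration (y = x^2 for x = 0 to 10), and `fit_line` is a hypothetical helper that applies the standard least-squares formulas for slope and intercept.

```python
def fit_line(xs, ys):
    """Least-squares intercept and slope for a line through (xs, ys)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Illustrative curved data: a straight line is the wrong model here.
xs = list(range(11))
ys = [x ** 2 for x in xs]

intercept, slope = fit_line(xs, ys)
residuals = [y - (intercept + slope * x) for x, y in zip(xs, ys)]

# r^2 = 1 - (residual sum of squares) / (total sum of squares)
y_bar = sum(ys) / len(ys)
ss_res = sum(e ** 2 for e in residuals)
ss_tot = sum((y - y_bar) ** 2 for y in ys)
r_squared = 1 - ss_res / ss_tot

print(f"r^2 = {r_squared:.3f}")
print("residuals:", [round(e, 1) for e in residuals])
```

For this data r^2 comes out at roughly 0.93, yet the residuals are positive at both ends and negative in the middle. That U-shaped pattern is the signal that a line is not the right model, even though r^2 looks high.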
Why This Formula Matters
Residuals are how you check whether your model is appropriate. The regression equation alone does not show how well the model fits; the residual plot does.
Frequently Asked Questions
What do the symbols mean in the Residuals formula?
e_i is the residual for the i-th observation, y is the observed value, and \hat{y} is the predicted value. The residuals from a least-squares regression line (LSRL) always sum to zero: \sum e_i = 0.
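The zero-sum property can be checked numerically. The sketch below uses made-up data and a hypothetical helper `fit_line` built from the standard least-squares formulas; because of floating-point rounding, the sum is compared to zero with a small tolerance rather than tested for exact equality.

```python
import math


def fit_line(xs, ys):
    """Least-squares intercept and slope for a line through (xs, ys)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Illustrative data only; any data set with varying x values would do.
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

intercept, slope = fit_line(xs, ys)
residuals = [y - (intercept + slope * x) for x, y in zip(xs, ys)]
total = sum(residuals)

# The residuals of the fitted least-squares line cancel out.
print("sum of residuals is zero:", math.isclose(total, 0.0, abs_tol=1e-9))
```

The property holds for the fitted least-squares line specifically; residuals measured against some other line through the same data generally do not sum to zero.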
What do students get wrong about Residuals?
Students often compute residuals correctly but do not know how to read residual plots. The key is to look for patterns: a random scatter around zero suggests the model is appropriate, while any systematic pattern, such as a curve, signals a problem.
What should I learn before the Residuals formula?
Before studying the Residuals formula, you should understand linear regression and the least-squares regression line (LSRL).