Line of Best Fit Formula
The Formula
When to use: If you stretched a rubber band through a scatter plot to be as close to all points as possible, that's the line of best fit. It captures the overall trend.
Quick Example
Notation
What This Formula Means
The line of best fit (trend line) is the straight line that best represents the overall trend in a scatter plot by minimizing the sum of squared vertical distances between the line and all data points. Its equation enables predictions for new x-values.
If you stretched a rubber band through a scatter plot to be as close to all points as possible, that's the line of best fit. It captures the overall trend.
Formal View
Worked Examples
Example 1
easySolution
- 1 Step 1: (a) The slope is 5, meaning for each additional hour studied, the predicted test score increases by 5 points.
- 2 Step 2: (b) Substitute x = 8: y = 5(8) + 40 = 40 + 40 = 80.
- 3 Step 3: The predicted score is 80. This is an interpolation if 8 hours is within the range of the data, or an extrapolation if it is outside the data range.
Answer
Example 2
mediumCommon Mistakes
- Forcing line through origin when inappropriate
- Using when relationship isn't linear
- Ignoring outliers' influence
Why This Formula Matters
The line of best fit enables prediction and summarizes the relationship between variables with a simple equation.
Frequently Asked Questions
What is the Line of Best Fit formula?
The line of best fit (trend line) is the straight line that best represents the overall trend in a scatter plot by minimizing the sum of squared vertical distances between the line and all data points. Its equation enables predictions for new x-values.
How do you use the Line of Best Fit formula?
If you stretched a rubber band through a scatter plot to be as close to all points as possible, that's the line of best fit. It captures the overall trend.
What do the symbols mean in the Line of Best Fit formula?
\hat{y} = b_0 + b_1 x is the equation of the line. b_1 (slope) is the change in y per unit change in x. b_0 (intercept) is the predicted y when x = 0.
Why is the Line of Best Fit formula important in Statistics?
The line of best fit enables prediction and summarizes the relationship between variables with a simple equation.
What do students get wrong about Line of Best Fit?
Students draw the line of best fit by eye, often forcing it through too many points rather than balancing points above and below the line.
What should I learn before the Line of Best Fit formula?
Before studying the Line of Best Fit formula, you should understand: stat scatter plot.