- Home
- /
- Statistics
- /
- relationships and regression
- /
- Line of Best Fit
The line of best fit (trend line) is the straight line that best represents the overall trend in a scatter plot by minimizing the sum of squared vertical distances between the line and all data points. The line of best fit enables prediction and summarizes the relationship between variables with a simple equation.
Definition
The line of best fit (trend line) is the straight line that best represents the overall trend in a scatter plot by minimizing the sum of squared vertical distances between the line and all data points. Its equation enables predictions for new x-values.
๐ก Intuition
If you stretched a rubber band through a scatter plot to be as close to all points as possible, that's the line of best fit. It captures the overall trend.
๐ฏ Core Idea
The line of best fit (least-squares line) minimizes the sum of squared vertical distances from each data point to the line, giving the most accurate linear predictions.
Example
Formula
Notation
\hat{y} = b_0 + b_1 x is the equation of the line. b_1 (slope) is the change in y per unit change in x. b_0 (intercept) is the predicted y when x = 0.
๐ Why It Matters
The line of best fit enables prediction and summarizes the relationship between variables with a simple equation.
๐ญ Hint When Stuck
First, plot all data points on a scatter plot. Then calculate the slope and y-intercept using the least-squares formulas (or use a calculator/spreadsheet). Finally, draw the line and verify it balances points above and below it roughly equally.
Formal View
Related Concepts
Compare With Similar Concepts
๐ง Common Stuck Point
Students draw the line of best fit by eye, often forcing it through too many points rather than balancing points above and below the line.
โ ๏ธ Common Mistakes
- Forcing line through origin when inappropriate
- Using when relationship isn't linear
- Ignoring outliers' influence
Go Deeper
Frequently Asked Questions
What is Line of Best Fit in Statistics?
The line of best fit (trend line) is the straight line that best represents the overall trend in a scatter plot by minimizing the sum of squared vertical distances between the line and all data points. Its equation enables predictions for new x-values.
What is the Line of Best Fit formula?
When do you use Line of Best Fit?
First, plot all data points on a scatter plot. Then calculate the slope and y-intercept using the least-squares formulas (or use a calculator/spreadsheet). Finally, draw the line and verify it balances points above and below it roughly equally.
Prerequisites
Next Steps
How Line of Best Fit Connects to Other Ideas
To understand line of best fit, you should first be comfortable with stat scatter plot. Once you have a solid grasp of line of best fit, you can move on to linear regression and correlation coefficient.