- Home
- /
- Statistics
- /
- relationships and regression
- /
- Correlation
Correlation is a statistical relationship between two variables where changes in one are associated with changes in the other. Correlation helps us spot patterns and relationships in data.
Definition
Correlation is a statistical relationship between two variables where changes in one are associated with changes in the other. Positive correlation means both increase together; negative correlation means one increases as the other decreases; no correlation means no consistent pattern.
๐ก Intuition
When one thing goes up and another tends to go up with it (like study time and test scores), that's positive correlation. When one goes up and the other goes down (like TV time and exercise), that's negative correlation. They 'move together' in some pattern.
๐ฏ Core Idea
Correlation measures the direction and strength of the linear relationship between two variables. It ranges from โ1 (perfect negative) to +1 (perfect positive).
Example
Notation
r denotes the sample correlation coefficient. r > 0 indicates positive correlation, r < 0 indicates negative correlation, and |r| close to 1 indicates a strong linear relationship.
๐ Why It Matters
Correlation helps us spot patterns and relationships in data. But it's just the first step - we can't assume one thing causes the other!
๐ญ Hint When Stuck
First, plot both variables on a scatter plot and look at the overall pattern. Then determine the direction: upward trend means positive, downward means negative. Finally, judge the strength by how tightly the points cluster around a line โ tighter clustering means stronger correlation.
Formal View
See Also
๐ง Common Stuck Point
Students often confuse the strength of a correlation with its direction โ a correlation of โ0.9 is very strong, just negative.
โ ๏ธ Common Mistakes
- Thinking correlation proves causation
- Missing confounding variables
- Confusing a strong negative correlation with a weak relationship
Common Mistakes Guides
Frequently Asked Questions
What is Correlation in Statistics?
Correlation is a statistical relationship between two variables where changes in one are associated with changes in the other. Positive correlation means both increase together; negative correlation means one increases as the other decreases; no correlation means no consistent pattern.
When do you use Correlation?
First, plot both variables on a scatter plot and look at the overall pattern. Then determine the direction: upward trend means positive, downward means negative. Finally, judge the strength by how tightly the points cluster around a line โ tighter clustering means stronger correlation.
What do students usually get wrong about Correlation?
Students often confuse the strength of a correlation with its direction โ a correlation of โ0.9 is very strong, just negative.
Prerequisites
How Correlation Connects to Other Ideas
To understand correlation, you should first be comfortable with stat scatter plot and data collection. Once you have a solid grasp of correlation, you can move on to correlation coefficient and correlation vs causation.