Correlation Formula

Correlation measures the strength and direction of the linear relationship between two quantitative variables, ranging from -1 to +1.

The Formula

โˆ’1โ‰คrโ‰ค1-1\le r\le 1

When to use: Do two things go up and down together? r=+1r = +1 means perfectly together, r=โˆ’1r = -1 means perfectly opposite.

Quick Example

Height and weight: rโ‰ˆ0.7r \approx 0.7 (positive). Temperature and heating bill: r<0r < 0 (negative).

Notation

rr summarizes direction and strength for a roughly linear association.

What This Formula Means

Correlation measures the strength and direction of the linear relationship between two quantitative variables, ranging from โˆ’1-1 to +1+1.

Do two things go up and down together? r=+1r = +1 means perfectly together, r=โˆ’1r = -1 means perfectly opposite.

Formal View

r=1nโˆ’1โˆ‘i=1n(xiโˆ’xห‰sx)(yiโˆ’yห‰sy)r = \frac{1}{n-1}\sum_{i=1}^{n}\left(\frac{x_i - \bar{x}}{s_x}\right)\left(\frac{y_i - \bar{y}}{s_y}\right) where โˆ’1โ‰คrโ‰ค1-1 \leq r \leq 1

Worked Examples

Example 1

medium
Given five data points (1,2),(2,4),(3,5),(4,4),(5,5)(1,2), (2,4), (3,5), (4,4), (5,5), compute the Pearson correlation coefficient rr.

Answer

rโ‰ˆ0.775r \approx 0.775

First step

1
Compute means: xห‰=3\bar{x} = 3, yห‰=4\bar{y} = 4.

See the full worked solution + why-it-works coaching

SetupKey insightWhy it worksCommon pitfallConnection

Unlock answer keys One Family plan โ€” every worked solution, all subjects

Example 2

easy
A study finds r=โˆ’0.85r = -0.85 between hours of TV watched per day and exam scores. Interpret this value.

Example 3

medium
Scatterplot shows data tightly clustered around a flat horizontal trend. Estimate rr.

Common Mistakes

  • Saying correlation proves causation โ€” association alone is not proof of cause.
  • Calling any upward pattern strong โ€” strength depends on how tightly dots cluster around the trend.
  • Using correlation with categorical data โ€” correlation needs paired numerical variables.

Why This Formula Matters

Correlation helps students read data claims carefully. It supports prediction while protecting against the common mistake of treating association as causation. Recognizing it by "Do the dots show a direction and tightness of pattern?" โ€” rather than by familiar numbers โ€” is what lets a student tell it apart from causation and scatter plot in a mixed problem set.

Frequently Asked Questions

What is the Correlation formula?

Correlation measures the strength and direction of the linear relationship between two quantitative variables, ranging from โˆ’1-1 to +1+1.

How do you use the Correlation formula?

Do two things go up and down together? r=+1r = +1 means perfectly together, r=โˆ’1r = -1 means perfectly opposite.

What do the symbols mean in the Correlation formula?

rr summarizes direction and strength for a roughly linear association.

Why is the Correlation formula important in Math?

Correlation helps students read data claims carefully. It supports prediction while protecting against the common mistake of treating association as causation. Recognizing it by "Do the dots show a direction and tightness of pattern?" โ€” rather than by familiar numbers โ€” is what lets a student tell it apart from causation and scatter plot in a mixed problem set.

What do students get wrong about Correlation?

The procedure for correlation is the easy part; the trap is saying correlation proves causation. Asking "Do the dots show a direction and tightness of pattern?" first is what keeps a correct-looking calculation from being attached to the wrong concept.

What should I learn before the Correlation formula?

Before studying the Correlation formula, you should understand: mean, standard deviation.