Confounding Variables Examples in Statistics

Start with the recap, study the fully worked examples, then use the practice problems to check your understanding of Confounding Variables.

This page combines explanation, solved examples, and follow-up practice so you can move from recognition to confident problem-solving in Statistics.

Concept Recap

A variable that influences both the independent and dependent variables, creating a spurious association that can be mistaken for causation.

Ice cream sales and drowning deaths correlate. Confounding variable: hot weather. It causes both! Without recognizing confounders, you'd wrongly blame ice cream for drowning.

Read the full concept explanation โ†’

How to Use These Examples

  • Read the first worked example with the solution open so the structure is clear.
  • Try the practice problems before revealing each solution.
  • Use the related concepts and background knowledge badges if you feel stuck.

What to Focus On

Core idea: A confounding variable is related to both the explanatory and response variables, creating a false appearance of a direct relationship between them.

Common stuck point: Students accept correlations as causal without asking 'what else could explain this relationship?' Always consider whether a third variable could account for the pattern.

Worked Examples

Example 1

easy
A study finds that children who have more books at home get higher test scores. A researcher concludes that buying more books will raise test scores. Identify the confounding variable.

Solution

  1. 1
    Step 1: The two variables are: number of books at home and test scores.
  2. 2
    Step 2: A confounding variable is family income (or parental education level). Wealthier or more educated families tend to have more books AND provide more educational support, tutoring, and resources.
  3. 3
    Step 3: The books themselves may not cause higher scores โ€” the underlying factor (family resources/education level) affects both variables simultaneously.

Answer

Family income or parental education is a confounding variable โ€” it is associated with both having more books and higher test scores. Simply buying more books may not raise scores.
A confounding variable is a third variable that is related to both the explanatory and response variables, creating a spurious association between them. Identifying confounders is essential for correctly interpreting observed relationships.

Example 2

medium
Hospitals A and B both perform a surgery. Hospital A has a 90% survival rate, Hospital B has 95%. However, Hospital A takes more high-risk patients. When only high-risk patients are compared, Hospital A has a higher survival rate. Explain this paradox.

Practice Problems

Try these problems on your own first, then open the solution to compare your method.

Example 1

medium
A study finds that people who own swimming pools are more likely to have skin cancer. Identify a possible confounding variable and explain how random assignment could address it.

Example 2

hard
A company claims its energy drink improves athletic performance because athletes who drink it run faster. However, the athletes were not randomly assigned โ€” those who drank the energy drink also trained harder. (a) Identify the confounding variable. (b) Explain what 'controlling for' a confounding variable means. (c) How could the study be redesigned?

Related Concepts

Background Knowledge

These ideas may be useful before you work through the harder examples.

correlation vs causationvariables