Statistics · Grade 9-12 · 5 min read

Empirical Rule

⚡ In one breath

The empirical rule (also called the 68-95-99.

📐 The formula

P(μσ<X<μ+σ)0.68P(\mu - \sigma < X < \mu + \sigma) \approx 0.68

Orient

The one-line idea, why it matters, and the intuition.

Section 1

Quick Answer

The empirical rule (also called the 68-95-99.7 rule) states that for a normal distribution, approximately 68% of data falls within one standard deviation of the mean, about 95% falls within two standard deviations, and roughly 99.7% falls within three standard deviations. In a classroom problem, the key is not to spot the word "Empirical Rule" and rush. First identify the question, the data structure, and the conclusion being requested. Use empirical rule when the situation involves outcomes, events, trials, sample spaces, or long-run chance behavior. The recognition test is: Am I reasoning about what can happen and how likely it is, with the correct sample space or condition?

Section 2

Why This Matters

Empirical Rule helps students reason about uncertainty without guessing. It connects outcomes, sample spaces, and event rules so students can decide whether to add, multiply, condition, simulate, or compare long-run behavior.

Section 3

Intuitive Explanation

Think of Empirical Rule as a lens for answering one particular kind of data question. The lens focuses attention on chance process: what was measured, how the values or groups are arranged, and what kind of statement the final answer should make. If that structure is missing, the same numbers can lead students toward the wrong statistical tool.

a game uses a spinner and a number cube, and students need to decide which outcomes count as success. A quick response might jump straight to a number, but the stronger response asks what the number would mean. Empirical Rule is useful only when the result can be tied back to the question, the group being studied, and the way the data were gathered or displayed.

The formula gives a compact way to carry out the idea, but the formula is not the first step. The first step is deciding that the situation matches the concept: Am I reasoning about what can happen and how likely it is, with the correct sample space or condition?

A reliable habit is to say the mental model out loud: "Map outcomes before chances." Then test the situation against nearby ideas. If the task is really about relative frequency, data display, or deterministic rule, switch tools before doing arithmetic. Good statistics is less about using every possible method and more about choosing the method that matches the evidence.

Core idea

For a roughly normal distribution, the empirical rule gives the percentage of data inside one, two, and three standard deviations of the mean: about 68%, 95%, and 99.7%.

Recognize

The cues that signal this concept and how to distinguish it from look-alikes.

Section 4

When to Use

Use Empirical Rule when the situation involves outcomes, events, trials, sample spaces, or long-run chance behavior. Strong signals include **chance**, **probability**, **outcome**, **event**, **trial**, **random**, **given**. The safest workflow is to read the final question first, identify the data source and variable, and then test the structure. Do not use empirical rule just because familiar numbers or words appear; first decide whether the situation answers "Am I reasoning about what can happen and how likely it is, with the correct sample space or condition?" with yes.

✨ Pro tip

Ask: Am I reasoning about what can happen and how likely it is, with the correct sample space or condition?

Section 5

How to Recognize It

Before using Empirical Rule, ask: does the prompt require you to compare values to the centre and spread of the distribution?

  1. Does the prompt give mean, standard deviation, shape of the distribution, and where the value sits relative to centre, and does it ask you to compare values to the centre and spread of the distribution?

    Yes means empirical rule is in play; no means the prompt is probably asking for Normal Distribution or another neighboring idea.

  2. Does the requested answer call for shape, or is it really about Normal Distribution?

    Choose Empirical Rule when the final answer needs compare values to the centre and spread of the distribution; choose Normal Distribution when the prompt centers on normal distribution instead.

  3. Do the given details include mean, standard deviation, shape of the distribution, and where the value sits relative to centre?

    Those details are the evidence for empirical rule. If they are missing, the concept may be only a vocabulary clue.

  4. Does the prompt's distribution match how the definition of Empirical Rule uses it?

    A matching use points toward Empirical Rule; a different use usually means a sibling concept is closer.

  5. Could a watch-out apply here — for example, the prompt asks for a single probability of an event rather than a distribution feature?

    If so, reconsider Normal Distribution. If not, keep Empirical Rule and state the specific cue that made it fit.

Section 6

Empirical Rule vs Normal Distribution vs Skewness vs Mean as Fair Share

Empirical Rule, Normal Distribution, Skewness, Mean as Fair Share get mixed up because they can appear near 68-95-99.7 rule and three-sigma rule. The difference is the final job: Empirical Rule asks for shape, while the other rows point to different cues.

Empirical Rule

Meaning
The empirical rule (also called the 68-95-99.7 rule) states that for a normal distribution, approximately 68% of data falls within one standard deviation of the mean, about 95% falls within two standard deviations, and roughly 99.7% falls within three standard deviations.
Key test
Use when the prompt asks for shape: compare values to the centre and spread of the distribution.
Formula
P(μσ<X<μ+σ)0.68P(\mu - \sigma < X < \mu + \sigma) \approx 0.68
Example
Heights with μ = 170 cm, σ = 10 cm: about 68% of people are 160–180 cm, 95% are 150–190 cm.

Normal Distribution

Meaning
The normal distribution (bell curve) is a symmetric, bell-shaped probability distribution where most data clusters around the mean, with probabilities decreasing symmetrically toward the tails.
Key test
Use instead when normal distribution and bell curve is the main cue, not Empirical Rule.
Formula
Normal Distribution pattern
Example
SAT scores: Mean 1060, most students 960-1160.

Skewness

Meaning
A measure of how asymmetric a probability distribution is around its mean — positive skew tails right, negative skew tails left.
Key test
Use instead when skew and distribution skew is the main cue, not Empirical Rule.
Formula
skewness=n(n1)(n2)(xixˉs)3\text{skewness} = \frac{n}{(n-1)(n-2)} \sum\left(\frac{x_i - \bar{x}}{s}\right)^3
Example
Income distribution is right-skewed: most earn moderate incomes, but a few earn millions, pulling the mean up.

Mean as Fair Share

Meaning
The mean (average) represents what each person would get if the total were divided equally among everyone.
Key test
Use instead when mean and average is the main cue, not Empirical Rule.
Formula
xˉ=x1+x2++xnn\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n}
Example
Test scores: 70, 80, 90.

Apply

Worked examples and the mistakes most students make.

Section 7

Formula & Notation

P(μσ<X<μ+σ)0.68P(\mu - \sigma < X < \mu + \sigma) \approx 0.68
For XN(μ,σ2)X \sim N(\mu, \sigma^2): P(μσ<X<μ+σ)0.6827P(\mu - \sigma < X < \mu + \sigma) \approx 0.6827, P(μ2σ<X<μ+2σ)0.9545P(\mu - 2\sigma < X < \mu + 2\sigma) \approx 0.9545, P(μ3σ<X<μ+3σ)0.9973P(\mu - 3\sigma < X < \mu + 3\sigma) \approx 0.9973.

Section 8

Worked Examples

Example 1 — Recognize when the empirical rule applies

Easy

Problem

A class records 200 student heights. A histogram of the data is roughly bell-shaped, with mean 165 cm and standard deviation 7 cm. About what percent of students are between 158 cm and 172 cm?

Solution

  1. Check that the data are approximately normal.

    The empirical rule applies only to roughly bell-shaped distributions; otherwise the 68/95/99.7 numbers are unreliable.

  2. Translate the interval into standard deviations from the mean.

    The rule is stated in standard-deviation units, not raw values. 158 cm and 172 cm are exactly one standard deviation (7 cm) below and above the mean (165 cm).

  3. Read off the corresponding percentage from the rule.

    About 68% of values lie within one standard deviation of the mean for a normal distribution.

Answer

About 68% of students — roughly 136 of the 200 — have heights between 158 cm and 172 cm.

Example 2 — Avoid the nearby trap

Standard

Problem

A student sees the word “chance” in a problem about a spinner and writes "By the empirical rule, P(red) ≈ 68%." Explain why the empirical rule is the wrong tool here.

Solution

  1. Decide whether the situation is a normal distribution.

    The empirical rule describes how values spread around a mean for continuous, bell-shaped data. A spinner produces a discrete chance outcome, not a normal distribution.

  2. Match the tool to the structure.

    Probabilities for a spinner come from counting equally-likely outcomes or from a probability model, not from the 68-95-99.7 percentages.

Answer

The empirical rule does not apply because the spinner is not a normal distribution. The student should use a basic probability model (favourable outcomes over total outcomes), not the 68/95/99.7 percentages.

Example 3 — Use it for a tail estimate

Application

Problem

IQ scores are approximately normal with mean 100 and standard deviation 15. Estimate the percentage of people with IQ above 130.

Solution

  1. Express 130 as standard deviations from the mean.

    130 is exactly two standard deviations above 100, so the question is asking for the right-hand tail beyond +2σ.

  2. Use the 95% rule and split the remaining 5% symmetrically.

    About 95% of a normal distribution lies within ±2σ, leaving about 5% outside. By symmetry, roughly half — about 2.5% — sits in each tail.

Answer

About 2.5% of people have IQ above 130 under the empirical rule.

Section 9

Common Mistakes

Common slip-up

Applying the rule to non-normal distributions

The right idea

The safer move is to ask "Am I reasoning about what can happen and how likely it is, with the correct sample space or condition?" and then state the data source, denominator, or variable before interpreting the result.

Common slip-up

Confusing the percentages (e.g., saying 95% for one sigma)

The right idea

The safer move is to ask "Am I reasoning about what can happen and how likely it is, with the correct sample space or condition?" and then state the data source, denominator, or variable before interpreting the result.

Common slip-up

Forgetting the rule gives approximate, not exact, percentages

The right idea

The safer move is to ask "Am I reasoning about what can happen and how likely it is, with the correct sample space or condition?" and then state the data source, denominator, or variable before interpreting the result.

Common slip-up

Choosing empirical rule from a keyword alone

The right idea

Keywords like chance, probability, outcome are only clues; the data structure must match the concept.

Practice

Try it, then see where this concept fits in the path.

Section 10

Mini Practice

Try these on your own. Tap Reveal when you want to check.

  1. A problem asks students to interpret a game uses a spinner and a number cube, and students need to decide which outcomes count as success. What is the first clue that Empirical Rule might apply?

    Hint: Look for the question type, not just a keyword.

  2. Write one sentence explaining why Empirical Rule is not just a formula or graph label.

    Hint: Mention the interpretation.

  3. A student confuses Empirical Rule with Relative frequency. What should they compare?

    Hint: Compare what each idea answers.

  4. What information must be stated in the final answer when using Empirical Rule?

    Hint: Think units, group, and meaning.

  5. Give one reason a problem that mentions probability might still NOT use Empirical Rule.

    Hint: Use the "not" condition.

  6. Rewrite this weak explanation: "I used Empirical Rule because it was in the problem."

    Hint: Use the recognition test.

Want the full set?

50 practice questions for this concept — free to try, every one with a complete worked solution showing the why, not just the answer.

Section 11

Frequently Asked Questions

What is Empirical Rule in simple terms?

Empirical Rule is a statistics idea for situations where the situation involves outcomes, events, trials, sample spaces, or long-run chance behavior. In simple terms, it helps turn chance process into a probability, event description, or long-run expectation with the sample space named.

How do I know when to use Empirical Rule?

Use empirical rule when the problem passes this recognition test: Am I reasoning about what can happen and how likely it is, with the correct sample space or condition? Also check for signal words such as chance, probability, outcome, event, trial, but do not rely on keywords alone.

What is the most common mistake with Empirical Rule?

The common mistake is choosing empirical rule because a familiar word appears, without checking the data structure. A safer habit is to name the data source, variable or event, and final answer form before calculating.

How is Empirical Rule different from Relative frequency?

Empirical Rule is used when the situation involves outcomes, events, trials, sample spaces, or long-run chance behavior. Relative frequency is different because relative frequency uses observed data; probability may describe a model before or after data is collected. Compare the final question before choosing.

Does Empirical Rule always require a formula?

This concept often uses the formula P(μσ<X<μ+σ)0.68P(\mu - \sigma < X < \mu + \sigma) \approx 0.68, but the formula should come after recognition. First decide that the situation really asks for a probability, event description, or long-run expectation with the sample space named.

What should a complete answer include?

A complete answer should include the result or judgment, the context of the data, and a clear interpretation. For empirical rule, that means explaining how the evidence supports a probability, event description, or long-run expectation with the sample space named without overstating the conclusion. When possible, also name the group, variable, event, or study condition so a reader can tell exactly what the statement describes.

Section 12

Learning Path

Empirical Rule

You are here

Next →

You're at the end!
Before this, students should be comfortable with Normal Distribution. This page focuses on the recognition cue: Am I reasoning about what can happen and how likely it is, with the correct sample space or condition? That cue connects earlier data habits to later reasoning because students learn to choose the right representation, calculation, or interpretation before writing a conclusion. After this, students can use Empirical Rule as one tool inside broader statistical reasoning.

Section 13

See Also