Practice Outlier Detection in Statistics

Use these practice problems to test your method after reviewing the concept explanation and worked examples.

Quick Recap

Outlier detection is the process of identifying data points that are unusually far from the rest of the dataset, using techniques like the IQR rule, z-scores, or visual inspection of box plots and scatter plots. These anomalous values may indicate measurement errors, data entry mistakes, or genuinely extreme observations.

Outliers are data points that don't fit the pattern. A 7-foot student in a class of average heights, or a \$10 million house in a neighborhood of \$300k homes. They may be errors or genuinely unusual.

Showing a random 20 of 50 problems.

Example 1

hard
A data set has mean xห‰=100\bar{x} = 100 and standard deviation s=15s = 15. Using the z-score method, determine whether the values 60, 145, and 155 are outliers (using the threshold โˆฃzโˆฃ>2|z| > 2).

Example 2

medium
Why might using ONLY the z-score method miss outliers in a skewed data set?

Example 3

hard
Why does the IQR rule generally identify FEWER outliers than the โˆฃzโˆฃ>3|z|>3 rule for skewed data?

Example 4

hard
For data {0.5,1,1.2,1.5,1.8,2,2.2,5}\{0.5, 1, 1.2, 1.5, 1.8, 2, 2.2, 5\} with Q1=1.1,Q3=2.1Q_1 = 1.1, Q_3 = 2.1, is 55 an outlier?

Example 5

easy
A data set has Q1=10Q_1 = 10 and Q3=20Q_3 = 20. Find the IQR.

Example 6

medium
For ages of attendees at a children's birthday party, the value 4242 appears next to a long list of values from 44 to 1010. Is 4242 likely an outlier?

Example 7

challenge
Data: 2,4,6,8,10,12,14,162, 4, 6, 8, 10, 12, 14, 16 with Q1=5Q_1=5, Q3=13Q_3=13. Find both fences and state whether any value is an outlier.

Example 8

easy
Data: 4,5,6,7,8,9,1004, 5, 6, 7, 8, 9, 100. Which value is most likely an outlier?

Example 9

medium
Test scores: 72, 75, 78, 80, 82, 85, 88, 90, 92, 95. A new student's score of 25 is added. How does this outlier affect the mean and median?

Example 10

easy
True or false: outliers should always be removed before analysis.

Example 11

easy
With Q1=10Q_1=10, Q3=20Q_3=20, IQR=10IQR=10, find the lower fence Q1โˆ’1.5โ‹…IQRQ_1 - 1.5\cdot IQR.

Example 12

easy
In the โˆฃzโˆฃ>3|z| > 3 rule, zz stands for what?

Example 13

medium
A z-score is calculated as z=โˆ’3.4z = -3.4 for some value xx. What does that mean?

Example 14

hard
With Q1=25,Q3=75Q_1=25, Q_3=75, find both IQR-rule fences.

Example 15

easy
With Q1=30,Q3=50,IQR=20Q_1=30, Q_3=50, IQR=20, what is the upper fence?

Example 16

challenge
Explain why the IQR rule is more robust to extreme values than the z-score rule (one concise reason).

Example 17

medium
A value has z-score โˆ’2.5-2.5 in a roughly normal distribution. By the โˆฃzโˆฃ>3|z|>3 rule, is it an outlier?

Example 18

medium
For Q1=12,Q3=20Q_1 = 12, Q_3 = 20, is 3030 an outlier by the IQR rule?

Example 19

hard
A teacher records 3030 students' typing speeds. A z-score โˆฃzโˆฃ>3|z| > 3 rule flags two values. What should the teacher do?

Example 20

medium
A value is 88, the mean is 2020, and the SD is 44. Is it an outlier by the โˆฃzโˆฃ>3|z|>3 rule?