Outlier Detection Statistics Example 3

Follow the full solution, then compare it with the other examples linked below.

Example 3

medium
A scientist records reaction times (ms): 245, 260, 255, 270, 250, 980, 265, 258. Use the 1.5ร—IQR1.5 \times IQR rule to determine if 980 is an outlier. Should it be removed from the analysis?

Solution

  1. 1
    Step 1: Sorted: 245,250,255,258,260,265,270,980. Q1=250+2552=252.5Q_1 = \frac{250+255}{2} = 252.5, Q3=265+2702=267.5Q_3 = \frac{265+270}{2} = 267.5, IQR=15IQR = 15. Upper fence: 267.5+22.5=290267.5 + 22.5 = 290. Since 980 > 290, it is an outlier.
  2. 2
    Step 2: Whether to remove depends on context. If 980 ms resulted from a distraction or equipment malfunction, removal is justified. If it represents a genuine slow reaction, it should be kept but noted. Investigate the cause before deciding.

Answer

980 ms is an outlier (exceeds the upper fence of 290 ms). Whether to remove it depends on whether it resulted from an error or a genuine observation โ€” investigate the cause.
Statistical outlier detection identifies unusual values, but the decision to remove them requires domain knowledge. Outliers may represent errors (remove), rare but genuine phenomena (keep), or data entry mistakes (correct). Blindly removing outliers can introduce bias.

About Outlier Detection

Outlier detection is the process of identifying data points that are unusually far from the rest of the dataset, using techniques like the IQR rule, z-scores, or visual inspection of box plots and scatter plots. These anomalous values may indicate measurement errors, data entry mistakes, or genuinely extreme observations.

Learn more about Outlier Detection โ†’

More Outlier Detection Examples