Descriptive StatisticsTopic #3 of 33

Range and Interquartile Range

Simple measures of spread: range, quartiles, IQR, and identifying outliers using the 1.5×IQR rule.

Overview

Range and IQR are measures of spread that describe how dispersed the data is. IQR is more robust to outliers than range.

Range

Definition

Range=MaximumMinimum\text{Range} = \text{Maximum} - \text{Minimum}

Properties

  • Simplest measure of spread
  • Uses only two values (extremes)
  • Very sensitive to outliers
  • Increases with sample size

Example

Data: 3, 7, 8, 12, 15, 18, 22

Range=223=19\text{Range} = 22 - 3 = 19

Interquartile Range (IQR)

Definition

IQR=Q3Q1\text{IQR} = Q_3 - Q_1

Where:

  • Q1Q_1 = 25th percentile (first quartile)
  • Q3Q_3 = 75th percentile (third quartile)

Properties

  • Measures spread of middle 50% of data
  • Resistant to outliers
  • Used in boxplots
  • More stable than range

Calculating Quartiles

  1. Order data from lowest to highest
  2. Find the median (Q2Q_2)
  3. Q1Q_1 = median of lower half
  4. Q3Q_3 = median of upper half

Five-Number Summary

A complete summary of data distribution:

ValueDescription
MinimumSmallest value
Q1Q_125th percentile
Median (Q2Q_2)50th percentile
Q3Q_375th percentile
MaximumLargest value

Outlier Detection

Using IQR Method

Lower fence:

Q11.5×IQRQ_1 - 1.5 \times \text{IQR}

Upper fence:

Q3+1.5×IQRQ_3 + 1.5 \times \text{IQR}
  • Values below lower fence or above upper fence are considered outliers
  • For extreme outliers: use 3.0×IQR3.0 \times \text{IQR}

Boxplot Components

                    ┌───────────────┐
    ─────────────── │      │        │ ───────────────
         Min       Q₁     Q₂       Q₃            Max
                    └───────────────┘
                         IQR
PartRepresents
BoxMiddle 50% (IQR)
Line in boxMedian
WhiskersData within 1.5×IQR
PointsOutliers

Examples

Example 1: Calculating IQR

Data: 2, 3, 5, 6, 7, 8, 9, 11, 12

n=9,Median=7n = 9, \quad \text{Median} = 7

Lower half: 2, 3, 5, 6 → Q1=3+52=4Q_1 = \frac{3+5}{2} = 4

Upper half: 8, 9, 11, 12 → Q3=9+112=10Q_3 = \frac{9+11}{2} = 10

IQR=104=6\text{IQR} = 10 - 4 = 6

Example 2: Outlier Detection

Using Example 1: Q1=4Q_1 = 4, Q3=10Q_3 = 10, IQR=6\text{IQR} = 6

Lower fence=41.5(6)=49=5\text{Lower fence} = 4 - 1.5(6) = 4 - 9 = -5 Upper fence=10+1.5(6)=10+9=19\text{Upper fence} = 10 + 1.5(6) = 10 + 9 = 19

Any value below -5 or above 19 is an outlier.

Example 3: With Outliers

Data: 2, 3, 5, 6, 7, 8, 9, 11, 12, 50

  • Range = 50 - 2 = 48 (heavily affected by outlier)
  • Q1=4.5Q_1 = 4.5, Q3=10.5Q_3 = 10.5, IQR = 6 (minimally affected)
  • Upper fence = 10.5 + 9 = 19.5
  • 50 > 19.5, so 50 is an outlier

Comparison: Range vs IQR

AspectRangeIQR
FormulaMax - MinQ3Q1Q_3 - Q_1
UsesExtreme valuesMiddle 50%
Outlier resistanceNoneHigh
Information used2 pointsAll data
Best forQuick overviewRobust analysis

Semi-Interquartile Range

Also called quartile deviation:

SIQR=Q3Q12=IQR2\text{SIQR} = \frac{Q_3 - Q_1}{2} = \frac{\text{IQR}}{2}