Probability DistributionsTopic #17 of 33

Student's t-Distribution

Distribution for small samples: degrees of freedom, t-tables, and comparison to normal.

Overview

The t-distribution is used when making inferences about population means when the population standard deviation is unknown and must be estimated from the sample.

When to Use

Use t-distribution instead of z when:

  • Population standard deviation σ\sigma is unknown
  • Using sample standard deviation ss
  • Sample size is small (though valid for any nn)

Degrees of Freedom

df=n1df = n - 1

Where nn is the sample size.

Properties

Propertyt-Distribution
ShapeSymmetric, bell-shaped
Mean0
Variancedfdf2\frac{df}{df - 2} for df>2df > 2
TailsHeavier than normal
As dfdf \to \inftyApproaches standard normal

Comparison to Normal

         Normal
           ↓
     ╭─────────╮
    ╱     t     ╲
───╱─────────────╲───
  • t has heavier tails
  • More probability in the tails
  • Same center (mean = 0)

Key t-Values

dfdft0.10t_{0.10}t0.05t_{0.05}t0.025t_{0.025}t0.01t_{0.01}t0.005t_{0.005}
13.0786.31412.70631.82163.657
51.4762.0152.5713.3654.032
101.3721.8122.2282.7643.169
201.3251.7252.0862.5282.845
301.3101.6972.0422.4572.750
\infty1.2821.6451.9602.3262.576

Note: tt_\infty equals z-values (standard normal).

Notation

tα,dft_{\alpha, df}

Example: t0.025,10=2.228t_{0.025, 10} = 2.228 (two-tailed 95% critical value with df=10df = 10)

T-Statistic for Sample Mean

t=xˉμ0s/nt = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}

Where:

  • xˉ\bar{x} = sample mean
  • μ0\mu_0 = hypothesized population mean
  • ss = sample standard deviation
  • nn = sample size

Applications

Confidence Interval for Mean

xˉ±tα/2,df×sn\bar{x} \pm t_{\alpha/2, df} \times \frac{s}{\sqrt{n}}

Hypothesis Testing

Test statistic follows t-distribution with df=n1df = n - 1 when testing means.

Examples

Example 1: Finding Critical Value

For 95% confidence interval with n=15n = 15:

df=151=14df = 15 - 1 = 14 α=0.05α/2=0.025\alpha = 0.05 \Rightarrow \alpha/2 = 0.025 t0.025,14=2.145t_{0.025, 14} = 2.145

Example 2: Confidence Interval

Sample: n=20n = 20, xˉ=85\bar{x} = 85, s=12s = 12

95% CI for μ\mu:

df=19,t0.025,19=2.093df = 19, \quad t_{0.025, 19} = 2.093 CI=85±2.093×1220=85±2.093×2.683=85±5.62\text{CI} = 85 \pm 2.093 \times \frac{12}{\sqrt{20}} = 85 \pm 2.093 \times 2.683 = 85 \pm 5.62 CI=(79.38,90.62)\text{CI} = (79.38, 90.62)

Example 3: T-Test

Testing H0:μ=100H_0: \mu = 100 vs H1:μ100H_1: \mu \neq 100

Sample: n=25n = 25, xˉ=95\bar{x} = 95, s=15s = 15

t=9510015/25=53=1.67t = \frac{95 - 100}{15/\sqrt{25}} = \frac{-5}{3} = -1.67 df=24,Critical value: t0.025,24=2.064df = 24, \quad \text{Critical value: } t_{0.025, 24} = 2.064 1.67<2.064fail to reject H0\lvert -1.67 \rvert < 2.064 \Rightarrow \text{fail to reject } H_0

When to Use Z vs T

SituationDistribution
σ\sigma knownZ
σ\sigma unknown, n30n \geq 30Z or T (similar)
σ\sigma unknown, n<30n < 30T
ProportionsZ

Assumptions for T-Tests

  1. Random sampling
  2. Independence of observations
  3. Population is approximately normal (or n30n \geq 30)