Overview
The normal distribution (Gaussian distribution) is the most important continuous probability distribution. It describes many natural phenomena and is fundamental to statistical inference.
Probability Density Function
f ( x ) = 1 σ 2 π e − ( x − μ ) 2 2 σ 2 f(x) = \frac{1}{\sigma\sqrt{2\pi}} e^{-\frac{(x-\mu)^2}{2\sigma^2}} f ( x ) = σ 2 π 1 e − 2 σ 2 ( x − μ ) 2
Parameters
μ \mu μ (mu) : Mean (center of the distribution)
σ \sigma σ (sigma) : Standard deviation (spread)
Notation
X ∼ N ( μ , σ 2 ) X \sim N(\mu, \sigma^2) X ∼ N ( μ , σ 2 )
or
X ∼ Normal ( μ , σ ) X \sim \text{Normal}(\mu, \sigma) X ∼ Normal ( μ , σ )
Properties
Property Value Mean μ \mu μ Median μ \mu μ Mode μ \mu μ Variance σ 2 \sigma^2 σ 2 Standard Deviation σ \sigma σ Skewness 0 Kurtosis (excess) 0
The Bell Curve
╭───────╮
╱ ╲
╱ ╲
╱ ╲
────╱───────────────╲────
μ
← σ → ← σ →
Symmetric about the mean
Asymptotic to x-axis (never touches)
Total area under curve = 1
The Empirical Rule (68-95-99.7)
For normally distributed data:
P ( μ − σ < X < μ + σ ) ≈ 68 % P(\mu - \sigma < X < \mu + \sigma) \approx 68\% P ( μ − σ < X < μ + σ ) ≈ 68%
P ( μ − 2 σ < X < μ + 2 σ ) ≈ 95 % P(\mu - 2\sigma < X < \mu + 2\sigma) \approx 95\% P ( μ − 2 σ < X < μ + 2 σ ) ≈ 95%
P ( μ − 3 σ < X < μ + 3 σ ) ≈ 99.7 % P(\mu - 3\sigma < X < \mu + 3\sigma) \approx 99.7\% P ( μ − 3 σ < X < μ + 3 σ ) ≈ 99.7%
Range Approximate % μ ± 1 σ \mu \pm 1\sigma μ ± 1 σ 68.27% μ ± 2 σ \mu \pm 2\sigma μ ± 2 σ 95.45% μ ± 3 σ \mu \pm 3\sigma μ ± 3 σ 99.73%
Standard Normal Distribution
The standard normal has μ = 0 \mu = 0 μ = 0 and σ = 1 \sigma = 1 σ = 1 :
Z ∼ N ( 0 , 1 ) Z \sim N(0, 1) Z ∼ N ( 0 , 1 )
Standardization
Convert any normal to standard normal:
Z = X − μ σ Z = \frac{X - \mu}{\sigma} Z = σ X − μ
Convert back:
X = μ + Z ⋅ σ X = \mu + Z \cdot \sigma X = μ + Z ⋅ σ
Working with Normal Distributions
Finding Probabilities
Standardize: Z = ( X − μ ) / σ Z = (X - \mu) / \sigma Z = ( X − μ ) / σ
Use Z-table or calculator
Adjust for direction (left, right, between)
Finding Values from Probabilities
Find Z Z Z from the probability
Convert: X = μ + Z ⋅ σ X = \mu + Z \cdot \sigma X = μ + Z ⋅ σ
Z-Table Reference
Z Z Z P ( Z < z ) P(Z < z) P ( Z < z ) Z Z Z P ( Z < z ) P(Z < z) P ( Z < z ) -3.0 0.0013 0.0 0.5000 -2.0 0.0228 1.0 0.8413 -1.0 0.1587 2.0 0.9772 -0.5 0.3085 3.0 0.9987
Examples
Example 1: Finding Probability
Test scores are N ( 70 , 10 ) N(70, 10) N ( 70 , 10 ) . Find P ( X < 85 ) P(X < 85) P ( X < 85 ) .
Z = 85 − 70 10 = 1.5 Z = \frac{85 - 70}{10} = 1.5 Z = 10 85 − 70 = 1.5
P ( Z < 1.5 ) = 0.9332 P(Z < 1.5) = 0.9332 P ( Z < 1.5 ) = 0.9332
Example 2: Greater Than
Same distribution. Find P ( X > 60 ) P(X > 60) P ( X > 60 ) .
Z = 60 − 70 10 = − 1.0 Z = \frac{60 - 70}{10} = -1.0 Z = 10 60 − 70 = − 1.0
P ( Z > − 1.0 ) = 1 − P ( Z < − 1.0 ) = 1 − 0.1587 = 0.8413 P(Z > -1.0) = 1 - P(Z < -1.0) = 1 - 0.1587 = 0.8413 P ( Z > − 1.0 ) = 1 − P ( Z < − 1.0 ) = 1 − 0.1587 = 0.8413
Example 3: Between Two Values
Find P ( 65 < X < 80 ) P(65 < X < 80) P ( 65 < X < 80 ) .
Z 1 = 65 − 70 10 = − 0.5 Z_1 = \frac{65 - 70}{10} = -0.5 Z 1 = 10 65 − 70 = − 0.5
Z 2 = 80 − 70 10 = 1.0 Z_2 = \frac{80 - 70}{10} = 1.0 Z 2 = 10 80 − 70 = 1.0
P ( − 0.5 < Z < 1.0 ) = P ( Z < 1.0 ) − P ( Z < − 0.5 ) = 0.8413 − 0.3085 = 0.5328 P(-0.5 < Z < 1.0) = P(Z < 1.0) - P(Z < -0.5) = 0.8413 - 0.3085 = 0.5328 P ( − 0.5 < Z < 1.0 ) = P ( Z < 1.0 ) − P ( Z < − 0.5 ) = 0.8413 − 0.3085 = 0.5328
Example 4: Finding a Percentile
Find the score at the 90th percentile.
P ( Z < z ) = 0.90 ⇒ z ≈ 1.28 P(Z < z) = 0.90 \quad \Rightarrow \quad z \approx 1.28 P ( Z < z ) = 0.90 ⇒ z ≈ 1.28
X = μ + Z ⋅ σ = 70 + 1.28 × 10 = 82.8 X = \mu + Z \cdot \sigma = 70 + 1.28 \times 10 = 82.8 X = μ + Z ⋅ σ = 70 + 1.28 × 10 = 82.8
Linear Combinations
If X ∼ N ( μ X , σ X 2 ) X \sim N(\mu_X, \sigma_X^2) X ∼ N ( μ X , σ X 2 ) and Y ∼ N ( μ Y , σ Y 2 ) Y \sim N(\mu_Y, \sigma_Y^2) Y ∼ N ( μ Y , σ Y 2 ) are independent:
a X + b Y ∼ N ( a μ X + b μ Y , a 2 σ X 2 + b 2 σ Y 2 ) aX + bY \sim N(a\mu_X + b\mu_Y, a^2\sigma_X^2 + b^2\sigma_Y^2) a X + bY ∼ N ( a μ X + b μ Y , a 2 σ X 2 + b 2 σ Y 2 )
Why So Important?
Many natural phenomena are approximately normal
Central Limit Theorem: sample means are normally distributed
Basis for many statistical tests
Convenient mathematical properties