Overview
A sampling distribution is the probability distribution of a statistic (like the sample mean) calculated from all possible samples of a given size from a population.
Key Concepts
| Term | Definition |
|---|---|
| Population | The entire group of interest |
| Sample | A subset of the population |
| Parameter | A numerical characteristic of a population (, ) |
| Statistic | A numerical characteristic of a sample (, ) |
| Sampling distribution | Distribution of a statistic over all possible samples |
Sampling Distribution of the Mean
If we take all possible samples of size from a population and calculate for each:
Mean of
The mean of sample means equals the population mean.
Standard Error of the Mean
The standard deviation of sample means (standard error) decreases as increases.
Standard Error
The standard error (SE) measures sampling variability:
Interpretation
Smaller SE means:
- Sample means cluster more tightly around
- Estimates are more precise
- Larger sample sizes give smaller SE
Properties
| Property | Formula |
|---|---|
| Mean of | |
| Variance of | |
| Standard Error |
Effect of Sample Size
| SE relative to | |
|---|---|
| 1 | |
| 4 | |
| 9 | |
| 25 | |
| 100 |
Quadrupling cuts SE in half.
Other Sampling Distributions
Sample Proportion
For proportion from samples of size :
Difference of Means
For from independent samples:
Sampling Variability
Population
↓
┌─────────────────────────────┐
│ Sample 1 → x̄₁ │
│ Sample 2 → x̄₂ │ → Sampling Distribution
│ Sample 3 → x̄₃ │ of x̄
│ ... │
│ Sample k → x̄ₖ │
└─────────────────────────────┘
Each sample gives a different , creating variability.
Examples
Example 1: Standard Error
Population: ,
Sample size :
Sample size :
Example 2: Probability Using SE
Population: , , Sample
?
Example 3: Required Sample Size
To cut SE in half from current value, need:
To reduce SE from 10 to 5 when :
Importance
- Inference foundation: Understanding sampling variability enables hypothesis testing and confidence intervals
- Precision planning: Calculate required sample sizes
- Estimating parameters: Quantify uncertainty in estimates