Overview
Variance and standard deviation measure the spread or dispersion of data around the mean. They tell us how much individual values typically differ from the average.
Key Concepts
| Measure | Symbol | Description |
|---|---|---|
| Variance | (population), (sample) | Average squared deviation |
| Standard Deviation | (population), (sample) | Square root of variance |
Key Formulas
Population Variance
Sample Variance
Note: We divide by for samples (Bessel's correction) to get an unbiased estimate.
Population Standard Deviation
Sample Standard Deviation
Computational Formulas
These are algebraically equivalent but often easier to calculate:
Variance (Computational Form)
Standard Deviation
Calculation Steps
- Calculate the mean ()
- Find each deviation from the mean ()
- Square each deviation
- Sum the squared deviations
- Divide by (sample) or (population)
- For SD, take the square root
Properties
Variance Properties
- Always non-negative ()
- Zero only when all values are identical
- Units are squared (e.g., meters² for length data)
- More sensitive to outliers than range
Standard Deviation Properties
- Same units as the original data
- Approximately 68% of data within 1 SD of mean (for normal distributions)
- Approximately 95% within 2 SD
- Approximately 99.7% within 3 SD
Transformations
For :
Note: Adding a constant doesn't change the spread.
Coefficient of Variation
A relative measure of variability:
Useful for comparing variability across datasets with different units or scales.
Examples
Example 1: Basic Calculation
Data: 2, 4, 4, 4, 5, 5, 7, 9
| 2 | -3 | 9 |
| 4 | -1 | 1 |
| 4 | -1 | 1 |
| 4 | -1 | 1 |
| 5 | 0 | 0 |
| 5 | 0 | 0 |
| 7 | 2 | 4 |
| 9 | 4 | 16 |
| Sum | 0 | 32 |
Example 2: Computational Formula
Same data: 2, 4, 4, 4, 5, 5, 7, 9
Example 3: Comparing Datasets
| Dataset | Mean | SD | CV |
|---|---|---|---|
| A | 100 | 15 | 15% |
| B | 50 | 10 | 20% |
Dataset B has higher relative variability despite lower absolute SD.
Population vs Sample
| Aspect | Population | Sample |
|---|---|---|
| Symbol | , | , |
| Divisor | ||
| Mean | ||
| Use | Known full population | Estimating from sample |