4.6 : Variance
The deviations show how spread out the data are about the mean. A positive deviation occurs when the data value exceeds the mean, whereas a negative deviation occurs when the data value is less than the mean. If the deviations are added, the sum is always zero. So one cannot simply add the deviations to get the data spread. By squaring the deviations, the numbers are made positive; thus, their sum will also be positive.
The standard deviation measures the spread in the same units as the data. The variance is defined as the square of the standard deviation. Thus, its units differ from that of the original data. The sample variance is represented by , while the population variance is represented by
.
For variance, the calculation uses a division by n – 1 instead of n because the data is a sample. This change is due to the sample variance being an estimate of the population variance. Based on the theoretical mathematics behind these calculations, dividing by (n – 1) gives a better estimate of the population variance.
This text is adapted from Openstax, Introductory Statistics, Section 2.7 Measure of the Spread of the Data.
From Chapter 4:
Now Playing
4.6 : Variance
Measures of Variation
9.8K Views
4.1 : What is Variation?
Measures of Variation
11.9K Views
4.2 : Range
Measures of Variation
11.7K Views
4.3 : Standard Deviation
Measures of Variation
16.7K Views
4.4 : Standard Error of the Mean
Measures of Variation
5.9K Views
4.5 : Calculating Standard Deviation
Measures of Variation
7.5K Views
4.7 : Coefficient of Variation
Measures of Variation
3.9K Views
4.8 : Range Rule of Thumb to Interpret Standard Deviation
Measures of Variation
9.1K Views
4.9 : Empirical Method to Interpret Standard Deviation
Measures of Variation
5.3K Views
4.10 : Chebyshev's Theorem to Interpret Standard Deviation
Measures of Variation
4.3K Views
4.11 : Mean Absolute Deviation
Measures of Variation
2.7K Views