Computing with the normal distribution requires knowledge of the population parameters $\mu$ and $\sigma$. In practice, these parameters are often unknown. In such cases, we use the sample mean $\overline{X}$ and the sample standard deviation $s$ to estimate the population mean and standard deviation. But how good are these estimates?
Let us start with the sample mean $\overline X$. Different random samples will give different sample means. Thus the sample mean $\overline X$ is a random variable as well. What are the properties of this random variable, such as the shape of its distribution, its expected value and its standard deviation? The Central Limit Theorem (CLT) provides the answer to this question.
To illustrate the relation between population mean and all sample means consider the following data on the number of hours per week four people have spent watching videos on the internet:$$ \begin{array}{l|c|c|c|c} \text { Person } & \text{ Ann } & \text { Bob } & \text { Cathy } & \text { Dylan } \\ \hline \text { Hours } & 16 & 8 & 4 & 10 \end{array}$$The mean number of videos watched in this (very small) population is$\mu = \frac{16 + 8 + 4 + 10}{4} = 9.5$ hours. From this population we can draw without replacement all possible samples of size $n=2$. The sample means for all possible samples of size 2 are:$$ \begin{array}{l|c|c} \text { Sample } & \text{ Sample Mean } \\ \hline \text { Ann, Bob } & 12 \\ \text { Ann, Cathy } & 10 \\ \text { Ann, Dylan } & 13 \\ \text { Bob, Cathy } & 6 \\ \text { Bob, Dylan } & 9 \\ \text { Cathy, Dylan } & 7 \end{array}$$The mean of all possible sample means is$$ \mu_{\overline X} = \frac{12 + 10 + 13 + 6 + 9 + 7}{6} = 9.5 $$which is the same as the population mean. This is no coincidence. The claim that the mean of all possible sample means is equal to the population mean holds for any sample size and any population distribution. This is one of the assertions of the Central Limit Theorem.
For large samples of size $ n > 30$ the distribution of sample means $\overline X$ is approximately normal with$$\mu_{\overline X} = \mu \quad \text{and} \quad \sigma_{\overline X} = \frac{\sigma}{\sqrt{n}}$$
1. The CLT makes claims about the shape of the distribution of $\overline X$, its expected value and its standard deviation. The first claim is that the distribution of $\overline X$ is approximately bell-shaped. The second claim is that the mean of all possible sample means is equal to the population mean $\mu$. The third claim is that the standard deviation of the distribution of $\overline X$ is equal to the population standard deviation $\sigma$ divided by the square root of the sample size $n$. Thus with a larger sample size $n$ the standard deviation of the distribution of $\overline X$ is smaller and the values of $\overline X$ are closer to the population mean $\mu$.
2. The Central Limit Theorem is one of the most important theorems in statistics. It is the foundation for many statistical procedures. Much more detailed information on the Central Limit Theorem can be found in textbooks and online courses on mathematical statistics.
The population of common ladybugs has a mean size of $\mu = 7.4\ mm$ and a standard deviation of $\sigma = 1.5\ mm.$
a) Assuming that the population distribution is approximately normal, what is the probability that a randomly selected ladybug has a size smaller than $7\ mm?$
b) What is the probability that the mean size of a random sample of $50$ ladybugs is smaller than $7\ mm?$
Solution: a)$$P(X < 7) = P\left(\frac{X - \mu}{\sigma} < \frac{7 - 7.4}{1.5}\right) = P(Z < -0.27) = 0.3944$$b) The mean size of a random sample of $50$ ladybugs is a random variable $\overline X$ with a mean of $\mu_{\overline X} = 7.4\ mm$ and a standard deviation of $\sigma_{\overline X} = \frac{\sigma}{\sqrt{n}} = \frac{1.5}{\sqrt{50}} = 0.2121\ mm$.$$P(\overline X < 7) = P\left(Z < \frac{7 - 7.4}{0.2121}\right) = P(Z < -1.89) = 0.0292$$Notice that it is much less likely to find a sample mean of $7\ mm$ or less than to find a single ladybug of size $7\ mm$ or less.
The average height of females in Canada is $\mu = 163.8\ cm$ with a standard deviation of $\sigma = 7.1\ cm.$ Compute the $90$th percentiles for a single Canadian female and for the mean height of a random sample of $100.$
Solution: $P_{90} \rightarrow z = 1.28$. For a single female we have$$ X = \mu + z\sigma = 163.8 + 1.28\cdot 7.1 = 173.2\ cm$$For the mean height of a random sample of $100$$$ \overline X = \mu + z\frac{\sigma}{\sqrt{n}} = 163.8 + 1.28\cdot \frac{7.1}{\sqrt{100}} = 164.1\ cm$$