Skip to main content

QUAN 2010 Notes Introduction to Business Statistics

Section 8.1 Sampling Distribution of the Mean

In addition to knowing how individual data values vary about the mean for a population, statisticians are interested in knowing how the means of samples of the same size taken from the same population vary about the population mean. (i.e., how do groups of data compare to the overall data?) This leads to arguably the most important topic in all of statistics.

Handout Preview Activity: Sampling Distributions drawing from a Normal Population

Before we start formally learning about sampling distributions, I’d like you to explore to see what happens when we take random samples from a population and calculate sample means.
Use the applet below to draw samples from a normal distribution and explore how sample means behave.
To change the sample size, enter β€œ\(n=\#\)” for the number you want to be the sample size.
(This applet was made in GeoGebra by Steve Phelps.)
  1. The Population:
  2. The Sample Means: Choose a sample size, use the applet to generate many samples, and record what you observe.
    Sample Size (\(n\)) Approximate Mean of Sample Means (\(\mu_{\bar{x}}\)) Approximate Standard Deviation of Sample Means (\(\sigma_{\bar{x}}\))
    \(n=4\)
    \(n=20\)
    \(n=35\)
  3. Compare:
    • How does the average of the sample means compare to the population mean?
    • How does the spread of the sample means change as \(n\) increases?
    • How does the shape of the distribution of sample means change as \(n\) increases?
  4. Your Rule:
    Based on your observations:
    • The mean of the sampling distribution of the sample mean is about:
    • The standard deviation of the sampling distribution of the sample mean is about:
    (We’ll check your β€œrules” together as a class and compare them with the exact formulas.)

Handout Preview Activity: Sampling Distributions drawing from a Population that is NOT Normal

Now go to the GeoGebra link below to draw samples from a distribution that is NOT normal and explore how the sample means behave.
Work through the same steps you did above to try to develop rules based on your observations.
  1. The Population:
  2. The Sample Means: Choose a sample size, use the applet to generate many samples, and record what you observe.
    Sample Size (\(n\)) Approximate Mean of Sample Means (\(\mu_{\bar{x}}\)) Approximate Standard Deviation of Sample Means (\(\sigma_{\bar{x}}\))
    \(n=4\)
    \(n=20\)
    \(n=35\)
  3. Compare:
    • How does the average of the sample means compare to the population mean?
    • How does the spread of the sample means change as \(n\) increases?
    • How does the shape of the distribution of sample means change as \(n\) increases?
  4. Your Rule:
    Based on your observations:
    • The mean of the sampling distribution of the sample mean is about:
    • The standard deviation of the sampling distribution of the sample mean is about:
    (We’ll check your β€œrules” together as a class and compare them with the exact formulas.)

Example 8.1.1.

We’re going to do an activity where we roll dice and think about the distributions of the averages of our rolls.
Use the Excel file below to record your rolls for this activity. external/sheets/DiceRollBlank.xlsx
(If you don’t have dice, you can use the tool below.)
Figure 8.1.2. (Made in GeoGebra by Duane Habecker)

(a)

Roll a die at least 20 times. Use the Excel file above to record what you get on each roll. Then use the Data Analysis β€œHistogram” tool to visualize the distribution of rolls with the β€œBins” \(1,2,3,4,5,6\text{.}\) (Before visualizing the rolls, think about what you expect the distribution to look like.)

(b)

Now roll 2 dice and in the next sheet of the Excel file, record those two rolls and then find the average of those two rolls. Repeat this at least 20 times. Use Excel to create a histogram of the averages.

(c)

Now roll 3 dice and in the next sheet of the Excel file, record those three rolls and then find the average of those three rolls. Repeat this at least 20 times. Use Excel to create a histogram of the averages.
Figure 8.1.3. 3Blue1Brown Video: β€œBut what is the Central Limit Theorem?”

Definition 8.1.4.

A sampling distribution of the mean is a distribution of the mean from numerous samples of the sample size. This distribution has mean \(\mu_{\overline{x}}\) and a standard error (i.e. the standard deviation of the sample means) \(\sigma_{\overline{x}}\text{.}\)
There are three important properties that describe the distribution of sample means:
  1. The sampling distribution of the mean of a random variable drawn from any population is approximately normal for sufficiently large sample size. The larger the sample size, the more closely the sample distribution resembles a normal distribution.
  2. The mean of the sample means will be the same as the population mean; that is,
    \begin{equation*} \mu_{\overline{x}}=\mu\text{.} \end{equation*}
  3. The standard deviation of the sample means will be smaller than the standard deviation of the population; specifically,
    \begin{equation*} \sigma_{\overline{x}}=\frac{\sigma}{\sqrt{n}}. \end{equation*}

Exercise 8.1.5.

(Donnelly 7.7)
For a population with a mean equal to 250 and a standard deviation equal to 25, calculate the standard error of the mean for the following sample sizes.

Exercise 8.1.6.

What can you conclude about the standard error as the sample size increases?
  • As the sample size increases, standard error decreases
  • As the sample size increases, standard error increases.
  • As the sample size increases, standard error stays the same.
  • As the sample size increases, the standard error might increase, decrease, or stay the same.