The Sampling Distribution of the Proportion

Section 8.3 The Sampling Distribution of the Proportion

So far, the focus in this chapter has been on the distribution of sample means. However, sometimes we deal with business scenarios where we are counting observations in a sample and in this case the sample proportion (or percentage), \(\overline{p}\text{,}\) is the statistic that is relevant rather than the sample mean, \(\overline{x}\text{.}\)

🔗

Handout Preview Activity: Sampling Distribution of the Proportion

Now go to the GeoGebra link below to investigate the sampling distribution of a proportion.

🔗

Link to GeoGebra: https://www.geogebra.org/m/srwskfpv

🔗

Try to develop rules based on your observations.

🔗

Exploration:
- Experiment 1: Varying Sample Size with \(p=0.5\text{.}\) Set the population proportion to \(p=0.5\text{.}\) Try several different sample sizes and run as many trials as is helpful to see the shape of the distribution. (I recommend somewhere from 100 to 300 trials.)
  
  \(p\) \(n\) Shape of Distribution Center (around what value?) Spread (wide/narrow)
  
  \(0.5\) \(10\)
  
  \(0.5\) \(30\)
  
  \(0.5\) \(50\)
  
  \(0.5\) \(100\)
  
  🔗
  
  🔗
- Experiment 2: Varying Proportion with \(n\) Fixed. Now fix \(n=50\text{.}\) Try several different values of \(p\) and run as many trials as is helpful to see the shape of the distribution. (I recommend somewhere from 100 to 300 trials.)
  
  \(p\) \(n\) Shape of Distribution Center (around what value?) Spread (wide/narrow)
  
  \(0.1\) \(50\)
  
  \(0.3\) \(50\)
  
  \(0.5\) \(50\)
  
  \(0.7\) \(50\)
  
  \(0.9\) \(50\)
  
  🔗
  
  🔗
- Experiment 3: Checking the Rule of Thumb. For each combination you tried above, check whether the conditions below hold:
  
  \begin{equation*} np\geq 5\text{ and } n(1-p)\geq 5 \end{equation*}
  
  \(p\) \(n\) \(np\) \(n(1-p)\) Did it look normal in the applet?
  
  0.5 10
  
  0.5 30
  
  0.5 50
  
  0.5 100
  
  0.1 50
  
  0.3 50
  
  0.5 50
  
  0.7 50
  
  0.9 50
  
  🔗
  
  🔗
🔗
🔗

🔗

Analysis Questions:
1. For \(p=0.5\text{,}\) how does the shape of the sampling distribution change as \(n\) increases?
  
  🔗
  
  🔗
2. How does the spread (variability) of the distribution change as \(n\) increases?
  
  🔗
  
  🔗
3. For a fixed \(n=50\text{,}\) how does changing \(p\) affect the distribution?
  
  🔗
  
  🔗
4. Compare your observations with the rule of thumb in Experiment 3 -- did the distribution look normal when those conditions were met?
  
  🔗
  
  🔗
🔗
🔗

🔗

Activity 8.3.1.

We’re going to get in groups, and in each group, you’ll roll a die \(30\) times. (Use this Excel document to keep track of your rolls: external/sheets/DiceSamplingProportion.xlsx ) Count how many times you roll a “1” and use that to calculate the proportion of the 30 rolls for which the result was a “1”. After the whole class finishes finding this proportion, we’re going to draw a histogram of all the sample proportions. Before doing this, though, think about the following questions:

🔗

(a)

What should the distribution of your individual rolls look like (approximately)?

🔗

(b)

What do you think the approximate shape of the histogram of all sample proportions will be? (Will it be skewed left, uniform, normal/bell-shaped, or skewed right?)

🔗

(c)

What do you expect the approximate mean of the distribution of sample proportions to be? (Will it be \(1/100\text{,}\) \(1/6\text{,}\) \(6/100\text{,}\) or \(6\text{?}\))

🔗

Definition 8.3.1.

The sampling distribution of the proportion describes the pattern that the sample proportions tend to follow when randomly drawn from a population. This distribution has a mean, \(p\text{,}\) and a standard error (i.e. the standard deviation of the sample proportions), \(\sigma_p=\sqrt{\frac{p(1-p)}{n}}\text{.}\) (The sample proportion is \(\overline{p}=\frac{x}{n}\text{.}\))

🔗

Exercise 8.3.2.

(Donnelly 7.25)

🔗

For a population with a proportion equal to 0.32, calculate the standard error of the proportion for the following sample sizes.

🔗

(a)

\(n=35\)

🔗

Answer.

\begin{equation*} \sigma_p = \sqrt{\frac{0.32(1-0.32)}{35}}\approx 0.0788 \end{equation*}

🔗

(b)

\(n=70\)

🔗

Answer.

\begin{equation*} \sigma_p = \sqrt{\frac{0.32(1-0.32)}{70}}\approx 0.0558 \end{equation*}

🔗

(c)

\(n=105\)

🔗

Answer.

\begin{equation*} \sigma_p = \sqrt{\frac{0.32(1-0.32)}{105}}\approx 0.0455 \end{equation*}

🔗

Exercise 8.3.3.

What can you conclude about the standard error as the sample size increases?

🔗

As the sample size increases, standard error decreases
🔗
As the sample size increases, standard error increases.
🔗
As the sample size increases, standard error stays the same.
🔗
As the sample size increases, the standard error might increase, decrease, or stay the same.
🔗

🔗

Exercise 8.3.4.

(Donnelly 7.32)

🔗

A social media survey found that \(69\%\) of parents “follow” their children on Instagram. A random sample of 140 parents was selected.

🔗

(a)

Calculate the standard error of the proportion.

🔗

Answer.

\begin{equation*} \sigma_p=\sqrt{\frac{0.69(1-0.69)}{140}}\approx 0.0391 \end{equation*}

🔗

(b)

What is the probability that 104 or more parents from this sample “follow” their children on Instagram?

🔗

Answer.

\begin{equation*} P\left(\overline{p}\geq \frac{104}{140}\right) = 1-P\left(\overline{p}\lt \frac{104}{140}\right)\approx 0.0881 \end{equation*}

🔗

(c)

What is the probability that between 97 and 104 parents from this sample “follow” their children on Instagram?

🔗

Answer.

\begin{equation*} P\left(\frac{97}{140}\lt \overline{p}\lt \frac{104}{140}\right) \approx 0.3827 \end{equation*}

🔗

(d)

If 81 parents responded that the “follow” their children on Instagram, does this result support the findings of the social media survey?

🔗

Answer.

\begin{equation*} P\left(\overline{p}\leq \frac{81}{140}\right) \approx 0.0022 \end{equation*}

Since \(0.0022\lt 0.05 \) (less than \(5\%\)), the sample with only 81 parents answering “yes” is unusual. Therefore, the survey results are questionable.

🔗

Prev Top Next

\(p\)	\(n\)	Shape of Distribution	Center (around what value?)	Spread (wide/narrow)
\(0.5\)	\(10\)
\(0.5\)	\(30\)
\(0.5\)	\(50\)
\(0.5\)	\(100\)

\(p\)	\(n\)	\(np\)	\(n(1-p)\)	Did it look normal in the applet?
0.5	10
0.5	30
0.5	50
0.5	100
0.1	50
0.3	50
0.5	50
0.7	50
0.9	50

Section 8.3 The Sampling Distribution of the Proportion

Handout Preview Activity: Sampling Distribution of the Proportion

Activity 8.3.1.

(a)

(b)

(c)

Definition 8.3.1.

Exercise 8.3.2.

(a)

(b)

(c)

Exercise 8.3.3.

Exercise 8.3.4.

(a)

(b)

(c)

(d)

Handout Preview Activity: Sampling Distribution of the Proportion