QUAN 2010 Two Dependent Samples

Section 12.3 Two Dependent Samples

Definition 12.3.1.

Recall that samples are dependent if there is some relationship whereby each value in one sample is paired with a corresponding value in the other sample. Therefore, a hypothesis test that uses dependent samples is sometimes called a matched pair test. We need to know the matched-pair difference for each pair, defined as

\begin{equation*} d = x_1-x_2, \end{equation*}

where \(x_1\) and \(x_2\) are the matched-pair values from Populations 1 and 2, respectively.

The mean, \(\overline{d}\text{,}\) and standard deviation, \(s_d\text{,}\) of these differences are defined by the formulas:

\begin{equation*} \overline{d}=\frac{\sum_{i=1}^n d_i}{n}\;\;\;\;\;\;\;\;\; s_d=\sqrt{\frac{\sum_{i=1}^n d_i^2 - \frac{\left( \sum_{i=1}^n d_i \right)^2}{n}}{n-1}} \end{equation*}

where

\(d_i\)	\(=\)	the \(i\)th matched-pair difference
\(n\)	\(=\)	the number of matched-pairs

Recall that we can also use the AVERAGE and STDEV.S Excel formulas to compute the mean and standard deviation when we have all of the matched-pair differences.

Next, we define the test statistic and confidence interval formulas for dependent (matched-pair) samples:

\begin{equation*} t_{\overline{x}}=\frac{\overline{d}-(\mu_d)_{H_0}}{\frac{s_d}{\sqrt{n}}}\;\;\;\;\;\;\; \overline{d}\pm t_{\alpha/2}\cdot\frac{s_d}{\sqrt{n}}, \end{equation*}

where \((\mu_d)_{H_0}=\text{ the population mean matched-pair difference from the null hypothesis}.\)

Again, we rely on Excel to do the computations whenever we have raw data. After stating the two hypotheses, go to the Data Analysis tool and choose “t-Test Paired Two-Sample for Means”.

Exercise 12.3.2.

(Donnelly 10.50)

Pfizer would like to test the effectiveness of a new cholesterol medication it has developed. To test the effectiveness, the LDL cholesterol level of 12 randomly selected individuals was measured before and after they took medication. The data in the Excel file below shows the LDL measurement levels.

external/sheets/LDL.xlsx

(a)

Perform a hypothesis test using \(\alpha=0.01\) to determine if the average LDL level is more than 50 points lower for patients who have taken the new medication.

Answer.

external/sheets/LDLSolution.xlsx

There is enough evidence to conclude that the LDL level is more than 50 points lower after taking the medication.

(b)

Construct a \(90\%\) confidence interval to estimate the average difference in LDL levels for people before and after they take the medication.

Answer.

external/sheets/LDLSolution.xlsx

The \(90\%\) confidence interval is \((62.39, 77.44)\text{.}\)