Skip to main content

Section 3.1 Measures of Central Tendency

In statistics, “average” or “typical” values are known as measures of central tendency.

Definition 3.1.1.

The mean (\(\mu\) or \(\overline{x}\)) is the most common measure of central tendency and is calculated by summing all the data points and dividing by the size of the data set.
\begin{equation*} \boxed{ \text{Excel formula: AVERAGE} } \end{equation*}
The mean is the sum of the data items divided by the number of items:
\begin{equation*} \text{Mean}=\frac{\sum x}{n}, \end{equation*}
where \(\sum x\) represents the sum of all the data items and \(n\) represents the numbef of items.
  • The mean of a sample is symbolized by \(\overline{x}\text{.}\)
  • The mean of an entire population is symbolized by \(\mu\text{.}\)

Exercise 3.1.2.

The Excel file below has the earnings for the top 10 highest paid actors in 2023. (The data is taken from Yahoo
 1 
finance.yahoo.com/news/10-highest-paid-actors-2023-185921075.html
.)
Find the mean earnings, in millions of dollars, for the ten highest-earning actors.
Answer.
\(=AVERAGE(B2:B11)=44.9\) million dollars

Definition 3.1.3.

The median is the value in the data set for which half of the observations are higher and half are lower.
\begin{equation*} \boxed{ \text{Excel formula: MEDIAN} } \end{equation*}
To find the median of a group of data items,
  1. Arrange the data items in order, from smallest to largest.
  2. If the number of data items is odd, the median is the data item in the middle of the list.
  3. If the number of data items is even, the median is the mean of the two middle data items.

Exercise 3.1.4.

(a)

\(84,90,98,95,88\)
Answer.
data items in order: \(84,88,90,95,95\)
median: \(90\)

(b)

\(68,74,7,13,15,25,28,59,34,47\)
Answer.
data items in order: \(7,13,15,25,28,34,47,59,68,74\)
median: \(\frac{28+34}{2}=\frac{62}{2}=31\)

Exercise 3.1.5.

The Excel file below has the earnings for the top 10 highest paid actors in 2023. (The data is taken from Yahoo
 2 
finance.yahoo.com/news/10-highest-paid-actors-2023-185921075.html
.)
Find the median earnings, in millions of dollars, for the ten highest-earning actors.
Is the median different thant the mean you found in Exercise 3.1.2, and if so, why?
Answer.
\begin{equation*} \text{median}=MEDIAN(B2:B11)=42.5. \end{equation*}
The median is less than the mean that we found in Exercise 3.1.2.
Figure 3.1.6. Skewed Distribution, Mean, and Median (Made in GeoGebra by Casper Buisman)
Figure 3.1.7. Left-skewed
Figure 3.1.8. Symmetric
Figure 3.1.9. Right-skewed

Exercise 3.1.10.

How are the mean and median related to each other in each of the following frequency distribution shapes?

(a)

symmetric
Answer.
the mean and median are approximately equal

(b)

left-skewed
Answer.
the median is greater than the mean; potential low outliers pull the mean left

(c)

right-skewed
Answer.
the median is less than the mean; potential high outliers pull the mean right

Question 3.1.11.

Statisticians generally use the median instead of the mean when reporting income. Why do you think this is?

Exercise 3.1.12.

Five employees in the assembly section of a television manufacturing company earn salaries of \(\$ 19700, \$ 20400, \$ 21500, \$ 22600,\) and \(\$ 23000\) annually. The section manager has an annual salary of \(\$ 95000\text{.}\)

(a)

Find the median annual salary for the six people.
Answer.
\(\text{median}=\frac{21500+22600}{2}=\$ 22,050\)

(b)

Find the mean annual salary for the six people.
Answer.
\(\text{mean}=\frac{19700+20400+21500+22600+23000+95000}{6}=\$33,700\)

Definition 3.1.13.

An outlier is a value that is much higher or lower than most of the data.
(We’ll have a more formal/specific definition of an outlier later.)

Definition 3.1.14.

The mode is the data value that occurs most often in a data set. If more than one data value has the highest frequency, then each of these data values is a mode. If there is no data value that occurs most often, then the data set has no mode.

Definition 3.1.15.

If a dataset is bimodal if it has two modes.

Exercise 3.1.16.

Find the mode for each of the following groups of data:

(a)

7, 2, 4, 7, 8, 10
Answer.
mode: 7

(b)

2, 1, 4, 5, 3
Answer.
There is no mode since no number occurs more than once.

(c)

3, 3, 4, 5, 6, 6
Answer.
The modes are 3 and 6. (The data set is bimodal.)

Exercise 3.1.17.

Consider the following data values:
\begin{equation*} 5, 13, 18, 2, 16, 18, 5, 20 \end{equation*}
Calculate the mean, median, and mode, and then describe the shape of the distribution.
Answer.
\(\text{mean}=\frac{5+13+18+2+16+18+5+20}{8}=\frac{97}{8}=12.125\)
data items in order: \(2,5,5,13,16,18,18,20\)
\(\text{median}=\frac{13+16}{2}=14.5\)
modes: \(5,18\)
The distribution is left-skewed.

Exercise 3.1.18.

The data in the Excel file below lists daily Dow Jones Industrial Average values from June 2019 through May 2024.

(a)

Calculate the mean, median, and mode(s) (using Excel).
Answer.
  • mean: \(=AVERAGE(B12:B1270)\approx 31946.76\)
  • median: \(=MEDIAN(B12:B1270)\approx 33072.88\)
  • modes: \(27691.49, 34798, 32798.4, 33309.51\)

(b)

Describe the shape of the distribution.
Answer.
The distribution is left-skewed.