Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Measuring Central Tendency: Mean, Median, and Mode, Study notes of Statistics

The concept of measures of central tendency and provides examples of calculating the mean, median, and mode using data sets. It also discusses the importance of these measures in understanding statistical data and distinguishes between population and sample means.

Typology: Study notes

2021/2022

Uploaded on 09/12/2022

ekapad
ekapad 🇮🇳

5

(17)

266 documents

1 / 64

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Measures of central tendency
When thinking about questions such as: “how many
calories do I eat per day?” or “how much time do I spend
talking per day?, we quickly realize that the answer will
vary from day to day and often modify our question to
something like “how many calories do I consume on a
typical day?” or “on average, how much time do I spend
talking per day?”.
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19
pf1a
pf1b
pf1c
pf1d
pf1e
pf1f
pf20
pf21
pf22
pf23
pf24
pf25
pf26
pf27
pf28
pf29
pf2a
pf2b
pf2c
pf2d
pf2e
pf2f
pf30
pf31
pf32
pf33
pf34
pf35
pf36
pf37
pf38
pf39
pf3a
pf3b
pf3c
pf3d
pf3e
pf3f
pf40

Partial preview of the text

Download Measuring Central Tendency: Mean, Median, and Mode and more Study notes Statistics in PDF only on Docsity!

Measures of central tendency

When thinking about questions such as: “how many calories do I eat per day?” or “how much time do I spend talking per day?, we quickly realize that the answer will vary from day to day and often modify our question to something like “how many calories do I consume on a typical day?” or “on average, how much time do I spend talking per day?”.

Measures of central tendency

In this section we will study three ways of measuring central tendency in data, the mean, the median and the mode. Each measure give us a single value (the mode may give more than one) that might be considered typical. As we will see however, any one of these values can give us a skewed picture if the given data has certain characteristics.

Measures of central tendency

A sample is a subset of the population, for example, we might collect the data on the number of home runs scored in a random sample of 20 games played by Babe Ruth. If we calculate the mean, median and mode using the data from a sample, the results are called the sample mean, sample median and sample mode.

Measures of central tendency

A sample is a subset of the population, for example, we might collect the data on the number of home runs scored in a random sample of 20 games played by Babe Ruth. If we calculate the mean, median and mode using the data from a sample, the results are called the sample mean, sample median and sample mode. The Mean: The population mean of m numbers x 1 , x 2 ,... , xm (the data for every member of a population of size m) is denoted by μ and is computed as follows:

μ =

x 1 + x 2 + · · · + xm m

The sample mean of the numbers x 1 , x 2 ,... , xn (data for a sample of size n from the population) is denoted by ¯x and is computed similarly:

x¯ =

x 1 + x 2 + · · · + xn n

Measures of central tendency

Example Consider the following set of data, showing the number of times a sample of 5 students check their e-mail per day: 1 , 3 , 5 , 5 , 3.

Here n = 5 and x 1 = 1, x 2 = 3, x 3 = 5, x 4 = 5 and x 5 = 3.

Measures of central tendency

Example Consider the following set of data, showing the number of times a sample of 5 students check their e-mail per day: 1 , 3 , 5 , 5 , 3.

Here n = 5 and x 1 = 1, x 2 = 3, x 3 = 5, x 4 = 5 and x 5 = 3.

Calculate the sample mean ¯x. 1 + 3 + 5 + 5 + 3 5

Calculating the mean more efficiently:

We can calculate the mean above more efficiently here by using frequencies. We can see from the calculation above that

x¯ =

0 + (1 × 2) + (2 × 8) + (3 × 4) + (4 × 5)

The frequency distribution for the data is:

Books Frequency # Books ×

Frequency 0 1 0 × 1 1 2 1 × 2 2 8 2 × 8 3 4 3 × 4 4 5 4 × 5 ¯x = Sum 20 = 5020 = 2. 5

Calculating the mean more efficiently:

The general case can be dealt with as follows: If our frequency/relative frequency table for our sample of size n, looks like the one below, (where the observations are denoted 0i, the corresponding frequencies, fi and the relative frequencies fi/n):

Observation Frequency Relative Frequency (^0) i fi fi/n 01 f 1 f 1 /n 02 f 2 f 2 /n 03 f 3 f 3 /n .. .

(^0) R fR fR/n then,

Calculating the mean more efficiently:

Alternatively we can use the relative frequencies, instead of dividing by the n at the end.

Outcome Frequency Relative Frequency Outcome × Relative Frequency (^0) i fi fi/n (^0) i × fi/n 01 f 1 f 1 /n 01 × f 1 /n 02 f 2 f 2 /n 02 × f 2 /n 03 f 3 f 3 /n 03 × f 3 /n ... ... ... ... (^0) R .fR fR/n (^0) R × fR/n SUM = ¯x

You can of course choose any method for calculation from the three methods listed above.

Calculating the mean more efficiently:

Alternatively we can use the relative frequencies, instead of dividing by the n at the end.

Outcome Frequency Relative Frequency Outcome × Relative Frequency (^0) i fi fi/n (^0) i × fi/n 01 f 1 f 1 /n 01 × f 1 /n 02 f 2 f 2 /n 02 × f 2 /n 03 f 3 f 3 /n 03 × f 3 /n ... ... ... ... (^0) R .fR fR/n (^0) R × fR/n SUM = ¯x

You can of course choose any method for calculation from the three methods listed above.The easiest method to use will depend on how the data is presented.

Calculating the mean more efficiently:

Example The number of goals scored by the 32 teams in the 2014 world cup are shown below: 18 , 15 , 12 , 11 , 10 , 8 , 7 , 7 , 6 , 6 , 6 , 5 , 5 , 5 , 4 , 4 , 4 , 4 , 4 , 4 , 3 , 3 , 3 , 3 , 3 , 2 , 2 , 2 , 2 , 1 , 1 , 1 Make a frequency table for the data and taking the soccer teams who played in the world cup as a population, calculate the population mean, μ.

Outcome Frequency 1 3 2 4 3 5 4 6 5 3 6 3 7 2

Outcome Frequency 8 1 10 1 11 1 12 1 15 1 18 1 μ = 5. 34375

Calculating the mean more efficiently:

μ =^1 ·^ 3 + 2^ ·^ 4 + 3^ ·^ 5 + 4^ ·^ 6 + 5^ ·^ 3 + 6^ ·^ 3 + 7^ ·^ 2 + 8 32 ·^ 1 + 10^ ·^ 1 + 11^ ·^ 1 + 12^ ·^ 1 + 15^ ·^ 1 + 18^ ·^1

= 3 + 8 + 15 + 24 + 15 + 18 + 14 + 8 + 10 + 11 + 12 + 15 + 18 32 =^17132

Estimating the mean from a histogram

Example Approximate the mean for the set of data used to make the following histogram, showing the time (in seconds) spent waiting by a sample of customers at Gringotts Wizarding bank.

250 - 300 300 -^350

2

4

6

8

10

12

50 - 100 100 - 150 150 - 200 200 - 250 Time spent waiting (in seconds)

midpoints:

approximation of sample mean:

Estimating the mean from a histogram

midpoints:

Outcome Frequency 75 12 125 10 175 4 225 2 275 1 325 1 Sample size 30