Prepare for your exams
Get points
Guidelines and tips

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search Store documents

The best documents sold by students who completed their studies

Search through all study resources

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

University Rankings

Discover the best universities in your country according to Docsity users

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

From our blog

Exams and Study

Go to the blog

The Central Limit Theorem: Understanding the Distribution of Sample Means, Exams of Statistics

University of Denver (DU)Statistics

The Central Limit Theorem explains how the distribution of sample means is related to the distribution of the population they are drawn from. the concept of the sampling distribution of the mean, its mean and standard deviation, and how it tends towards normality as the sample size increases.

What you will learn

How does the shape of the sampling distribution of the mean change with the sample size?
What is the mean and standard deviation of the sampling distribution of the mean?
What is the significance of the Central Limit Theorem in statistics?

How is the distribution of sample means related to the population distribution?
What is the Central Limit Theorem?

Typology: Exams

2021/2022

Uploaded on 09/27/2022

sadayappan 🇺🇸

4.5

(15)

246 documents

1 / 13

This page cannot be seen from the preview

Don't miss anything!

The Sampling Distribution of the Mean

January 9, 2021

Contents



The Central Limit Theorem



The sampling distribution of the mean of IQ scores



Example 1



Example 2



Example 3



Questions

Happy birthday to Jasmine Nichole Morales!

This tutorial should be easy to understand if you understand the z-table tutorial and the

normal distribution tutorial.

In science, we often want to estimate the mean of a population. But all we can typically do

is sample members of the population and calculate sample means. How well does a sample

mean represent the population mean?

The mean is an unbiased statistic, which means that on average a sample mean will be

equal to the population mean. Of course, any given sample mean will typically be different

from the population mean, but since it’s unbiased we can be sure that it won’t on average

be higher or lower than the population mean.

How close is a typical sample mean to the population mean? You probably have the intuition

that this answer depends on the size of the sample. The larger the sample, the more confident

you can be that your sample mean is close to the population mean.

The Central Limit Theorem

The Central Limit Theorem is a formal description of this intuition. It’s a theorem that

tells you about the distribution of sample means.

Let’s take a moment to think about the term ”distribution of sample means”. Every time

you draw a sample from a population, the mean of that sample will be different. Some

means will be more likely than other means. So it makes sense to think about means has

having their own distribution, which we call the sampling distribution of the mean.

The Central Limit Theorem tells us how the shape of the sampling distribution of the mean

relates to the distribution of the population that these means are drawn from.

1

Partial preview of the text

Download The Central Limit Theorem: Understanding the Distribution of Sample Means and more Exams Statistics in PDF only on Docsity!

The Sampling Distribution of the Mean

January 9, 2021

The mean is an unbiased statistic, which means that on average a sample mean will be equal to the population mean. Of course, any given sample mean will typically be different from the population mean, but since it’s unbiased we can be sure that it won’t on average be higher or lower than the population mean.

How close is a typical sample mean to the population mean? You probably have the intuition that this answer depends on the size of the sample. The larger the sample, the more confident you can be that your sample mean is close to the population mean.

The Central Limit Theorem

The Central Limit Theorem is a formal description of this intuition. It’s a theorem that tells you about the distribution of sample means.

Let’s take a moment to think about the term ”distribution of sample means”. Every time you draw a sample from a population, the mean of that sample will be different. Some means will be more likely than other means. So it makes sense to think about means has having their own distribution, which we call the sampling distribution of the mean. The Central Limit Theorem tells us how the shape of the sampling distribution of the mean relates to the distribution of the population that these means are drawn from.

To define some terms, if samples from a population are labeled with the variable X, we define the parameters of mean as μx and the standard deviation as σx. Remember, the greek letter is the parameter, and the subscript is the name of the thing that we’re talking about.

Now consider the sampling distribution of the mean. You know that sample means are written as ¯x. Using the same notation, the sampling distribution of the mean has its own mean, called μ¯x, and its own standard deviation, called σ¯x.

There are three parts to the Central Limit Theorem:

The sampling distribution of the mean will have the same mean as the population mean. Formally, we state: μ¯x = μx.

This just means what I said earlier, that the mean is unbiased, so that sample means will be, on average, equal to the population mean.

For a sample size n, the standard deviation of the sampling distribution of the mean will be σx¯ = √σx n

The name for σx¯ is sometimes shortened to the standard error of the mean, and some- times shortened even more to ’s.e.m.’.

This is a formalization of the intuition above. Since

n is in the denominator, it means that as your sample size gets bigger, the standard deviation of the distribution of means, σ¯x, gets smaller. So as you increase sample size, any given sample mean will be on average closer to the population mean.

The sampling distribution of the mean will tend to be close to normally distributed. Moreover, the sampling distribution of the mean will tend towards normality as (a) the population tends toward normality, and/or (b) the sample size increases.

This last part is the most remarkable. It means that even if the population is not normally distributed, the sampling distribution of the mean will be roughly normal if your sample size is large enough.

The Central Limit Theorem is powerful because, as we’ve learned from previous chapters, if you know that a distribution is normal, and you know its mean and standard deviation, then you know everything about this distribution.

This is why we’ve been doing all this work converting scores to z-scores and looking things up in Table A. In science, we typically grab samples and calculate means to get an estimate of the population mean. With the Central Limit Theorem, we can now say something about how close a given mean should be to the population mean.

The sampling distribution of the mean of IQ scores

Here is the distribution of IQ scores which are normally distributed with a mean μx of 100 and a standard deviation σx of 15:

points?

All we need to do is convert our score of 103 to a z score and use Table A. This is just like the problems that we did earlier, but we’re now talking about ¯x instead of X.

Remember to convert to a z score, we first subtract the mean and then divide by the standard deviation. But now, the mean is μx¯ and the standard deviation is σx¯.

z = Xσ−μx¯ ¯x = X−σxμx √ n

=^103 − 3 100 = 1

91 94 97 100 103 106 109 Mean IQ

area =0.

-3 -2 -1 0 1 2 3 z

area =0.

1

From table A, the area under the standard normal distribution above z = 1 is 0.1587. So the probability of obtaining a mean of 103 or more is 0.1587.

Example 2

Suppose you measure the heights of 100 randomly sampled women from a population that has a standard deviation of 64 inches and a standard deviation of 2.5 inches. For what mean height will 5% of the heights fall above?

We need to first find the z-score for which 5% of the area under the standard normal distribution lies above. Using table A, z = 1.64.

-3 -2 -1 0 1 2 3 z

area =0.

To convert this to the distribution of mean heights, we use the Central Limit Theorem. With a sample size of 100, we can assume that mean heights will be normally distributed with a mean of 64 and a standard deviation of σx¯ = √^2.^5 100

So our desired mean height will be:

x¯ = μx¯ + zσx¯ = μx + z √σx n

63.25 63.5 63.75 64 64.25 64.5 64. Mean Height (in)

area =0.

So we can conclude that 5% of the mean heights will fall above a mean of 64.41 inches.

μx + zσx¯ = 100+(1.96)(3.75) = 107.

In other words, for a sample size of 16, we can expect to obtain a mean IQ between 92. and 107.35 points 95% of the time.

Questions

Your turn again. Here are 10 questions and answers, including R commands. See how similar these problems are to those in the normal distribution tutorial. The main differences is that since talking about distributions of means, we’re using standard errors of the mean instead of standard deviations.

Suppose you draw 23 samples a population with mean of 32 and a standard deviation of 2.5. What is the probability of obtaining a mean of 33 or less?

Given μx = 32 and σx = 2.5, Find P r(¯x < 33)

σ¯x = √σx n

= √^2.^5

z = ¯x−σxμx¯ =^330. 5213 −^32 = 1. 9183

P r(z < 1 .9183) = 0. 9725

Answer: p = 0.

sem <- 2.5/sqrt(23) print(sem) [1] 0. p <- pnorm(33,32,sem) print(p) [1] 0.

Suppose you draw 21 samples a population with mean of 78 and a standard deviation of 8.8. What is the probability of obtaining a mean of 78.2 or more?

Given μx = 78 and σx = 8.8, Find P r(¯x > 78 .2)

σ¯x = √σx n

= √^8.^8

z = ¯x−σxμx¯ =^781 .. 92032 −^78 = 0. 1042

P r(z > 0 .1042) = 0. 4585

Answer: p = 0.

sem <- 8.8/sqrt(21) print(sem) [1] 1. p <- 1-pnorm(78.2,78,sem) print(p)

Suppose you draw 26 samples a population with mean of 13 and a standard deviation of 2.5. What range of means covers the middle 65.83 percent of means?

Given μx = 13, σx = 2.5, and p = 0. 6583

The area in each tail is:^1 −^0. 26583 = 0. 17085

P r(z > 0 .9508) = 0.17085, so z = − 0. 9508

σ¯x = √σx n

= √^2.^5

x¯ 1 = μx − (z)(σx¯) = 13 − (0.9508)(0.4903) = 12. 53

x¯ 2 = μx + (z)(σx¯) = 13 + (0.9508)(0.4903) = 13. 47

Answer: 65.83 percent of the means fall between 12.53 and 13.47.

sem <- 2.5/sqrt(26) print(sem) [1] 0. p <- (1-0.6583)/ print(p) [1] 0. x <- c(qnorm(p,13,sem),qnorm(1-p,13,sem)) print(x) [1] 12.53383 13.

Suppose you draw 24 samples a population with mean of 63 and a standard deviation of 8.7. What is the probability of obtaining a mean of 58.8 or less?

Given μx = 63 and σx = 8.7, Find P r(¯x < 58 .8)

σ¯x = √σx n

= √^8.^7

z = ¯x−σμx x¯

=^581 .. 77598 −^63 = − 2. 365

P r(z < − 2 .365) = 0. 009

Answer: p = 0.

sem <- 8.7/sqrt(24) print(sem) [1] 1. p <- pnorm(58.8,63,sem) print(p)

[1] 0.

Suppose you draw 23 samples a population with mean of 11 and a standard deviation of 5.8. What is the probability of obtaining a mean of 13.4 or more?

Given μx = 11 and σx = 5.8, Find P r(¯x > 13 .4)

σ¯x = √σx n

= √^5.^8

z = ¯x−σμx x¯

=^131 .. 20944 −^11 = 1. 9845

P r(z > 1 .9845) = 0. 0236

Answer: p = 0.

sem <- 5.8/sqrt(23) print(sem) [1] 1. p <- 1-pnorm(13.4,11,sem) print(p) [1] 0.

Suppose you draw 21 samples a population with mean of 17 and a standard deviation of 6.2. For what mean do 81.24 percent of the sample means fall below?

Given μx = 17 and σx = 6.2, find ¯x so that P r(¯x < μx) = 0. 8124

P r(z < 0 .8869) = 0.8124, so z = 0. 8869

σ¯x = √σx n

= √^6.^2

x¯ = μx + (z)(σ¯x) = 17 + (0.8869)(1.353) = 18. 2

Answer: ¯x = 18. 2

sem <- 6.2/sqrt(21) print(sem) [1] 1. x <- qnorm(0.8124,17,sem) print(x) [1] 18.

Mean IQ

[1] 102.9669 103.
- 88.75 92.5 96.25 100 103.75 107.5 111.
- area =0.
  - - area =0.

The Central Limit Theorem: Understanding the Distribution of Sample Means, Exams of Statistics

Related documents

Partial preview of the text

Download The Central Limit Theorem: Understanding the Distribution of Sample Means and more Exams Statistics in PDF only on Docsity!

The Sampling Distribution of the Mean

January 9, 2021

Contents

The Central Limit Theorem

The sampling distribution of the mean of IQ scores

=^103 − 3 100 = 1

Example 2

Questions

= √^2.^5

= √^8.^8

= √^2.^5

= √^8.^7

=^581 .. 77598 −^63 = − 2. 365

[1] 0.

= √^5.^8

=^131 .. 20944 −^11 = 1. 9845

= √^6.^2

Mean IQ