Prepare for your exams
Get points
Guidelines and tips

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search Store documents

The best documents sold by students who completed their studies

Search through all study resources

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

University Rankings

Discover the best universities in your country according to Docsity users

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

From our blog

Exams and Study

Go to the blog

Frequency Distributions and Graphs: Organizing and Representing Data, Slides of Statistics

University of Nebraska at Kearney Statistics

Instructions on how to organize data using frequency distributions and construct appropriate graphs to represent the data. Topics include constructing frequency distributions for categorical and grouped data, calculating relative frequencies and cumulative frequencies, and creating histograms, frequency polygons, ogives, Pareto charts, pie graphs, and scatter plots.

What you will learn

What is the difference between a grouped frequency distribution and an ungrouped frequency distribution?
What is the purpose of a cumulative frequency polygon (ogive)?
How do you construct a frequency distribution for categorical data?

How do you calculate the relative frequency of a class in a frequency distribution?
How do you construct a scatter plot for paired data?

Typology: Slides

2021/2022

Uploaded on 09/27/2022

amritay 🇺🇸

4.7

(14)

256 documents

1 / 11

This page cannot be seen from the preview

Don't miss anything!

1

C

CH

HA

AP

PT

TE

ER

R

2

Frequency Distributions and Graphs

O

Ob

bj

je

ec

ct

ti

iv

ve

es

s

• Organize data using frequency distributions.

• Represent data in frequency distributions graphically using histograms, frequency

polygons, and ogives.

• Represent data using Pareto charts, time series graphs, and pie graphs.

• Draw and interpret a stem and leaf plot.

• Draw and Interpret a scatter plot for a set of paired data.

This chapter will show how to organize data and then construct appropriate graphs to

represent the data in a concise, easy-to-understand form.

S

Se

ec

ct

ti

io

on

n

2

2-

-1

1

O

Or

rg

ga

an

ni

iz

zi

in

ng

g

D

Da

at

ta

a

B

Ba

as

si

ic

c

V

Vo

oc

ca

ab

bu

ul

la

ar

ry

y

• When data are collected in original form, they are called raw data.

• A frequency distribution is the organization of raw data in table form, using classes

and frequencies.

• The two most common distributions are categorical frequency distribution and the

grouped frequency distribution.

F

Fr

re

eq

qu

ue

en

nc

cy

y

D

Di

is

st

tr

ri

ib

bu

ut

ti

io

on

ns

s

Categorical Frequency Distributions count how many times each distinct category

has occurred and summarize the results in a table format

Example 1: Letter grades for Math 227 Spring 2005:

C A B C D F B B A C C F C

B D A C C C F C C A A C

a) Construct a frequency distribution for the categorical data.

b) What percentage of the students pass the class with the grade C or better?

Partial preview of the text

Download Frequency Distributions and Graphs: Organizing and Representing Data and more Slides Statistics in PDF only on Docsity!

CH CHAAPPTTEERR 2 2

Frequency Distributions and Graphs

Ob Objjeeccttiivveess

Organize data using frequency distributions.
Represent data in frequency distributions graphically using histograms, frequency polygons, and ogives.
Represent data using Pareto charts, time series graphs, and pie graphs.
Draw and interpret a stem and leaf plot.
Draw and Interpret a scatter plot for a set of paired data.

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form.

Se Seccttiioonn 2 (^2) - - 11 OOrrggaanniizziinngg DDaattaa

Ba Bassiicc VVooccaabbuullaarryy

When data are collected in original form, they are called raw data.
A frequency distribution is the organization of raw data in table form, using classes and frequencies.
The two most common distributions are categorical frequency distribution and the grouped frequency distribution.

Fr Freeqquueennccyy DDiissttrriibbuuttiioonnss Categorical Frequency Distributions count how many times each distinct category has occurred and summarize the results in a table format

Example 1: Letter grades for Math 227 Spring 2005: C A B C D F B B A C C F C B D A C C C F C C A A C

a) Construct a frequency distribution for the categorical data.

b) What percentage of the students pass the class with the grade C or better?

Fr Freeqquueennccyy DDiissttrriibbuuttiioonnss

Gr Groouuppeedd FFrreeqquueennccyy DDiissttrriibbuuttiioonnss--When the range of the data is large, the data must be grouped into classes that are more than one unit in width

Ba Bassiicc VVooccaabbuullaarryy The lower class limit represents the smallest value that can be included in the class.

The upper class limit represents the largest value that can be included in the class.

The class boundaries are used to separate the classes so that there are no gaps in the frequency distribution.

Cl Claassss BBoouunnddaarriieess SSiiggnniiffiiccaanntt FFiigguurreess

Rule of Thumb: Class limits should have the same decimal place value as the data, but the class boundaries have one additional place value and end in a 5.

e.g. data were whole numbers lower class boundary = lower class limit – 0. upper class boundary = upper class limit + 0.

e.g. data were one decimal place lower class boundary = lower class limit – 0. upper class boundary = upper class limit +0.

Cl Claassss MMiiddppooiinnttss The class midpoint (mark) is found by adding the lower and upper boundaries (or limits) and dividing by 2.

Cl Claassss WWiiddtthh The class width for a class in a frequency distribution is found by subtracting the lower (or upper) class limit of one class from the the lower (or upper) class limit of the next class.

Frequency Distributions An ungrouped frequency distribution is used for numerical data and when the range of data is small.

Example: The number of incoming telephone calls per day over the first 25 days of business:

4, 4, 1, 10, 12, 6, 4, 6, 9, 12, 12, 1, 1, 1, 12, 10, 4, 6, 4, 8, 8, 9, 8, 4, 1

Construct an ungrouped frequency distribution

T Tyyppeess ooff FFrreeqquueennccyy DDiissttrriibbuuttiioonnss (^) ((ssuummmmaarryy))

A ccaatteeggoorriiccaall ffrreeqquueennccyy ddiissttrriibbuuttiioonn is used when the data is nominal.

A grgroouuppeedd ffrreeqquueennccyy ddiissttrriibbuuttiioonn is used when the range is large and classes of several units in width are needed.
An ununggrroouuppeedd ffrreeqquueennccyy ddiissttrriibbuuttiioonn is used for numerical data and when the range of data is small.

Wh Whyy CCoonnssttrruucctt FFrreeqquueennccyy DDiissttrriibbuuttiioonnss??

To organize the data in a meaningful, intelligible way.
To enable the reader to make comparisons among different data sets.
To facilitate computational procedures for measures of average and spread.
To enable the reader to determine the nature or shape of the distribution.
To enable the researcher to draw charts and graphs for the presentation of data.

Se Seccttiioonn 2 (^2) - - 22 HHiissttooggrraamm,, FFrreqequueennccyy (^) PPoollyyggoonnss,, OOggiivveess

This section will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form.

Th Thee RRoollee ooff GGrraapphhss

The purpose of graphs in statistics is to convey the data to the viewer in pictorial form.
Graphs are useful in getting the audience’s attention in a publication or a presentation.

Th Thrreeee MMoosstt CCoommmmoonn GGrraapphhss

The histogram displays the data by using vertical bars of various heights to represent the frequencies.
The frequency polygon displays the data by using lines that connect points plotted for the frequencies at the midpoints of the classes.
The cumulative frequency polygon or ogive represents the cumulative frequencies for the classes in a frequency distribution.

d) Construct an ogive. (x-axis: class boundaries; y-axis: cumulative frequency)

e) Construct a (i) relative frequency histogram, (ii) relative frequency polygon, and (iii) relative cumulative frequency ogive.

Di Dissttrriibbuuttiioonn sshhaappeess

Se Seccttiioonn 2 (^2) - - 33 OOtthheerr TTyyppeess ooff GGrraapphhss

A Pareto chart is used to represent a frequency distribution for categorical variable, and the frequencies are displayed by the heights of vertical bars, which are arranged in order from highest to lowest. (x-axis: categorical variables; y-axis: frequencies, which are arranged in order from highest to lowest)

A pie graph is a circle that is divided into sections or wedges according to the percentage of frequencies in each category of the distribution.

Example 1: Grade received for Math 227 C A B B D C C C C B B A F F a) Construct a pareto chart.

b) Construct a pie chart.

Example 2 : Use the data in example 1 to construct a double stem and leaf plot. e.g. split each stem into two parts, with leaves 0 – 4 on one part and 5 – 9 on the other.

A stem-and-leaf plot portrays the shape of a distribution and maintains the original data values. It is also useful for spotting outliers. Outliers are data values that are extremely large or extremely small in comparison to the norm.

22 - - 44 PaPaiirreedd DDaattaa aanndd SSccaatttteerr PPlloottss

Many times researchers are interested in determining if a relationship between two variables exist.
To do this, the researcher collects data consisting of two measures that are paired with another.
The variable first mentioned is called the independent variable ; the second variable is the dependent variable.

Scatter Plot – is a graph of order pairs of data values that is used to determine if a relationship exists between two variables.

An Anaallyyzziinngg tthhee SSccaatttteerr PPlloott

A popossiittiivvee lliinneeaarr rreellaattiioonnsshhiipp exists when the points fall approximately in an ascending straight line and both the x and y values increase at the same time.
A neneggaattiivvee lliinneeaarr rreellaattiioonnsshhiipp exists when the points fall approximately in a straight line descending from left to right.
A nononnlliinneeaarr rreellaattiioonnsshhiipp exists when the points fall along a curve.
(^) NNoo rreellaattiioonnsshhiipp exists when there is no discernable pattern of the points.

Examples of Scatter Plots and Relationships

Example 1: A researcher wishes to determine if there is a relationship between the number of days an employee missed a year and the person’s age. Draw a scatter plot and comment on the nature of the relationship.

Age (x) 22 30 25 35 65 50 27 53 42 58

Days missed (y) 0 4 1 2 14 7 3 8 6 4

S Sumummmaarryy ooff GGrraapphhss aanndd UUsseess

• HHiissttooggrraammss ,, frfreeqquueennccyy ppoollyyggoonnss ,, and ogogiivveess are used when the data are contained in

a grouped frequency distribution.

• PPaarreettoo cchhaarrttss are used to show frequencies for nominal variables.

• TTiimmee sseerriieess ggrraapphhss are used to show a pattern or trend that occurs over time.

• PPiiee ggrraapphhss are used to show the relationship between the parts and the whole.

• When data are collected in pairs, the relationship, if one exists, can be determined by

looking at a ssccaatttteerr pplloott.

Co Conncclluussiioonnss

• Data can be organized in some meaningful way using frequency distributions. Once

the frequency distribution is constructed, the representation of the data by graphs is a simple task.