Prepare for your exams
Get points
Guidelines and tips

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search Store documents

The best documents sold by students who completed their studies

Search through all study resources

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

University Rankings

Discover the best universities in your country according to Docsity users

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

From our blog

Exams and Study

Go to the blog

How are estimated marginal means calculated?, Exams of Business Statistics

University of Puget Sound (UPS)Business Statistics

One approach to understand these estimates is to calculate the estimated marginal means (sometimes referred to as least square means, predicted means, or ...

Typology: Exams

2021/2022

Uploaded on 09/27/2022

weldon 🇺🇸

4.5

(10)

223 documents

1 / 4

This page cannot be seen from the preview

Don't miss anything!

Cornell Statistical Consulting Unit

How are estimated marginal means calculated?

Statnews #93

Created winter 2018. Last updated August 2020

Introduction

In a linear model with categorical variables, the table of model parameter estimates can be

difficult to interpret. One approach to understand these estimates is to calculate the estimated

marginal means (sometimes referred to as least square means, predicted means, or expected

means). Most statistical software packages offer procedures to obtain predictions of the response

variable for the different levels of categorical variables after fitting linear models. However,

these procedures should be used carefully as the results obtained can be very different depending

on the statistical software package used.

Example

Consider a simulated dataset containing information about employees of a company, with

information on their salary, age, gender, and job category. The continuous variables are

summarized in Table 1; the categorical variables are summarized in Tables 2 and 3.

Table 1: Mean and standard deviation of the continuous variables in the employee dataset

Mean

SD

Salary

6806.43

3148.26

Age

39.16

45.71

Table 2: Summary of the job category variable

Values

Count

Proportion

0 (clerical)

227

0.479

1 (trainee)

168

0.354

2 (security)

32

0.068

3 (technical)

47

0.099

Table 3: Summary of the gender variable

Values

Count

Proportion

0 (male)

258

0.544

1 (female)

216

0.456

Partial preview of the text

Download How are estimated marginal means calculated? and more Exams Business Statistics in PDF only on Docsity!

How are estimated marginal means calculated?

Statnews

Created winter 2018. Last updated August 2020

Introduction

In a linear model with categorical variables, the table of model parameter estimates can be difficult to interpret. One approach to understand these estimates is to calculate the estimated marginal means (sometimes referred to as least square means, predicted means, or expected means). Most statistical software packages offer procedures to obtain predictions of the response variable for the different levels of categorical variables after fitting linear models. However, these procedures should be used carefully as the results obtained can be very different depending on the statistical software package used.

Example

Consider a simulated dataset containing information about employees of a company, with information on their salary, age, gender, and job category. The continuous variables are summarized in Table 1; the categorical variables are summarized in Tables 2 and 3. Table 1: Mean and standard deviation of the continuous variables in the employee dataset Mean SD Salary 6806.43 3148. Age 39.16 45. Table 2: Summary of the job category variable Values Count Proportion 0 (clerical) 227 0. 1 (trainee) 168 0. 2 (security) 32 0. 3 (technical) 47 0. Table 3: Summary of the gender variable Values Count Proportion 0 (male) 258 0. 1 (female) 216 0.

In this newsletter, we will investigate the relationship between salary and gender controlling for job category and age. Table 4 contains the results of a linear model with salary as the dependent variable with gender, job category, and age as predictor variables. Note that in our example, we are applying dummy coding for categorical variables; we are considering the reference level to be the lowest level of these categorical variable (i.e. male (0) for gender and clerical (0) for job category). For more information about dummy coding, please refer to our Dummy and Effect Coding Newsletter (statnews #72). Table 4: Linear model summary with salary as the response and age, gender, job as predictors. Coefficient Estimate SE p-value Intercept (𝛽 0 ) 6963.73^ 235.94^ <0. Gender: female (𝛽 1 ) - 2456.7 240.92 <0. Age (𝛽 2 ) 0.81 2.52 0. Job: trainee (𝛽 3 ) 1302.53^ 254.02^ <0. Job: security (𝛽 4 ) 167.83 481.22 0. Job: technical (𝛽 5 ) 4613.43 407.11 <0. Coefficients obtained from the linear model are used to estimated marginal means. For gender , our independent variable of interest, 0 represents a male subject while as 1 represents female subject. But what values are used for the other variables in the model: age and job category? For continuous variables like age , marginal means procedures typically substitute the overall mean values for calculations (unless the user specifies otherwise); in our example, 39.16 is the average age. For categorical variables, some software packages calculate marginal means as if the data is from a balanced population, while others assume an unbalanced population. The term “balanced population” means that the sample is uniformly split across the different bins of the categorical variable; in terms of the 4-valued categorical variable job category , that would mean that 25 percent of the population falls into each bin. Thus, the predicted salary values obtained for each job category would be weighted equally when calculating the marginal mean for each gender. For an unbalanced population, the predicted salaries would be weighted according to the distribution of jobs in the data (see proportions in Table 2). We see that our data is not balanced in terms of the job category variable. The job category percentages range from 6.75 to 47.89 percent in the sample. Below we show how different software packages treat this categorical variable when calculating marginal means—specifically, whether they assume a balanced or unbalanced population.

Balanced Estimated Marginal Means

In R, SAS, SPSS, and JMP, the marginal means procedure by default assumes a balanced population. To see this, we first calculate marginal means for each job category, for both male and female employees. We take the linear model equation and use the coefficients from Table 4, along with the appropriate values for gender (0 for males, 1 for females), age (the mean value, 39.16, and job category (1 for the indicated job, 0 for the others). For example, a female trainee’s predicted salary would be calculated as follows:

These are the marginal means computed by Stata. The same marginal means can be obtained directly from the coefficients of the linear equation by replacing each job category dummy variable with its corresponding proportion in our sample. For males, we have

7 + 0 × (− 2456. 7 ) + 39. 15 × ( 0. 81 ) + 1302. 5 × 0. 354 + 167. 8 × 0. 068

( 4613. 4 ) × 0. 099 = 7925. 9 , and for females, we have

7 + 1 × (− 2456. 7 ) + 39. 15 × ( 0. 81 ) + 1302. 5 × 0. 354 + 167. 8 × 0. 068

( 4613. 4 ) × 0. 099 = 5469. 2. We see that marginal means in Stata assumes an unbalanced population using the distribution of the sample by default. However, by using the option asbalanced , Stata’s margins command can replicate the behavior of other software packages and compute balanced marginal means. Table 6 summarizes these findings. Table 6: Commands to compute estimated marginal means in each software package. Software Treatment of Categorical Variables Command R Balanced (default) emmeans() R Unbalanced emmeans (... , weights="proportional") SAS Balanced (default) lsmeans SAS Unbalanced lsmeans ... /om JMP Balanced Analyze, Fit Model, Effect Details SPSS Balanced EMMEANS Stata Unbalanced (default) margins Stata Balanced margins..., asbalanced For more information on how to use these methods, see also our handout on Post-hoc Analyses. If you need assistance with estimated marginal means or have any other statistical consulting questions, please feel free to contact the statistical consultants at CSCU. References

Margins Manual from Stata: https://www.stata.com/manuals13/rmargins.pdf
LSMeans from SAS: https://documentation.sas.com/?docsetId=statug&docsetTarget=statug_glm_syntax10.htm& docsetVersion=15.1&locale=en
Emmeans package for R: https://cran.r-project.org/web/packages/emmeans/index.html
Emmeans in SPSS: https://www.ibm.com/support/knowledgecenter/SSLVMB_24.0.0/spss/advanced/syn_mixe d_emmeans.html
LSMeans in JMP: https://www.jmp.com/support/help/en/15.0/index.shtml#page/jmp/effect- details.shtml Author: Michael Ko

How are estimated marginal means calculated?, Exams of Business Statistics

Related documents

Partial preview of the text

Download How are estimated marginal means calculated? and more Exams Business Statistics in PDF only on Docsity!

How are estimated marginal means calculated?

Statnews

Introduction

Example

Balanced Estimated Marginal Means