Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Statistical Package for Social Sciences for Beginners - Study Guide | GEOG 547, Exams of Geography

Material Type: Exam; Class: Spatial Analysis; Subject: Geography; University: University of Wisconsin - Milwaukee; Term: Fall 1999;

Typology: Exams

Pre 2010

Uploaded on 09/02/2009

koofers-user-9x0
koofers-user-9x0 🇺🇸

10 documents

1 / 437

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
SPSS for
SPSS for
Beginners
Beginners
A book designed for students from a non-math, non-technology
background
A web site (spss.org) is dedicated to supporting readers of this book
If you don’t find some topic in this book, tell us and we will make it
available for free download or email it to your desktop
A Vijay Gupta Publication
SPSS for Beginners Vijay Gupta 1999. All rights reside with author.
SPSS for Beginners Http://www.spss.org
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19
pf1a
pf1b
pf1c
pf1d
pf1e
pf1f
pf20
pf21
pf22
pf23
pf24
pf25
pf26
pf27
pf28
pf29
pf2a
pf2b
pf2c
pf2d
pf2e
pf2f
pf30
pf31
pf32
pf33
pf34
pf35
pf36
pf37
pf38
pf39
pf3a
pf3b
pf3c
pf3d
pf3e
pf3f
pf40
pf41
pf42
pf43
pf44
pf45
pf46
pf47
pf48
pf49
pf4a
pf4b
pf4c
pf4d
pf4e
pf4f
pf50
pf51
pf52
pf53
pf54
pf55
pf56
pf57
pf58
pf59
pf5a
pf5b
pf5c
pf5d
pf5e
pf5f
pf60
pf61
pf62
pf63
pf64

Partial preview of the text

Download Statistical Package for Social Sciences for Beginners - Study Guide | GEOG 547 and more Exams Geography in PDF only on Docsity!

SPSS forSPSS for

Beginners Beginners

 A book designed for students from a non-math, non-technology

background

 A web site (spss.org) is dedicated to supporting readers of this book

 If you don’t find some topic in this book, tell us and we will make it

available for free download or email it to your desktop

A Vijay Gupta Publication

SPSS for Beginners  Vijay Gupta 1999. All rights reside with author.

SPSS for Beginners

Copyright © 1999 Vijay Gupta

Published by Vijay Gupta

All rights reserved. No part of this book may be used or reproduced in any form or by any

means, or stored in a database or retrieval system, without prior written permission of the

publisher except in the case of brief quotations embodied in reviews, articles, and research

papers. Making copies of any part of this book for any purpose other than personal use is a

violation of United States and international copyright laws. For information contact Vijay

Gupta at vgupta1000@aol.com.

You can reach the author at vgupta1000@aol.com. The author welcomes feedback but will not

act as a help desk for the SPSS program.

Library of Congress Catalog No.: Pending

ISBN: Pending

First year of printing: 1999

Date of this copy: Aug 26, 1999

This book is sold as is, without warranty of any kind, either express or implied, respecting the

contents of this book, including but not limited to implied warranties for the book's quality,

performance, merchantability, or fitness for any particular purpose. Neither the author, the

publisher and its dealers, nor distributors shall be liable to the purchaser or any other person or

entity with respect to any liability, loss, or damage caused or alleged to be caused directly or

indirectly by the book.

This book is based on SPSS versions 7.x through 10.0. SPSS is a registered trademark of SPSS

Inc.

Publisher : Vijay Gupta

Editor : Vijay Gupta

Author : Vijay Gupta

Acknowledgments

To SPSS Inc, for their permission to use screen shots of SPSS.

To the brave souls who have to learn statistics!

Dedication

To my Grandmother, the late Mrs. Indubala Sukhadia, member of India's Parliament. The

greatest person I will ever know. A lady with more fierce courage, radiant dignity, and

leadership and mentoring abilities than any other.

Any Feedback is Welcome

You can e-mail Vijay Gupta at vgupta1000@aol.com.

5. 5 Comparing the Means and Distributions of Sub-Groups of a Variable - Error

  1. 3 The Runs Test - Determining Whether a Variable is Randomly Distributed..........................14-

To take quizzes on topics within each chapter go to

  • Table of Contents Table of Contents- - 4. 3 Comparing Means and Distributions.....................................................................................4-
      1. Multivariate Statistics................................................................................................................5-
        1. 1 Graphs....................................................................................................................................5-
        1. 2 Scatters...................................................................................................................................5-
        1. 3 Correlations...........................................................................................................................5-
        1. 4 Conducting Several Bivariate Explorations Simultaneously.................................................5-
      • Bar, T-Test, ANOVA, and Non-parametric Tests..............................................................................5-
      1. Tables...........................................................................................................................................6-
        1. 1 Tables for Statistical Attributes.............................................................................................6-
        1. 2 Tables of Frequencies............................................................................................6-
      1. Linear Regression.........................................................................................................................7-
        1. 1 Linear Regression....................................................................................................................7.
        1. 2 Interpretation of Regression Results........................................................................................7-
        1. 3 Problems Caused by the Breakdown of Classical Assumptions...........................................7-
        1. 4 Diagnostics............................................................................................................................7-
        1. 5 Formally Testing for Heteroskedasticity: White’s Test........................................................7-
      1. Correcting for Breakdown of Classical Assumptions...............................................................8-
        1. 1 Correcting for Collinearity......................................................................................................8-
        1. 2 Correcting for Heteroskedasticity............................................................................................8-
        1. 3 Correcting for Incorrect Functional Form.............................................................................8-
        1. 4 Correcting for Simultaneity Bias: 2SLS................................................................................8-
        1. 5 Correcting for other Breakdowns..........................................................................................8-
      1. MLE: Logit and Non-linear Regression.....................................................................................9-
        1. 1 Logit.........................................................................................................................................9-
        1. 1 Non-linear Regression.............................................................................................................9-
      1. Comparative Analysis................................................................................................................10-
        1. 1 Using Split File to Compare Results.....................................................................................10-
      1. Formatting and Editing Output................................................................................................11-
        1. 1 Formatting and Editing Tables..............................................................................................11-
        1. 2 Formatting and Editing Charts............................................................................................11-
      1. Reading ASCII Text Data..........................................................................................................12-
        1. 1 Understanding ASCII Text Data...........................................................................................12-
        1. 2 Reading Data Stored in ASCII Tab-delimited Format..........................................................12-
  • Table of Contents Table of Contents- - delimited.............................................................................................................................................12- 12. 3 Reading Data Stored in ASCII Delimited (or Freefield) Format other than Tab- - 12. 4 Reading Data Stored in Fixed Width (or Column) Format ..............................……………12-
      1. Merging: Adding Cases & Variables........................................................................................13-
        1. 1 Adding New Observations.....................................................................................................13-
        1. 2 Adding New Variables (Merging).........................................................................................13-
      1. Non-parametric Testing.............................................................................................................14-
        1. 1 Binomial Test........................................................................................................................14-
        1. 2 Chi-square..............................................................................................................................14-
      1. Setting System Defaults..............................................................................................................15-
        1. 1 General Settings.....................................................................................................................15-
        1. 2 Choosing the Default View of the Data and Screen..............................................................15-
      1. Reading Data from Database Formats.....................................................................................16-
      1. Time Series Analysis...................................................................................................................17-
        1. 1 Sequence Charts (Line Charts with Time on the X-Axis).....................................................17-
        1. 2 Checking for Unit Roots / Non-stationarity (PACF)...........................................................17-
        1. 3 Determining Lagged Effects of other Variables (CCF).......................................................17-
      • Lag, etc. ...........................................................................................................................................17- 17. 4 Creating New Variables (Using Time Series Specific Formulae: Difference,
        1. 5 ARIMA................................................................................................................................17-
      • (AUTOREGRESSION)....................................................................................................................17- 17. 6 Correcting for First-order Autocorrelation Among Residuals
        1. 7 Co-integration......................................................................................................................17-
  • Table of Contents Table of Contents-
    1. DATA HANDLING.........................................................................................................................1- Conventions Used in this Book...........................................................................................................................iv
    • 1.1 Reading (Opening) the Data Set..................................................................................................1-
      • 1.1.A. Reading SPSS Data.................................................................................................................1-
      • 1.1.B. Reading Data from Spreadsheet Formats - e.g. - Excel...........................................................1-
      • 1.1.C. Reading Data from Simple Database Formats - e.g. - Dbase..................................................1-
      • 1.1.D. Reading Data from other Statistical Programs (SAS, STATA, etc.).......................................1-
    • 1.2 Defining the Attributes of Variables...........................................................................................1-
      • 1.2.A. Variable Type..........................................................................................................................1-
      • 1.2.B. Missing Values......................................................................................................................1-
      • 1.2.C. Column Format......................................................................................................................1-
      • 1.2.D. Variable Labels......................................................................................................................1-
      • 1.2.E. Value Labels for Categorical and Dummy Variables............................................................1-
      • 1.2.F. Perusing the Attributes of Variables......................................................................................1-
      • 1.2.G. The File Information Utility..................................................................................................1-
    • 1.3 Weighing Cases...........................................................................................................................1-
    • 1.4 Creating a Smaller Data Set by Aggregating Over a Variable..............................................1-
    • 1.5 Sorting..........................................................................................................................................1-
    • 1.6 Reducing Sample Size.................................................................................................................1-
      • 1.6.A. Using Random Sampling.......................................................................................................1-
      • 1.6.B. Using a Time/Case Range.....................................................................................................1-
    • 1.7 Filtering Data..............................................................................................................................1-
      • 1.7.A. A Simple Filter......................................................................................................................1-
      • 1.7.B. What to Do After Obtaining the Sub-set...............................................................................1-
  • Table of Contents Table of Contents- - 1.7.C. What to Do After the Sub-set is No Longer Needed.............................................................1- - than One Variable...............................................................................................................................1- 1.7.D. Complex Filter: Choosing a Sub-set of Data Based On Criterion from More
    • 1.8 Replacing Missing Values..........................................................................................................1-
    • 1.9 Using Sub-sets of Variables (and Not of Cases, as in 1.7).......................................................1-
    1. CREATING NEW VARIABLES....................................................................................................2-
    • 2.1 Creating Dummy, Categorical, and Semi-continuos Variables .............................................2-
      • 2.1.A. What Are Dummy and Categorical Variables?.....................................................................2-
      • 2.1.B. Creating New Variables Using Recode.................................................................................2-
      • 2.1.C. Replacing Existing Variables Using Recode.........................................................................2-
      • 2.1.D. Obtaining a Dummy Variable as a By-product of Filtering..................................................2-
      • 2.1.E. Changing a Text Variable into a Numeric Variable..............................................................2-
        • Compute.................................................................................................................................2- 2.2 Using Mathematical Computations to Create New Continuous Variables:
      • 2.2.A. A Simple Computation..........................................................................................................2-
      • 2.2.B. Using Built-in SPSS Functions to Create a Variable............................................................2-
        • Categorical Variables...........................................................................................................2- 2.3 Multiple Response Sets-- Using a "Set" Variable Consisting of Several
        • Values Across a Group of Variables...................................................................................2- 2.4 Creating a "Count" Variable to Add the Number of Occurrences of Similar
    • 2.5 Continuous Variable Groupings Created Using Cluster Analysis.........................................2-
    1. UNIVARIATE ANALYSIS............................................................................................................3-
    • 3.1 Graphs (Bar, Line, Area and Pie).............................................................................................3-
      • 3.1.A. Simple Bar Graphs.................................................................................................................3-
      • 3.1.B. Line Graphs...........................................................................................................................3-
      • 3.1.C. Graphs for Cumulative Frequency........................................................................................3-
      • 3.1.D. Pie Graphs..............................................................................................................................3-
    • 3.2 Frequencies and Distributions...................................................................................................3-
      • 3.2.A. The Distribution of Variables - Histograms and Frequency Statistics..................................3-
      • 3.2.B. Checking the Nature of the Distribution of Continuous Variables.......................................3-
  • Table of Contents Table of Contents- - 3.2.C. Transforming a Variable to Make it Normally Distributed...................................................3- - 3.2.D. Testing for other Distributions..............................................................................................3- - 3.2.E. A Formal Test to Determine the Distribution Type of a Variable.........................................3-
    • 3.3 Other Basic Univariate Procedures (Descriptives and Boxplots)...........................................3-
      • 3.3.A. Descriptives...........................................................................................................................3-
      • 3.3.B. Boxplots.................................................................................................................................3-
        • Bar).........................................................................................................................................3- 3.4 Testing if the Mean is Equal to a Hypothesized Number (the T-Test and Error
      • 3.4.C. Error Bar (Graphically Showing the Confidence Intervals of Means)..................................3-
      • 3.4.A. A Formal Test: The T-Test....................................................................................................3-
    1. COMPARING SIMILAR VARIABLES...........................................................................................4-
    • 4.1 Graphs (Bar, Pie)........................................................................................................................4-
    • 4.2 Boxplots.......................................................................................................................................4-
    • 4.3 Comparing Means and Distributions........................................................................................4-
      • 4.3.A. Error Bars..............................................................................................................................4-
      • 4.3.B. The Paired Samples T-Test....................................................................................................4-
      • Samples Non-parametric Test............................................................................................................4- 4.3.C. Comparing Distributions when Normality Cannot Be Assumed - 2 Related
    1. MULTIVARIATE STATISTICS.....................................................................................................5-
    • 5.1 Graphs.........................................................................................................................................5-
      • 5.1.A. Graphing a Statistic (e.g. - the Mean) of Variable "Y" by Categories of X..........................5-
      • "Z" ..............................................................................................................................................5- 5.1.B. Graphing a Statistic (e.g. - the Mean) of Variable "Y" by Categories of "X" and
      • 5.1.C. Using Graphs to Capture User-designed Criterion................................................................5-
      • 5.1.D. Boxplots.................................................................................................................................5-
    • 5.2 Scatters.........................................................................................................................................5-
      • 5.2.A. A Simple Scatter....................................................................................................................5-
      • 5.2.B. Plotting Scatters of Several Variables Against Each other....................................................5-
      • 5.2.C. Plotting Two X-Variables Against One Y.............................................................................5-
    • 5.3 Correlations.................................................................................................................................5-
      • 5.3.A. Bivariate Correlations............................................................................................................5-
  • Table of Contents Table of Contents- - 5.3.B. Non-parametric (Spearman's) Bivariate Correlation.............................................................5- - 5.3.C. Partial Correlations................................................................................................................5-
    • 5.4 Conducting Several Bivariate ExplorationsSimultaneously...................................................5- - Error Bar, T-Test, ANOVA, and Non-parametric Tests..................................................5- 5.5 Comparing the Means and Distributions of Sub-groups of a Variable -
      • 5.5.A. Error Bars..............................................................................................................................5-
      • 5.5.B. The Independent Samples T-Test..........................................................................................5-
      • 5.5.C. ANOVA (one-way)...............................................................................................................5-
      • 5.5.D. Non-parametric Testing Methods..........................................................................................5-
    1. TABLES......................................................................................................................................6-
    • 6.1 Tables for Statistical Attributes................................................................................................6-
      • 6.1.A. Summary Measure of a Variable...........................................................................................6-
      • 6.1.B. Obtaining More Than One Summary Statistic......................................................................6-
      • 6.1.C. Summary of a Variable's Values Categorized by Three Other Variables.............................6-
    • 6.2 Tables of Frequencies.................................................................................................................6-
    1. LINEAR REGRESSION..................................................................................................................7-
      1. 1 Linear Regression.........................................................................................................................7-
      1. 2 Interpretation of Regression Results..........................................................................................7-
      1. 3 Problems Caused by Breakdown of Classical Assumptions...................................................7-
      1. 4 Diagnostics...................................................................................................................................7-
        1. 4.A. Collinearity............................................................................................................................7-
        1. 4.B. Misspecification.....................................................................................................................7-
        1. 4.C. Incorrect Functional Form.....................................................................................................7-
        1. 4.D. Omitted Variable...................................................................................................................7-
        1. 4.E. Inclusion of an Irrelevant Variable........................................................................................7-
        1. 4.F. Measurement Error................................................................................................................7-
        1. 4.G. Heteroskedasticity.................................................................................................................7-
  • Table of Contents Table of Contents-
      1. 5 Formally Testing for Heteroskedasticity: White’s Test..........................................................7-
    1. CORRECTING FOR BREAKDOWN OF CLASSICAL ASSUMPTIONS..............................................8-
      1. 1 Correcting for Collinearity..........................................................................................................8-
        1. 1.A. Dropping All But One of the Collinear Variables from the Model.........................................8-
      1. 2 Correcting for Heteroskedasticity...............................................................................................8-
        1. 2.A. WLS When Exact Nature of Heteroskedasticity is Not Known..............................................8-
        1. 2.B. Weight Estimation When the Weight is Known.....................................................................8-
      1. 3 Correcting for Incorrect Functional Form...............................................................................8-
      1. 4 Correcting for Simultaneity Bias: 2SLS...................................................................................8-
      1. 5 Correcting for Other Breakdowns............................................................................................8-
        1. 5.C. Omitted Variable...................................................................................................................8-
        1. 5.A. Irrelevant Variable.................................................................................................................8-
        1. 5.B. Measurement Error in Dependent Variable...........................................................................8-
        1. 5.C. Measurement Error in Independent Variable(s)....................................................................8-
    1. MLE: LOGIT AND NON-LINEAR REGRESSION..........................................................................9-
      1. 1 Logit...............................................................................................................................................9-
      1. 1 Non-linear Regression..................................................................................................................9-
        1. 1.A. Curve Estimation.....................................................................................................................9-
        1. 1.B. General Non-linear Estimation (and Constrained Estimation)..............................................9-
    1. COMPARATIVE ANALYSIS.........................................................................................................10-
      1. 1 Using Split File to Compare Results.........................................................................................10-
        1. 1.A. Example of a Detailed Comparative Analysis......................................................................10-
    1. FORMATTING AND EDITING OUTPUT......................................................................................11-
      1. 1 Formatting and Editing Tables.................................................................................................11-
        1. 1.A. Accessing the Window for Formatting / Editing Tables......................................................11-
        1. 1.B. Changing the Width of Columns..........................................................................................11-
  • Table of Contents Table of Contents- - 11. 1.C. Deleting Columns.................................................................................................................11- - 11. 1.D. Transposing..........................................................................................................................11- - 11. 1.E. Finding Appropriate Width and Height................................................................................11- - 11. 1.F. Deleting Specific Cells.........................................................................................................11- - 11. 1.G. Editing (Data or Text) in Specific Cells...............................................................................11- - 11. 1.H. Changing the Font................................................................................................................11- - 11. 1.I. Inserting Footnotes...............................................................................................................11- - 11. 1.J. Picking from Pre-set Table Formatting Styles.....................................................................11- - 11. 1.K. Changing Specific Style Properties....................................................................................11- - 11. 1.L. Changing the Shading of Cells...........................................................................................11- - 11. 1.M Changing the Data Format of Cells....................................................................................11- - 11. 1.N. Changing the Alignment of the Text or Data in Cells........................................................11- - 11. 1.O. Formatting Footnotes..........................................................................................................11- - 11. 1.P. Changing Borders and Gridlines.........................................................................................11- - 11. 1.Q. Changing the Font of Specific Components (Data, Row Headers, etc.)............................11-
      1. 2 Formatting and Editing Charts...............................................................................................11-
        1. 2.A. Accessing the Window for Formatting / Editing Charts....................................................11-
        1. 2.B. Using the Mouse to Edit Text.............................................................................................11-
        1. 2.C. Changing a Chart from Bar Type to Area/Line Type (or Vice Versa)...............................11-
        1. 2.D. Making a Mixed Bar/Line/Area Chart................................................................................11-
        1. 2.E. Converting into a Pie Chart.................................................................................................11-
        1. 2.F. Using the Series Menu: Changing the Series that are Displayed.......................................11-
        1. 2.G. Changing the Patterns of Bars, Areas, and Slices...............................................................11-
        1. 2.H. Changing the Color of Bars, Lines, Areas, etc...................................................................11-
        1. 2.I. Changing the Style and Width of Lines..............................................................................11-
        1. 2.J. Changing the Format of the Text in Labels, Titles, or Legends.........................................11-
        1. 2.K. Flipping the Axes................................................................................................................11-
        1. 2.L. Borders and Frames............................................................................................................11-
        1. 2.M Titles and Subtitles..............................................................................................................11-
        1. 2.N. Footnotes............................................................................................................................11-
        1. 2.O. Legend Entries....................................................................................................................11-
        1. 2.P. Axis Formatting...................................................................................................................11-
        1. 2.Q. Adding/Editing Axis Titles.................................................................................................11-
        1. 2.R. Changing the Scale of the Axes..........................................................................................11-
        1. 2.S. Changing the Increments in which Values are Displayed on an Axis................................11-
        1. 2.T. Gridlines.............................................................................................................................11-
        1. 2.U. Formatting the Labels Displayed on an Axis.....................................................................11-
    1. READING ASCII TEXT DATA....................................................................................................12-
      1. 1 Understanding ASCII Text Data...............................................................................................12-
        1. 1.A. Fixed-field/Fixed-column.....................................................................................................12-
  • Table of Contents Table of Contents- - 12. 1.B. Delimited/Freefield...............................................................................................................12-
      1. 2 Reading Data Stored in ASCII Tab-delimited Format...........................................................12-
      1. 4 Reading Data Stored in Fixed Width (or Column) Format....................................................12-
    1. MERGING: ADDING CASES & VARIABLES..............................................................................13-
      1. 1 Adding New Observations.........................................................................................................13-
      1. 2 Adding New Variables (Merging).............................................................................................13-
        1. 2.A. One-way Merging.................................................................................................................13-
        1. 2.B. Comparing the Three Kinds of Merges: A Simple Example................................................13-
    1. NON-PARAMETRIC TESTING....................................................................................................14-
      1. 1 Binomial Test..............................................................................................................................14-
      1. 2 Chi-square...................................................................................................................................14-
      1. 3 The Runs Test - Checking Whether a Variable is Randomly Distributed..........................14-
    1. SETTING SYSTEM DEFAULTS....................................................................................................15-
      1. 1 General Settings..........................................................................................................................15-
      1. 2 Choosing the Default View of the Data and Screen.................................................................15-
    1. READING DATA FROM DATABASE FORMATS.........................................................................16-
    1. TIME SERIES ANALYSIS............................................................................................................17-
      1. 1 Sequence Charts (Line Charts with Time on the X-Axis).......................................................17-
        1. 1.A. Graphs of the "Level” (Original, Untransformed) Variables...............................................17-
        1. 1.B. Graphs of Transformed Variables (Differenced, Logs)........................................................17-
  • Table of Contents Table of Contents-
      1. 2 Formal Checking for Unit Roots / Non-stationarity..............................................................17-
        1. 2.A. Checking the “Level” (Original, Untransformed) Variables..............................................17-
        1. 2.B. The Removal of Non-stationarity Using Differencing and Taking of Logs.......................17-
      1. 3 Determining Lagged Effects of other Variables....................................................................17-
        • Lag, etc.)...............................................................................................................................17- 17. 4 Creating New Variables (Using Time Series-specific Formulae: Difference,
        1. 4.A. Replacing Missing Values..................................................................................................17-
      1. 5 ARIMA......................................................................................................................................17-
      1. 6 Correcting for First-order Autocorrelation Among Residuals ...........................................17-
      1. 7 Co-integration...........................................................................................................................17-

Introduction i Introduction

Merits of the book

This book is the only user-oriented book on SPSS:

 It uses a series of pictures and simple instructions to teach each procedure. Users can

conduct procedures by following the graphically illustrated examples. The book is

designed for the novice - even those who are inexperienced with SPSS, statistics, or

computers. Though its content leans toward econometric analysis, the book can be used

by those in varied fields, such as market research, criminology, public policy,

management, business administration, nursing, medicine, psychology, sociology,

anthropology, etc.

 Each method is taught in a step-by-step manner.

 An analytical thread is followed throughout the book - the goal of this method is to

show users how to combine different procedures to maximize the benefits of using

SPSS.

 To ensure simplicity, the book does not get into the details of statistical procedures.

Nor does it use mathematical notation or lengthy discussions. Though it does not

qualify as a substitute for a statistics text, users may find that the book contains most of

the statistics concepts they will need to use.

Organization of the Chapters

The chapters progress naturally, following the order that one would expect to find in a

typical statistics project.

Chapter 1, “Data Handling," teaches the user how to work with data in SPSS.

It teaches how to insert data into SPSS, define missing values, label variables, sort data,

filter the file (work on sub-sets of the file) and other data steps. Some advanced data

procedures, such as reading ASCII text files and merging files, are covered at the end of

the book (chapters 12 and 13).

Chapter 2, “Creating New Variables,” shows the user how to create new categorical and

continuous variables.

The new variables are created from transformations applied to the existing variables in

the data file and by using standard mathematical, statistical, and logical operators and

functions on these variables.

Chapter 3, “Univariate Analysis,” highlights an often-overlooked step - comprehensive analysis

of each variable.

Introduction iii

Chapter 11, “Formatting Output,” teaches how to format output.

This is a SPSS feature ignored by most users. Reviewers of reports will often equate

good formatting with thorough analysis. It is therefore recommended that users learn

how to properly format output.

Chapters 1-11 form the sequence of most statistics projects. Usually, they will be sufficient

for projects/classes of the typical user. Some users may need more advanced data

handling and statistical procedures. Chapters 12-17 explore several of these procedures.

The ordering of the chapters is based on the relative usage of these procedures in

advanced statistical projects and econometric analysis.

Chapter 12, “Reading ASCII Text Data,” and chapter 13 “Adding Data,”deal specifically with

reading ASCII text files and merging files.

The task of reading ASCII text data has become easier in SPSS 9.0 (as compared to all

earlier versions). This text teaches the procedure from versions 7.x forward.

Chapter 14, "Non-Parametric Testing," shows the use of some non-parametric methods.

The exploration of various non-parametric methods, beyond the topic-specific methods

included in chapters 3, 4, and 5, are discussed herein.

Chapter 15, "Setting System Options," shows how to set some default settings.

Users may wish to quickly browse through this brief section before reading Chapter 1.

Chapter 16 shows how to read data from any ODBC source database application/format.

SPSS 9.0 also has some more database-specific features. Such features are beyond the

scope of this book and are therefore not included in this section that deals specifically

with ODBC source databases.

Chapter 17 shows time series analysis.

The chapter includes a simple explanation of the non-stationarity problem and

cointegration. It also shows how to correct for non-stationarity, determine the

specifications for an ARIMA model, and conduct an ARIMA estimation. Correction

for first-order autocorrelation is also demonstrated.

Book 2 in this series ("SPSS for Beginners: Advanced Methods") will include chapters on

hierarchical cluster analysis, discriminant analysis, factor analysis, optimal scaling,

correspondence analysis, reliability analysis, multi-dimensional scaling, general log-linear

models, advanced ANOVA and GLM techniques, survival analysis, advanced ranking, using

programming in SPSS syntax, distance (Euclidean and other) measurement, M-estimators, and

Probit and seasonal aspects of time series.

As these chapters are produced, they will be available for free download at www.spsss.org.

This may be the first interactive book in academic history! Depending on your comments/

feedback/requests, we will be making regular changes to the book and the free material on

the web site.

The table of contents is exhaustive. Refer to it to find topics of interest.

Introduction iv

The index is in two parts - part 1 is a menu-to-chapter (and section) mapping, whereas part 2 is

a regular index.

Conventions used in this book

 All menu options are in all-caps. For example, the shortened version of: “Click on the

menu^1 "Statistics," choose the option "Regression," within that menu, choose the option

"Linear regression," will read:

“Go to STATISTICS / REGRESSION / LINEAR REGRESSION."

 Quotation marks identify options in pictures. For example: Select the button “Clustered.”

 Variable names are usually in italics. For example, gender , wage , and fam_id. Variable

names are expanded sometimes within the text. For example, work_ex would read work

experience.

 Text and pictures are placed side-by-side. When a paragraph describes some text in a

picture, the picture will typically be to the right of the paragraph.

 Written instructions are linked to highlighted portions of the picture they describe. The

highlighted portions are denoted either by a rectangle or ellipse around the relevant picture-

component or by a thick arrow, which should prompt the user to click on the image.

 Some terms the user will need to know: a dialog box is the box in any Windows® software

program that opens up when a menu option is chosen. A menu option is the list of

procedures that the user will find on the top of the computer screen.

 Text that is shaded but not boxed is a note, reminder, or tip that digresses a bit from the

main text.

 Text that is shaded a darker gray and boxed highlights key features.

Data set used in the example followed through this book

One data set is used for most of the illustrations and examples in this book. This allows the user

to use the book as a tutorial. Unfortunately, as of present, we cannot find an uncorrupted

version of the data file (we had a virus problem). As and when we can obtain such a copy (from

an ex-student hopefully) we will place it at http://www.spss.org/wwwroot/spssdown.asp

The variables in the data set:

1. Fam_id : an id number, unique for each family surveyed.

2. Fam_mem : the family member responding to the survey. A family (with a unique fam_id )

may have several family members who answered the survey.

3. Wage : the hourly wage of the respondent.

4. Age : the age (in years) of the respondent.

5. Work_ex : the work experience (in years) of the respondent.

6. Gender : a dummy variable taking the value “0” for male respondents and “1” for female

respondents.

(^1) A “menu” is a list of options available from the list on the top of the computer screen. Most software applications have these standard menus: FILE, EDIT, WINDOW, and HELP.