Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
The CORRESP Procedure

Types of Tables Used as Input

The following example explains correspondence analysis and illustrates some capabilities of PROC CORRESP.

   data Neighbor;
      input Name $ 1-10 Age $ 12-18 Sex $ 19-25
            Height $ 26-30 Hair $ 32-37;
      datalines;
   Jones      Old    Male   Short White
   Smith      Young  Female Tall  Brown
   Kasavitz   Old    Male   Short Brown
   Ernst      Old    Female Tall  White
   Zannoria   Old    Female Short Brown
   Spangel    Young  Male   Tall  Blond
   Myers      Young  Male   Tall  Brown
   Kasinski   Old    Male   Short Blond
   Colman     Young  Female Short Blond
   Delafave   Old    Male   Tall  Brown
   Singer     Young  Male   Tall  Brown
   Igor       Old           Short
   ;

There are several types of tables, N, that can be used as input to correspondence analysis -all tables can be defined using a binary matrix, Z.

With the BINARY option, N = Z is directly analyzed. The binary matrix has one column for each category and one row for each individual or case. A binary table constructed from m categorical variables has m partitions. The following table has four partitions, one for each of the four categorical variables. Each partition has a 1 in each row, and each row contains exactly four 1s since there are four categorical variables. More generally, the binary design matrix has exactly m 1s in each row. The 1s indicate the categories to which the observation applies.

Table 24.2: Z, The Binary Coding of Neighbor Data Set
ZHair ZHeight ZSex ZAge
Blond Brown White Short Tall Female Male Old Young
001100110
010011001
010100110
001011010
010101010
100010101
010010101
100100110
100101001
010010110
010010101

With the MCA option, the Burt table (Z'Z) is analyzed. A Burt table is a partitioned symmetric matrix containing all pairs of crosstabulations among a set of categorical variables. Each diagonal partition is a diagonal matrix containing marginal frequencies (a crosstabulation of a variable with itself). Each off-diagonal partition is an ordinary contingency table. Each contingency table above the diagonal has a transposed counterpart below the diagonal.

Table 24.3: Z'Z, The Burt Table
  Blond Brown White Short Tall Female Male Old Young
Blond300211212
Brown060242433
White002111120
Short221502341
Tall141062424
Female121224022
Male241340743
Old132422460
Young230142305

This Burt table is composed of all pairs of crosstabulations among the variables Hair, Height, Sex, and Age. It is composed of sixteen individual subtables -the number of variables squared. Both the rows and the columns have the same nine categories (in this case Blond, Brown, White, Short, Tall, Female, Male, Old, and Young). The off-diagonal partitions are crosstabulations of each variable with every other variable. Below the diagonal are the following crosstabulations (from left to right, top to bottom): Height * Hair, Sex * Hair, Sex * Height, Age * Hair, Age * Height, and Age * Sex. Each crosstabulation below the diagonal has a transposed counterpart above the diagonal. Each diagonal partition contains a crosstabulation of a variable with itself (Hair * Hair, Height * Height, Sex * Sex, and Age * Age). The diagonal elements of the diagonal partitions contain marginal frequencies of the off-diagonal partitions.

For example, the table Hair * Height has three rows for Hair and two columns for Height. The values of the Hair * Height table, summed across rows, sum to the diagonal values of the Height * Height table, as displayed in the following table.

Table 24.4: ZHair,Height'ZHeight, The (Hair Height) × Height Crosstabulation
  Short Tall
Blond21
Brown24
White11
Short50
Tall06

A simple crosstabulation of Hair × Height is N = ZHair'ZHeight. Crosstabulations such as this, involving only two variables, are the input to simple correspondence analysis.

Table 24.5: ZHair'ZHeight, The Hair × Height Crosstabulation
  Short Tall
Blond21
Brown24
White11

Tables such as the following (N = ZHair'ZHeight,Sex), made up of several crosstabulations, can also be analyzed in simple correspondence analysis.

Table 24.6: ZHair'ZHeight,Sex, The Hair × (Height Sex) Crosstabulation
  Short Tall Female Male
Blond2112
Brown2424
White1111

Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
Top
Top

Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.