Chapter Contents

Previous

Next
The COMPARE Procedure

ID Statement


Lists variables to use to match observations.

See also: A Comparison with an ID Variable
Featured in: Comparing Observations with an ID Variable


ID <DESCENDING> variable-1
<...<DESCENDING> variable-n>
<NOTSORTED>;


Required Arguments

variable
specifies the variable that the procedure uses to match observations. You can specify more than one variable, but the data set must be sorted by the variable or variables you specify. These variables are ID variables. ID variables also identify observations on the printed reports and in the output data set.


Options

DESCENDING
specifies that the data set is sorted in descending order by the variable that immediately follows the word DESCENDING in the ID statement.

If you use the DESCENDING option, you must sort the data sets. The SAS System does not use an index to process an ID statement with the DESCENDING option. Further, the use of DESCENDING for ID variables must correspond to the use of the DESCENDING option in the BY statement in the PROC SORT step that was used to sort the data sets.

NOTSORTED
specifies that observations are not necessarily sorted in alphabetic or numeric order. The data are grouped in another way, for example, chronological order.
See also: Comparing Unsorted Data


Requirements for ID Variables


Comparing Unsorted Data
If you do not want to sort the data set by the ID variables, you can use the NOTSORTED option. When you specify the NOTSORTED option, or if the ID statement is omitted, PROC COMPARE matches the observations one-to-one. That is, PROC COMPARE matches the first observation in the base data set with the first observation in the comparison data set, the second with the second, and so on. If you use NOTSORTED, and the ID values of corresponding observations are not the same, PROC COMPARE prints an error message and stops processing.

If the data sets are not sorted by the common ID variables and you do not specify the NOTSORTED option, PROC COMPARE prints a warning message and continues to process the data sets as if you had specified NOTSORTED.


Avoiding Duplicate ID Values
The observations in each data set should be uniquely labeled by the values of the ID variables. If PROC COMPARE finds two successive observations with the same ID values in a data set, it

When the data sets are not sorted, PROC COMPARE detects only those duplicate observations that occur in succession.


Chapter Contents

Previous

Next

Top of Page

Copyright 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.