Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
The SYSLIN Procedure

SUR, 3SLS, and FIML Estimation

In a multivariate regression model, the errors in different equations may be correlated. In this case the efficiency of the estimation may be improved by taking these cross-equation correlations into account.

Seemingly Unrelated Regression

Seemingly unrelated regression (SUR), also called joint generalized least squares (JGLS) or Zellner estimation, is a generalization of OLS for multi-equation systems. Like OLS, the SUR method assumes that all the regressors are independent variables, but SUR uses the correlations among the errors in different equations to improve the regression estimates. The SUR method requires an initial OLS regression to compute residuals. The OLS residuals are used to estimate the cross-equation covariance matrix.

The SUR option on the PROC SYSLIN statement specifies seemingly unrelated regression, as shown in the following statements:


   proc syslin data=in sur;
      demand: model q = p y s;
      supply: model q = p u;
   run;

INSTRUMENTS and ENDOGENOUS statements are not needed for SUR, since the SUR method assumes there are no endogenous regressors. For SUR to be effective, the models must use different regressors. SUR produces the same results as OLS unless the model contains at least one regressor not used in the other equations.

Three-Stage Least Squares

The three-stage least-squares method generalizes the two-stage least-squares method to take account of the correlations between equations in the same way that SUR generalizes OLS. Three-stage least squares requires three steps: first-stage regressions to get predicted values for the endogenous regressors; a two-stage least-squares step to get residuals to estimate the cross-equation correlation matrix; and the final 3SLS estimation step.

The 3SLS option on the PROC SYSLIN statement specifies the three-stage least-squares method, as shown in the following statements.


   proc syslin data=in 3sls;
      endogenous  p;
      instruments y u s;
      demand: model q = p y s;
      supply: model q = p u;
   run;

The 3SLS output begins with a two-stage least-squares regression to estimate the cross-model correlation matrix. This output is the same as the 2SLS results shown in Figure 19.3 and Figure 19.4, and is not repeated here. The next part of the 3SLS output prints the cross-model correlation matrix computed from the 2SLS residuals. This output is shown in Figure 19.5 and includes the cross-model covariances, correlations, the inverse of the correlation matrix, and the inverse covariance matrix.

 
The SYSLIN Procedure
Three-Stage Least Squares Estimation

Cross Model Covariance
  DEMAND SUPPLY
DEMAND 0.027892 -.011283
SUPPLY -.011283 0.018991
 
Cross Model Correlation
  DEMAND SUPPLY
DEMAND 1.00000 -0.49022
SUPPLY -0.49022 1.00000
 
Cross Model Inverse Correlation
  DEMAND SUPPLY
DEMAND 1.31634 0.64530
SUPPLY 0.64530 1.31634
 
Cross Model Inverse Covariance
  DEMAND SUPPLY
DEMAND 47.1945 28.0380
SUPPLY 28.0380 69.3130
Figure 19.5: Estimated Cross-Model Covariances used for 3SLS Estimates

The final 3SLS estimates are shown in Figure 19.6.

 
The SYSLIN Procedure
Three-Stage Least Squares Estimation

System Weighted MSE 0.5711
Degrees of freedom 113
System Weighted R-Square 0.9627
 
Model DEMAND
Dependent Variable q
Label Quantity
 
Parameter Estimates
Variable DF Parameter
Estimate
Standard Error t Value Pr > |t| Variable
Label
Intercept 1 1.980261 1.169169 1.69 0.0959 Intercept
p 1 -1.17654 0.605012 -1.94 0.0568 Price
y 1 0.404115 0.117179 3.45 0.0011 Income
s 1 0.359204 0.085077 4.22 <.0001 Price of Substitutes
 
Model SUPPLY
Dependent Variable q
Label Quantity
 
Parameter Estimates
Variable DF Parameter
Estimate
Standard Error t Value Pr > |t| Variable
Label
Intercept 1 -0.51878 0.490999 -1.06 0.2952 Intercept
p 1 1.333080 0.059271 22.49 <.0001 Price
u 1 -1.14623 0.243491 -4.71 <.0001 Unit Cost
Figure 19.6: Three-Stage Least Squares Results

This output first prints the system weighted mean square error and system weighted R2 statistics. The system weighted MSE and system weighted R2 measure the fit of the joint model obtained by stacking all the models together and performing a single regression with the stacked observations weighted by the inverse of the model error variances. See the section "The R2 Statistics" for details.

Next, the table of 3SLS parameter estimates for each model is printed. This output has the same form as for the other estimation methods.

Note that the 3SLS and 2SLS results may be the same in some cases. This results from the same principle that causes OLS and SUR results to be identical unless an equation includes a regressor not used in the other equations of the system. However, the application of this principle is more complex when instrumental variables are used. When all the exogenous variables are used as instruments, linear combinations of all the exogenous variables appear in the third-stage regressions through substitution of first-stage predicted values.

In this example, 3SLS produces different (and, it is hoped, more efficient) estimates for the demand equation. However, the 3SLS and 2SLS results for the supply equation are the same. This is because the supply equation has one endogenous regressor and one exogenous regressor not used in other equations. In contrast, the demand equation has fewer endogenous regressors than exogenous regressors not used in other equations in the system.

Full Information Maximum Likelihood

The FIML option on the PROC SYSLIN statement specifies the full information maximum likelihood method, as shown in the following statements.


   proc syslin data=in fiml;
      endogenous  p q;
      instruments y u s;
      demand: model q = p y s;
      supply: model q = p u;
   run;

The FIML results are shown in Figure 19.7.

 
The SYSLIN Procedure
Full-Information Maximum Likelihood Estimation

NOTE: Convergence criterion met at iteration 3.
 
Model DEMAND
Dependent Variable q
Label Quantity
 
Parameter Estimates
Variable DF Parameter
Estimate
Standard Error t Value Pr > |t| Variable
Label
Intercept 1 1.988529 1.233625 1.61 0.1126 Intercept
p 1 -1.18147 0.652274 -1.81 0.0755 Price
y 1 0.402310 0.107269 3.75 0.0004 Income
s 1 0.361345 0.103816 3.48 0.0010 Price of Substitutes
 
Model SUPPLY
Dependent Variable q
Label Quantity
 
Parameter Estimates
Variable DF Parameter
Estimate
Standard Error t Value Pr > |t| Variable
Label
Intercept 1 -0.52443 0.479522 -1.09 0.2787 Intercept
p 1 1.336083 0.057939 23.06 <.0001 Price
u 1 -1.14804 0.237793 -4.83 <.0001 Unit Cost
Figure 19.7: FIML Results

Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
Top
Top

Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.