Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
The VARCLUS Procedure

Computational Resources

Let
n &=& {number of observations}\v &=& {number of variables}\c &=& {number of clusters}

It is assumed that, at each stage of clustering, the clusters all contain the same number of variables.

Time

The time required for PROC VARCLUS to analyze a given data set varies greatly depending on the number of clusters requested, the number of iterations in both the alternating least-squares and search phases, and whether centroid or principal components are used.

The time required to compute the correlation matrix is roughly proportional to nv2.

Default cluster initialization requires time roughly proportional to v3. Any other method of initialization requires time roughly proportional to cv2.

In the alternating least-squares phase, each iteration requires time roughly proportional to cv2 if centroid components are used or

(c+5[v/(c2)])v2
if principal components are used.

In the search phase, each iteration requires time roughly proportional to v3/c if centroid components are used or v4/c2 if principal components are used. The HIERARCHY option speeds up each iteration after the first split by as much as c/2.

Memory

The amount of memory, in bytes, needed by PROC VARCLUS is approximately
v2+2vc+20v+15c

Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
Top
Top

Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.