Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
PROC CAPABILITY and General Statements

Descriptive Statistics

This section provides computational details for the descriptive statistics which are computed with the PROC CAPABILITY statement. These statistics can also be saved in the OUT= data set by specifying the keywords listed in Table 7.1 in the OUTPUT statement.

Standard algorithms (Fisher 1973) are used to compute the moment statistics. The computational methods used by the CAPABILITY procedure are consistent with those used by other SAS procedures for calculating descriptive statistics. For details on statistics also calculated by base SAS software, see SAS Language Reference: Dictionary.

The following sections give specific details on several statistics calculated by the CAPABILITY procedure.

Mean

The sample mean is calculated as
\frac{\sum^n_{i=1} w_i x_i}{\sum^n_{i=1} w_i}
where n is the number of nonmissing values for a variable, xi is the i th value of the variable, and wi is the weight associated with the i th value of the variable. If there is no WEIGHT= variable, the formula reduces to \frac{1}n \sum^n_{i=1} x_i.

Sum

The sum is calculated as \sum^n_{i=1} w_i x_i, where n is the number of nonmissing values for a variable, xi is the i th value of the variable, and wi is the weight associated with the i th value of the variable. If there is no WEIGHT= variable, the formula reduces to \sum^n_{i=1} x_i.

Sum of the Weights

The sum of the weights is calculated as \sum^n_{i=1} w_i,where n is the number of nonmissing values for a variable and wi is the weight associated with the i th value of the variable. If there is no WEIGHT= variable, the sum of the weights is n.

Variance

The variance is calculated as

		\(
 \frac{1}d \sum^n_{i=1} w_i (x_i-{\bar{X}}_w)^2
\)
where n is the number of nonmissing values for a variable, xi is the i th value of the variable, {\bar{X}}_w is the weighted mean, wi is the weight associated with the i th value of the variable, and d is the divisor controlled by the VARDEF= option in the PROC CAPABILITY statement. If there is no WEIGHT= variable, the formula reduces to

		\(
 \frac{1}d \sum^n_{i=1} (x_i-{\bar{X}}_w)^2
\)

Standard Deviation

The standard deviation is calculated as

		\(
 \sqrt{ \frac{1}d \sum^n_{i=1} w_i (x_i-{\bar{X}}_w)^2 }
\)
where n is the number of nonmissing values for a variable, xi is the i th value of the variable, {\bar{X}}_wis the weighted mean, wi is the weight associated with the i th value of the variable, and d is the divisor controlled by the VARDEF= option in the PROC CAPABILITY statement. If there is no WEIGHT= variable, the formula reduces to

		\(
 \sqrt{ \frac{1}d \sum^n_{i=1} (x_i-{\bar{X}}_w)^2 }
\)

Skewness

The sample skewness is calculated as
\frac{n}{(n-1)(n-2)} \sum^n_{i=1}
 ( \frac{x_i-\bar{X}}s )^3
where n is the number of nonmissing values for a variable and must be greater than 2, xi is the i th value of the variable, \bar{X} is the sample average, and s is the sample standard deviation.

The sample skewness can be positive or negative; it measures the asymmetry of the data distribution and estimates the theoretical skewness \sqrt{\beta_1} = \mu_3 \mu_2^{-\frac{3}2},where \mu_2 and \mu_3 are the second and third central moments. Observations that are normally distributed should have a skewness near zero.

Kurtosis

The sample kurtosis is calculated as
\frac{n (n+1)}{(n-1)(n-2)(n-3)} \sum_{i=1}^n
 ( \frac{x_i-\bar{X}}s )^4 -
 \frac{3 (n-1)^2}{(n-2)(n-3)}
where n > 3. The sample kurtosis measures the heaviness of the tails of the data distribution. It estimates the adjusted theoretical kurtosis denoted as \beta_2-3, where \beta_2 = \frac{\mu_4}{{\mu_2}^2},and \mu_4 is the fourth central moment. Observations that are normally distributed should have a kurtosis near zero.

Coefficient of Variation (CV)

The coefficient of variation is calculated as
CV = \frac{100 x s}{\bar{X}}

Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
Top
Top

Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.