next up previous

STAT 350: Lecture 14

Reparametrization

Suppose X is the design matrix of a linear model and that tex2html_wrap_inline50 is the design matrix of the linear model we get by imposing some linear restrictions on the model using X. A good example is on assignment 3 but here is another. Consider the one way layout, also called the K sample problem. We have K samples from K populations with means tex2html_wrap_inline60 for tex2html_wrap_inline62 . Suppose the ith sample size is n+i. This is a linear model, provided we are able to assume that all the population variances are the same.

The resulting design matrix is

displaymath68

where there are tex2html_wrap_inline70 copies of the first row, tex2html_wrap_inline72 and then tex2html_wrap_inline74 copies of tex2html_wrap_inline76 and so on. Under the restriction tex2html_wrap_inline78 there is just one parameter, say tex2html_wrap_inline80 , and the design matrix tex2html_wrap_inline82 is just a column of tex2html_wrap_inline84 1s. Notice that tex2html_wrap_inline82 is not a submatrix of the original design matrix X. However, if A is the tex2html_wrap_inline92 matrix with all entries equal to 1 we have tex2html_wrap_inline94 so that the column space of tex2html_wrap_inline82 is a subspace (of dimension 1) of the K dimensional column space of X. The extra sum of squares principle can thus be used to test the null hypothesis tex2html_wrap_inline102 by fitting the two models and computing, using the notation of ANOVA from STAT 330:

displaymath104

The numerator of this sum simplifies algebraically to

displaymath106

so that this extra Sum of Squares F test is the usual F test in this problem; this is universal.

It is actually quite common to reparametrize the full model in such a way that the null hypothesis of interest is of the form tex2html_wrap_inline112 . For the 1 way ANOVA there are two such reparametrizations in common use.

The first of these defines a grand mean parameter tex2html_wrap_inline114 and individual ``effects'' tex2html_wrap_inline116 . This new model has K+1 parameters apparently and the corresponding design matrix, tex2html_wrap_inline50 , would not have full rank; its rank would be K although it would have K+1 columns. As such the matrix tex2html_wrap_inline126 would be singular and we could not find unique least squares estimates. The problem is that we have defined the parameters in such a way that there is a linear restriction on them, namely, tex2html_wrap_inline128 . We get around this problem by dropping tex2html_wrap_inline130 and remembering in our model equations that tex2html_wrap_inline132 .

If you now write out the model equations with tex2html_wrap_inline134 and the tex2html_wrap_inline136 as parameters you get the design matrix

displaymath138

Students will have seen this matrix in 330 in the case where all the tex2html_wrap_inline140 are the same and the fractions in the last tex2html_wrap_inline142 rows of tex2html_wrap_inline144 are all equal to -1. Notice that the hypothesis tex2html_wrap_inline78 is the same as tex2html_wrap_inline148 .

The other reparametrization is ``corner-point coding'' where we define new parameters by tex2html_wrap_inline150 and tex2html_wrap_inline152 . For this parameterization the null hypothesis of interest is tex2html_wrap_inline154 . The design matrix is

displaymath156


next up previous



Richard Lockhart
Mon Mar 3 11:10:42 PST 1997