STAT 330 Lecture 17
Reading for Today's Lecture: 7.4, 9.5
Goals of Today's Lecture:
Today's notes
Summary of Sample Size Calculations
The text gives formulas for sample sizes and power calculations
for one and two sample problems for means and proportions with one
or two sided alternatives which are useful if the solution is a
reasonably large sample size (so that estimation of
can
be more or less ignored and so that normal approximations to
and
can be made). It also gives a method of
estimating sample sizes in small samples from normal populations
for use in either a one or two sample context; see Appendix A.13.
Inference for Variances
Theorem: If
is a sample from a normal population with mean
and SD
then:
has a t distribution on n-1 degrees of freedom.
This theorem is based on:
Definition: If
are iid N(0,1) then we say that
has a chi-squared distribution on
degrees of freedom.
Fact: The density of W is
That is,
is a Gamma distribution with shape
and scale
2.
Fact: If U and V are independent, U is N(0,1) and V is
then
Related Fact: If
and
are independent and
has
a
distribution and
has a
distribution then we say
with
numerator degrees of freedom and
denominator degrees
of freedom.
Here are some plots of F densities. Notice the centering around 1 when the two degrees of freedom are both large.
Inferential Uses
we get confidence intervals at level
for
by taking
the interval between
and
Take a square root to get confidence intervals for the more meaningful
parameter
.
or
To get P values you take the one tailed P value from F tables and double it.
and rejecting the null in favour of the alternative
if
where the quantity
denotes the point such that the
area to the right of this point under an F density with n-1 numerator degrees of freedom and m-1
denominator degrees of freedom is
.
Example: Michelson data, first 20 measurements and last 20 measurements are
X's and Y's. We find
and
.
Consider first the problem of a confidence interval for
the
population standard deviation of the measurement errors for the first 20
measurements. There are 19 degrees of freedom for
and the critical
values are
and
from A.6 on page 708 in the text. This leads to the interval for
running from
to
or from 18.3 to 35.2.
Next consider the question of whether or not the precision of the
measurements has changed. We test
against the two
sided alternative.
There are 19 numerator and 19 denominator degrees of freedom. Now the F tables contain upper tail critical points only for the upper tail probabilities 0.05 and 0.01. We find
and
[NOTE: the tables give only 15 and 20 numerator degrees of freedom; I interpolated to get my numbers by linear interpolation - since 19 is four fifths of the way between 15 1nd 20 I went four fifths fo the way between the figures in the two columns under 15 and 20.]
In fact our F value is large than 3.03 so we would reject the hypothesis at the level 0.02 if our alternatie is two sided and at the level 0.01 if our alternative is one sided. Using Splus software I get a one tailed P value of 0.00298. and a 2 tailed P value twice that or roughly 0.006. In any case we conclude that there is compelling evidence of a change in the precision of the measurements from the first twenty to the last twenty made by Michelson.
Implication: the two sample t test for a change in bias is inappropriate; you should use the Satterthwaite approximate calculation of the degrees of freedom for the UNPOOLED version of the two sample test. This method is described on pp 362 and 363.