
STAT 380 Week 2

Summary

Example 3: Mean values

In the last names (branching) example, $Z_n$ = total number of sons in generation $n$.

$ Z_0=1$ for convenience.

Compute $ {\rm E}(Z_n)$.

Recall definition of expected value:

If $ X$ is discrete then

$\displaystyle {\rm E}(X) = \sum_x x P(X=x)
$

If $ X$ is absolutely continuous then

$\displaystyle {\rm E}(X) = \int_{-\infty}^\infty x f(x) dx
$

Theorem: If $Y=g(X)$ and $X$ has density $f$, then

$\displaystyle {\rm E}(Y) = {\rm E}(g(X)) =\int g(x) f(x) dx
$
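
As a quick numerical illustration of this theorem (a Python sketch; the Exponential(1) density and $g(x)=x^2$ are illustrative choices, not from the notes), the sample mean of $g(X)$ should be close to $\int g(x)f(x)dx = \int_0^\infty x^2 e^{-x} dx = 2$:

    import numpy as np

    # E(g(X)) by Monte Carlo for X ~ Exponential(1), g(x) = x^2.
    # The theorem says this should match int_0^inf x^2 e^(-x) dx = 2.
    rng = np.random.default_rng(0)
    x = rng.exponential(scale=1.0, size=10**6)
    print(x.mean())       # ~ 1, i.e. E(X)
    print((x**2).mean())  # ~ 2, i.e. E(g(X))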

Key properties of $ {\rm E}$:

1: If $X\ge 0$ then ${\rm E}(X) \ge 0$, with equality iff $P(X=0)=1$.

2: $ {\rm E}(aX+bY) = a{\rm E}(X) +b{\rm E}(Y)$.

3: If $ 0 \le X_1 \le X_2 \le \cdots$ then

$\displaystyle {\rm E}(\lim X_n) = \lim {\rm E}(X_n)
$

4: $ {\rm E}(1) = 1$.

Conditional Expectations

If $X$ and $Y$ are two discrete random variables then

$\displaystyle {\rm E}(Y\vert X=x) = \sum_y y P(Y=y\vert X=x)
$

Extension to absolutely continuous case:

Joint pmf of $ X$ and $ Y$ is defined as

$\displaystyle p(x,y) = P(X=x,Y=y)
$

Notice: The pmf of $ X$ is

$\displaystyle p_X(x) = \sum_y p(x,y)
$
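
As a tiny illustration (the joint table below is invented for this example, not from the notes), summing a joint pmf over $y$ gives the marginal pmf of $X$:

    # Marginal pmf p_X(x) = sum_y p(x, y) for a small made-up joint pmf.
    p = {(0, 0): 0.1, (0, 1): 0.2, (1, 0): 0.3, (1, 1): 0.4}
    p_X = {}
    for (x, y), prob in p.items():
        p_X[x] = p_X.get(x, 0.0) + prob
    print(p_X)  # {0: 0.3, 1: 0.7} (up to floating point)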

Analogue for densities: joint density of $ X,Y$ is

$\displaystyle f(x,y) dx dy \approx P(x \le X \le x+dx, y \le Y \le y+dy)
$

The interpretation is that

$\displaystyle P(X \in A, Y \in B) = \int_A \int_B f(x,y) dy dx
$

Property: if $ X,Y$ have joint density $ f(x,y)$ then $ X$ has density

$\displaystyle f_X(x) = \int_{-\infty}^\infty f(x,y)\, dy
$

Sums for discrete rvs are replaced by integrals.

Example:

$\displaystyle f(x,y) = \begin{cases}x+y & 0 \le x,y \le 1
\\
0 & \text{otherwise}
\end{cases}$

is a density because

\begin{align*}
\iint f(x,y)\,dx\,dy &= \int_0^1\int_0^1 (x+y)\, dy\, dx \\
&= \int_0^1 x\, dx + \int_0^1 y\, dy \\
&= \frac{1}{2} + \frac{1}{2} = 1
\end{align*}

The marginal density of $X$ is, for $0 \le x \le 1$,

\begin{align*}
f_X(x) &= \int_{-\infty}^\infty f(x,y)\,dy \\
&= \int_0^1 (x+y)\, dy \\
&= \left.\left(xy+\frac{y^2}{2}\right)\right\vert_0^1 = x+\frac{1}{2}
\end{align*}

For $ x$ not in $ [0,1]$ the integral is 0 so

$\displaystyle f_X(x) = \begin{cases}x+\frac{1}{2} & 0 \le x \le 1
\\
0 & \text{otherwise}
\end{cases}$
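
Both calculations are easy to confirm numerically (a midpoint-rule sketch in Python; the grid size is arbitrary):

    import numpy as np

    # Midpoint-rule check that f(x,y) = x + y integrates to 1 on the
    # unit square, and that the marginal density at a grid point x0
    # matches x0 + 1/2.
    n = 2000
    u = (np.arange(n) + 0.5) / n          # midpoints of [0, 1]
    X, Y = np.meshgrid(u, u, indexing="ij")
    f = X + Y
    print(f.sum() / n**2)                 # ~ 1.0
    i = 600
    print(f[i].sum() / n, u[i] + 0.5)     # both ~ 0.80025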

Conditional Densities

If $ X$ and $ Y$ have joint density $ f_{X,Y}(x,y)$ then we define the conditional density of $ Y$ given $ X=x$ by analogy with our interpretation of densities. We take limits:

$\displaystyle f_{Y\vert X}(y\vert x)\,dy \approx \frac{P(x \le X \le x+dx,\; y \le Y \le y+dy)}{P(x \le X \le x+dx)}
$

in the sense that if we divide through by $dy$ and let $dx$ and $dy$ tend to 0, the conditional density is the limit

$\displaystyle f_{Y\vert X}(y\vert x) = \frac{\displaystyle \lim_{dx,\,dy \to 0} \frac{P(x \le X \le x+dx,\; y \le Y \le y+dy)}{dx\, dy}}{\displaystyle \lim_{dx\to 0} \frac{P(x \le X \le x+dx)}{dx}}
$

Going back to our interpretation of joint densities and ordinary densities, we see that our definition is just

$\displaystyle f_{Y\vert X}(y\vert x) = \frac{f_{X,Y}(x,y)}{f_X(x)}
$

When talking about a pair $ X$ and $ Y$ of random variables we refer to $ f_{X,Y}$ as the joint density and to $ f_X$ as the marginal density of $ X$.

Example: For the $f$ of the previous example, the conditional density of $Y$ given $X=x$ is defined only for $0 \le x \le 1$:

$\displaystyle f_{Y\vert X}(y\vert x) = \begin{cases}
\frac{x+y}{x+\frac{1}{2}} & 0 \le x \le 1,\ 0 \le y \le 1
\\
0 & 0 \le x \le 1,\ y > 1
\\
0 & 0 \le x \le 1,\ y < 0
\\
\text{undefined} & \text{otherwise}
\end{cases}$
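
A quick check that, for each fixed $x$ in $[0,1]$, this conditional density integrates to 1 over $y$ (Python sketch):

    import numpy as np

    # For a few x in [0, 1], integrate f_{Y|X}(y|x) = (x + y)/(x + 1/2)
    # over y in [0, 1]; each integral should come out as 1.
    y = (np.arange(4000) + 0.5) / 4000
    for x in (0.1, 0.5, 0.9):
        print(x, ((x + y) / (x + 0.5)).sum() / 4000)  # ~ 1.0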

Example: $X$ is a Poisson$(\lambda)$ random variable. Observe $X$, then toss a coin $X$ times; $Y$ is the number of heads, and $P(\text{heads}) = p$.

\begin{align*}
f_Y(y) &= \sum_x f_{X,Y}(x,y) \\
&= \sum_x f_{Y\vert X}(y\vert x) f_X(x) \\
&= \sum_{x=0}^\infty \binom{x}{y} p^y(1-p)^{x-y} \times \frac{\lambda^x}{x!} e^{-\lambda}
\end{align*}

WARNING: the sum requires $0 \le y \le x$ with $x$ and $y$ integers, so it really runs from $x=y$ to $\infty$:

\begin{align*}
f_Y(y) &= \frac{(p\lambda)^y e^{-\lambda}}{y!} \sum_{x=y}^\infty \frac{\left[(1-p)\lambda\right]^{x-y}}{(x-y)!} \\
&= \frac{(p\lambda)^y e^{-\lambda}}{y!} \sum_{k=0}^\infty \frac{\left[(1-p)\lambda\right]^{k}}{k!} \\
&= \frac{(p\lambda)^y e^{-\lambda}}{y!} e^{(1-p)\lambda} \\
&= e^{-p\lambda} (p\lambda)^y / y!
\end{align*}

which is a Poisson($ p\lambda$) distribution.
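
The two-stage experiment is easy to simulate (a sketch; $\lambda = 4$ and $p = 0.3$ are illustrative values). A Poisson($p\lambda$) variable has mean and variance both equal to $p\lambda$, so both sample moments of $Y$ should be near 1.2:

    import numpy as np

    # Simulate X ~ Poisson(lambda), then Y | X = x ~ Binomial(x, p).
    rng = np.random.default_rng(1)
    lam, p = 4.0, 0.3
    x = rng.poisson(lam, size=10**6)
    y = rng.binomial(x, p)        # one coin toss per count in X
    print(y.mean(), y.var())      # both ~ p*lam = 1.2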

Conditional Expectations

If $ X$ and $ Y$ are continuous random variables with joint density $ f_{X,Y}$ we define:

$\displaystyle {\rm E}(Y\vert X=x) = \int y f_{Y\vert X}(y\vert x)\, dy
$

Key properties of conditional expectation

1: If $Y\ge 0$ then ${\rm E}(Y\vert X=x) \ge 0$, with equality iff $P(Y=0\vert X=x)=1$.

2: $ {\rm E}(A(X)Y+B(X)Z\vert X=x) = A(x){\rm E}(Y\vert X=x) +B(x){\rm E}(Z\vert X=x)$.

3: If $ Y$ and $ X$ are independent then

$\displaystyle {\rm E}(Y\vert X=x) = {\rm E}(Y)
$

4: $ {\rm E}(1\vert X=x) = 1$.

Example:

$\displaystyle f(x,y) = \begin{cases}x+y & 0 \le x,y \le 1
\\
0 & \text{otherwise}
\end{cases}$

has conditional of $ Y\vert X$:

$\displaystyle f_{Y\vert X}(y\vert x) = \begin{cases}
\frac{x+y}{x+\frac{1}{2}} & 0 \le x \le 1,\ 0 \le y \le 1
\\
0 & 0 \le x \le 1,\ y > 1
\\
0 & 0 \le x \le 1,\ y < 0
\\
\text{undefined} & \text{otherwise}
\end{cases}$

so, for $ 0 \le x \le 1$,

\begin{align*}
{\rm E}(Y\vert X=x) &= \int_0^1 y\, \frac{x+y}{x+\frac{1}{2}}\, dy \\
&= \frac{\left.\left(\frac{xy^2}{2}+\frac{y^3}{3}\right)\right\vert_0^1}{x+\frac{1}{2}} \\
&= \frac{x/2 + 1/3}{x+1/2}
\end{align*}

Computing expectations by conditioning:

Notation: $ {\rm E}(Y\vert X)$ is the function of $ X$ you get by working out $ {\rm E}(Y\vert X=x)$, getting a formula in $ x$ and replacing $ x$ by $ X$. This makes $ {\rm E}(Y\vert X)$ a random variable.

Properties:

1: $ {\rm E}(A(X)Y+B(X)Z\vert X) = A(X){\rm E}(Y\vert X) +B(X){\rm E}(Z\vert X)$.

2: If $ Y$ and $ X$ are independent then

$\displaystyle {\rm E}(Y\vert X) = {\rm E}(Y)
$

3: $ {\rm E}(1\vert X) = 1$.

4: $ {\rm E}\left[{\rm E}(Y\vert X)\right] = {\rm E}(Y) $ (compute average holding $ X$ fixed first, then average over $ X$).

In example:

$\displaystyle {\rm E}(Y\vert X) = \frac{X+2/3}{2X+1}
$
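
Property 4 can be checked by simulation for this example (a rejection-sampling sketch; rejection works because $f \le 2$ on the unit square). Both sample averages should be near ${\rm E}(Y) = 7/12 \approx 0.583$:

    import numpy as np

    # Draw (X, Y) from f(x,y) = x + y on the unit square by rejection,
    # then compare the average of Y with the average of E(Y|X).
    rng = np.random.default_rng(2)
    N = 4 * 10**6
    x = rng.uniform(size=N)
    y = rng.uniform(size=N)
    keep = rng.uniform(size=N) < (x + y) / 2   # accept w.p. f(x,y)/2
    x, y = x[keep], y[keep]
    print(y.mean())                            # ~ 7/12
    print(((x + 2/3) / (2*x + 1)).mean())      # ~ 7/12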

Application to the last names problem. Put $m={\rm E}(X)$, where $X=Z_1$ is the number of sons of the original ancestor. Given $X=k$, generation $n$ is built from $k$ independent copies of the process run for $n-1$ generations, so ${\rm E}(Z_n\vert X) = X{\rm E}(Z_{n-1})$. Then

\begin{align*}
{\rm E}(Z_n) &= {\rm E}\left[{\rm E}(Z_n\vert X)\right] \\
&= {\rm E}\left[ X{\rm E}(Z_{n-1})\right] \\
&= {\rm E}(X){\rm E}(Z_{n-1}) \\
&= m\,{\rm E}(Z_{n-1}) \\
&= m^2\, {\rm E}(Z_{n-2}) \\
&\ \ \vdots \\
&= m^{n-1}{\rm E}(Z_1) \\
&= m^n
\end{align*}

For $m < 1$ we expect exponential decay of the mean; for $m>1$, exponential growth (if the line does not go extinct).
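
The recursion is easy to check by simulation (a sketch; the Poisson($m$) offspring distribution is an illustrative choice, not something fixed by the notes):

    import numpy as np

    # Simulate many copies of the branching process and compare the
    # sample mean of Z_n with m^n.
    rng = np.random.default_rng(3)
    m, n_gen, reps = 1.2, 8, 10**5
    z = np.ones(reps, dtype=np.int64)   # Z_0 = 1 in each copy
    for _ in range(n_gen):
        # total sons of z independent Poisson(m) parents is Poisson(m*z)
        z = rng.poisson(m * z)
    print(z.mean(), m**n_gen)           # both ~ 4.30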

Summary of Probability Review

We have reviewed the following definitions: expected value (for discrete and absolutely continuous random variables); joint and marginal pmfs and densities; conditional densities; and conditional expectations, including the random variable ${\rm E}(Y\vert X)$.

Tactics for expected values: use linearity of ${\rm E}$, and compute expectations by conditioning via ${\rm E}(Y) = {\rm E}\left[{\rm E}(Y\vert X)\right]$.


Richard Lockhart
2002-02-07