
STAT 380 Lecture 10

Simulation

Monte Carlo computation of expected value:

To compute $ {\rm E}(X)$: do experiment $ n$ times to generate $ n$ independent observations all with same distribution as $ X$.

Get $ X_1,\ldots,X_n$.

Use

$\displaystyle \bar{X} = \frac{1}{n} \sum_1^n X_i
$

as an estimate of $ {\rm E}(X)$

To estimate $ {\rm E}(g(X))$: use same $ X_i$ and compute

$\displaystyle \frac{1}{n} \sum_1^n g(X_i) \, .
$
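
Illustration (code is mine, not the lecture's): a minimal Python sketch of both estimates; the name mc_mean and the Exponential(1) test case are assumptions.

    import random

    def mc_mean(draw, g=lambda x: x, n=100_000):
        """Estimate E[g(X)] by averaging g over n independent draws of X."""
        return sum(g(draw()) for _ in range(n)) / n

    # Test case: X ~ Exponential(1), so E(X) = 1 and E(X^2) = 2.
    draw = lambda: random.expovariate(1.0)
    print(mc_mean(draw))                     # approx 1
    print(mc_mean(draw, g=lambda x: x * x))  # approx 2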

Random Number Generation

In practice: truly random quantities are not used.

Use: pseudo-random uniform numbers.

They ``behave like'' iid Uniform(0,1) variables.

Many, many generators. One standard kind: linear congruential generators.

Start with $ x_0$, an integer in the range $ 0 \le x_0 < m$. Compute

$\displaystyle x_{n+1} = (a x_n+b) \mod{m}
$

Here the mod means compute the remainder after division by $ m$.

Integers $ a$ and $ b$ are chosen so that the sequence has a long period (ideally the full period $ m$) and the output behaves like an iid uniform sequence.

Use

$\displaystyle U_n = \frac{x_n}{m}
$

as Uniform(0,1) random numbers.
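
A minimal Python sketch of such a generator; the constants below are one classical choice (the ANSI C values), not ones fixed by the lecture.

    def lcg(x0, a=1103515245, b=12345, m=2**31):
        """Linear congruential generator: yields U_n = x_n / m."""
        x = x0
        while True:
            x = (a * x + b) % m   # x_{n+1} = (a x_n + b) mod m
            yield x / m

    gen = lcg(x0=42)
    uniforms = [next(gen) for _ in range(5)]  # five pseudo-random uniforms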

General Random Variables

We now pretend we can generate $ U_1,U_2,\ldots$ which are iid Uniform(0,1).

Monte Carlo methods for generating a sample:

Inverse (Probability Integral) Transform

Fact: if $ F$ is a continuous CDF and $ X,U$ satisfy

$\displaystyle U=F(X)
$

then $ U\sim$Uniform(0,1) if and only if $ X\sim F$.

Proof: For simplicity, assume $ F$ is strictly increasing on an interval $ (a,b)$ with $ F(a)=0$ and $ F(b)=1$.

If $ U\sim$Uniform(0,1) then

\begin{align*}
P(X \le x) &= P(F(X) \le F(x)) \\
&= P(U \le F(x)) \\
&= F(x)
\end{align*}

Conversely: if $ X\sim F$ and $ 0 < u < 1$ then there is a unique $ x$ with $ F(x) = u$, and

\begin{align*}
P(U \le u) &= P(F(X) \le F(x)) \\
&= P(X \le x) \\
&= F(x) \\
&= u
\end{align*}

Application: generate $ U$ and solve $ F(X)=U$ for $ X$ to get $ X=F^{-1}(U)$, which has cdf $ F$.

Example: For the Exponential($ \lambda$) distribution

$\displaystyle F(x) = 1-e^{-\lambda x}, \qquad x \ge 0
$

Set $ F(X)=U$ and solve to get

$\displaystyle X=-\log(1-U)/\lambda
$

Observation: $ 1-U$ is also Uniform(0,1), so $ X=-\log(U)/\lambda$ also has an Exponential($ \lambda$) distribution.
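
A minimal Python sketch of this transform (the function name is mine):

    import math
    import random

    def exp_inv(lam):
        """Exponential(lam) via the inverse transform X = -log(U)/lam."""
        u = 1.0 - random.random()   # in (0, 1], avoids log(0)
        return -math.log(u) / lam

    xs = [exp_inv(2.0) for _ in range(100_000)]
    print(sum(xs) / len(xs))        # approx 1/lambda = 0.5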

Example: for $ F$ the standard normal cdf, solving

$\displaystyle F(X) = U
$

requires numerical approximation, but such solutions are built into many languages.
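
For instance, Python's statistics.NormalDist.inv_cdf performs this numerical inversion; a minimal sketch (the resampling guard is mine):

    import random
    from statistics import NormalDist

    def normal_inv():
        """N(0,1) via X = F^{-1}(U), with F the standard normal cdf."""
        u = random.random()
        while u == 0.0:             # inv_cdf requires 0 < u < 1
            u = random.random()
        return NormalDist().inv_cdf(u)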

Special purpose transformations:

Example: If $ X$ and $ Y$ are iid $ N(0,1)$, define $ R\ge 0$ and $ \Theta\in[0,2\pi)$ by

$\displaystyle X=R\cos(\Theta) \qquad Y=R\sin(\Theta)
$

NOTE: Book says

$\displaystyle \Theta = \tan^{-1}(Y/X)
$

but this takes values in $ (-\pi/2,\pi/2)$. Here the notation

$\displaystyle \tan^{-1}(x,y)
$

means angle $ \theta$ in $ [0,2\pi)$ so that

$\displaystyle \frac{x}{\sqrt{x^2+y^2}} = \cos(\theta) \qquad \frac{y}{\sqrt{x^2+y^2}} =
\sin(\theta) \, .
$

Then

$\displaystyle \{(r,\theta) : r \le r^*, \theta \le \theta^*\}
$

is a sector of the disc of radius $ r^*$. (Start at $ (r^*,0)$ and go counter-clockwise to angle $ \theta^*$.)

Compute joint cdf of $ R,\Theta$ at $ r^*,\theta^*$.

Define

\begin{multline*}
C = \{(x,y) : x^2+y^2 \le (r^*)^2,\\ 0 \le \tan^{-1}(x,y) \le \theta^*\}
\end{multline*}

Then

$\displaystyle P(R\le r^*,\Theta \le\theta^*)
=\iint_C \frac{e^{-(x^2+y^2)/2}}{2\pi}\, dx\, dy
$

Do integral in polar co-ordinates to get

$\displaystyle \int_0^{\theta^*}\!\int_0^{r^*} \frac{e^{-r^2/2}}{2\pi}\, r\, dr\, d\theta
$

The $ \theta$ integral just gives

$\displaystyle \theta^*/(2\pi)
$

The $ r$ integral then gives

$\displaystyle 1-e^{-(r^*)^2/2}
$

So

$\displaystyle P(R\le r^*,\Theta \le\theta^*)
= \left(1-e^{-(r^*)^2/2}\right) \times \frac{\theta^*}{2\pi}
$

The joint cdf is a product of two cdfs, so $ R$ and $ \Theta$ are independent.

Moreover $ R^2$ has cdf

$\displaystyle P(R^2 \le x) = 1-e^{-x/2}
$

which is Exponential with rate $ 1/2$.

$ \Theta$ is Uniform$ (0,2\pi)$.

So: generate $ U_1, U_2$ iid Uniform(0,1).

Define

$\displaystyle \Theta = 2\pi U_2
$

and

$\displaystyle R=\sqrt{-2\log(U_1)}
$

Put

$\displaystyle X=R\cos(\Theta) \qquad Y=R\sin(\Theta)
$

You have generated two independent $ N(0,1)$ variables.
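
This is the classical Box-Muller method; a minimal Python sketch:

    import math
    import random

    def box_muller():
        """Two independent N(0,1) draws from two Uniform(0,1) draws."""
        u1 = 1.0 - random.random()           # in (0,1], avoids log(0)
        u2 = random.random()
        r = math.sqrt(-2.0 * math.log(u1))   # R^2 ~ Exponential(1/2)
        theta = 2.0 * math.pi * u2           # Theta ~ Uniform(0, 2*pi)
        return r * math.cos(theta), r * math.sin(theta)

    x, y = box_muller()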

Acceptance-Rejection

If $ F(X)=U$ is difficult to solve (e.g. $ F$ is hard to compute), we sometimes use acceptance-rejection.

Goal: generate $ X\sim F$ where $ F^\prime=f$.

Tactic: find density $ g$ and constant $ c$ such that

$\displaystyle f(x) \le c g(x)
$

and such that we can generate $ Y$ from the density $ g$.

Algorithm:

  1. Generate $ Y$ which has density $ g$.

  2. Generate $ U\sim$Uniform(0,1) independent of $ Y$.

  3. If $ U \le f(Y)/\{cg(Y)\}$ let $ X=Y$.

  4. If not, reject $ Y$ and go back to step 1; repeat, generating a new $ Y$ and $ U$ (independently), until you accept a $ Y$.
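
A minimal Python sketch of this loop (argument names are mine; g_sample must draw from the density g):

    import random

    def accept_reject(f, g, g_sample, c):
        """Return X with density f, given an envelope f(x) <= c*g(x)."""
        while True:
            y = g_sample()                 # step 1: Y ~ g
            u = random.random()            # step 2: U ~ Uniform(0,1)
            if u <= f(y) / (c * g(y)):     # step 3: accept
                return y
            # step 4: otherwise loop and try again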

Facts: the number of iterations until acceptance is Geometric with success probability $ 1/c$, and, most important, $ X\sim f$:

Proof: this is like the old craps example:

We compute

$\displaystyle P( X \le x) = P[ Y \le x\vert U \le f(Y)/\{cg(Y)\}]
$

(condition on the first iteration where the condition is met).

Condition on $ Y$ to get

\begin{align*}
P(X \le x) &= \frac{\int_{-\infty}^x P[U \le f(Y)/\{cg(Y)\}\vert Y=y]\,g(y)\,dy}{P[U \le f(Y)/\{cg(Y)\}]} \\
&= \frac{\int_{-\infty}^x \frac{f(y)}{cg(y)}\, g(y)\,dy}{1/c} \\
&= \int_{-\infty}^x f(y)\, dy \\
&= F(x)
\end{align*}

as desired.

Example: Half-normal distribution: $ X$ has density

$\displaystyle f(x) = \frac{2e^{-x^2/2}}{\sqrt{2\pi}}1(x>0)
$

(Density of absolute value of $ N(0,1)$.)

Use $ g(x) = ae^{-ax}1(x>0)$, the Exponential($ a$) density. To find $ c$, maximize

$\displaystyle \frac{f(x)}{g(x)} = \frac{2e^{-x^2/2+ax}}{a\sqrt{2\pi}}
$

Rewrite as

$\displaystyle \frac{2e^{a^2/2} e^{-(x-a)^2/2}}{a\sqrt{2\pi}}
$

Maximum is at $ x=a$ giving

$\displaystyle c=\frac{2e^{a^2/2}}{a\sqrt{2\pi}}
$

Choose $ a$ to minimize $ c$ (or $ \log(c)$); get $ a=1$, so

$\displaystyle c=\frac{2e^{1/2}}{\sqrt{2\pi}} = \sqrt{2e/\pi} \approx 1.32
$

Algorithm is then: generate $ Y=-\log(U_1)$ (so $ Y$ is Exponential(1)) and compute

$\displaystyle f(Y)/\{cg(Y)\} = e^{-(Y-1)^2/2}
$

Generate $ U_2$ and if

$\displaystyle U_2 \le e^{-(Y-1)^2/2}
$

accept $ Y$; otherwise try again.

To generate $ N(0,1)$: use a third uniform $ U_3$ to pick a sign at random: negative if $ U_3 < 1/2$, positive otherwise.
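
Putting the pieces together, a minimal Python sketch of the whole scheme:

    import math
    import random

    def std_normal():
        """N(0,1) via acceptance-rejection from an Exponential(1) envelope."""
        while True:
            y = -math.log(1.0 - random.random())          # Y ~ Exponential(1)
            if random.random() <= math.exp(-(y - 1.0) ** 2 / 2.0):
                break                                     # accept: Y is half-normal
        return -y if random.random() < 0.5 else y         # attach a random sign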

Discrete Distributions

The inverse transformation method works for discrete distributions.

If $ X$ has possible values $ \{x_1,x_2,\ldots\}$ with probabilities $ \{p_1,p_2,\ldots\}$, compute the cumulative probabilities

$\displaystyle P_k = p_1+\cdots+p_k
$

with $ P_0=0$.

Generate $ U\sim$Uniform(0,1).

Find $ k$ such that

$\displaystyle P_{k-1} < U \le P_k
$

Put $ X=x_k$.
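
A minimal Python sketch using a linear scan (a bisection search over the $ P_k$ would be faster for long lists):

    import random
    from itertools import accumulate

    def discrete_inv(values, probs):
        """Return values[k] for the first k with U <= P_k."""
        u = random.random()
        for x, cum in zip(values, accumulate(probs)):
            if u <= cum:
                return x
        return values[-1]   # guard against floating-point round-off

    # e.g. P(X=1)=0.2, P(X=2)=0.3, P(X=3)=0.5
    x = discrete_inv([1, 2, 3], [0.2, 0.3, 0.5])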


Richard Lockhart
2002-03-20