
STAT 804: Notes on Lecture 5

Model identification

By model identification for a time series $ X$ we mean the process of selecting values of $ p,q$ so that the $ ARMA(p,q)$ process gives a reasonable fit to our data. The most important model identification tool is a plot of (an estimate of) the autocorrelation function of $ X$; we use the abbreviation ACF for this function. Before we discuss doing this with real data we explore what plots of the ACF of various $ ARMA(p,q)$ processes should look like (in the absence of estimation error).

For an $ MA(q)$ process we found that

$\displaystyle C_X(h) = \begin{cases}
\sigma^2 \sum_{j=0}^{q-\vert h\vert} b_j b_{j+\vert h\vert} & \vert h\vert \le q
\\
0 & \text{otherwise.}
\end{cases}$

This has the important qualitative feature that it vanishes for $ \vert h\vert > q$: the ACF of an $ MA(q)$ process cuts off after lag $ q$.
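To see the cutoff concretely, here is a minimal Python sketch (the MA(2) coefficients and innovation variance below are illustrative choices, not values from these notes) that evaluates the formula above:

def ma_autocovariance(b, sigma2, h):
    """Theoretical autocovariance C_X(h) of an MA(q) process
    X_t = sum_j b_j eps_{t-j}, with coefficients b = (b_0, ..., b_q)
    and innovation variance sigma2."""
    q = len(b) - 1
    h = abs(h)
    if h > q:
        return 0.0  # the autocovariance cuts off beyond lag q
    return sigma2 * sum(b[j] * b[j + h] for j in range(q - h + 1))

# Illustrative MA(2): the ACF vanishes for |h| > 2.
b = [1.0, 0.5, -0.3]
print([ma_autocovariance(b, 1.0, h) for h in range(5)])
# approximately [1.34, 0.35, -0.3, 0.0, 0.0]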

For an $ AR(1)$ process $ X_t-\mu=\rho(X_{t-1}-\mu)+\epsilon_t$ the autocorrelation function is

$\displaystyle \rho_X(h) = \rho^{\vert h\vert}
$

which has the qualitative feature of decaying geometrically in $ \vert h\vert$.
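As a quick numerical check (a sketch only: the value $ \rho=0.7$, the seed, and the series length are arbitrary), one can simulate an AR(1) and compare sample autocorrelations, computed as in the estimation section at the end of these notes, with $ \rho^{\vert h\vert}$:

import numpy as np

rng = np.random.default_rng(0)
rho, T = 0.7, 10_000
eps = rng.standard_normal(T)
x = np.zeros(T)
for t in range(1, T):
    x[t] = rho * x[t - 1] + eps[t]        # AR(1) recursion with mu = 0

xc = x - x.mean()
c0 = np.dot(xc, xc) / T
for h in range(5):
    ch = np.dot(xc[:T - h], xc[h:]) / T   # lag-h product average, divisor T
    print(h, round(ch / c0, 3), round(rho ** h, 3))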

To derive the autocovariance for a general $ AR(p)$ we mimic the technique for $ p=1$. If $ X_t = \sum_{j=1}^p a_j X_{t-j} + \epsilon_t$ then

\begin{align*}
C_X(h) &= \text{Cov}(X_0,X_h) \\
&= \sum_{j=1}^p a_j \text{Cov}(X_0,X_{h-j}) + \text{Cov}(X_0,\epsilon_h) \\
&= \sum_{j=1}^p a_j C_X(h-j)
\end{align*}

for $ h > 0$; for such $ h$ the term Cov$ (X_0,\epsilon_h)$ vanishes because $ \epsilon_h$ is uncorrelated with the earlier $ X_0$. Divide these equations through by $ C_X(0)$ and remember that $ \rho_X(h) = C_X(h)/C_X(0)$ and $ \rho_X(-k) = \rho_X(k)$; you then see that the recursions for $ h=1,\ldots,p$ are $ p$ linear equations in the $ p$ unknowns $ \rho_X(1),\ldots,\rho_X(p)$. They are called the Yule-Walker equations. For instance, when $ p=2$ we get

\begin{align*}
C_X(2) &= a_1C_X(1) + a_2 C_X(0) \\
C_X(1) &= a_1 C_X(0) + a_2 C_X(-1)
\end{align*}

which becomes, after division by $ C_X(0)$

\begin{align*}
\rho_X(2) &= a_1\rho_X(1) + a_2 \\
\rho_X(1) &= a_1 + a_2 \rho_X(1)
\end{align*}

It is possible to use generating functions to get explicit formulas for the $ \rho(h)$, but here we simply observe that we have two equations in two unknowns to solve. The second equation shows that

$\displaystyle \rho(1) = \frac{a_1}{1-a_2}
$

which is impossible if $ a_2=1$ (unless $ a_1=0$), and which fails to be a valid correlation (a number in $ [-1,1]$) for some other $ (a_1,a_2)$ pairs. The first equation then gives

$\displaystyle \rho(2) =\frac{ a_1^2 +a_2(1-a_2)}{1-a_2}
$

Notice that the Yule-Walker recursion $ \rho(h) = a_1\rho(h-1)+a_2\rho(h-2)$ permits $ \rho(h)$ to be calculated recursively from $ \rho(1)$ and $ \rho(2)$ for $ h \ge 3$.
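A minimal Python sketch of this computation (the stationary pair $ a_1=0.5$, $ a_2=0.3$ is an arbitrary illustration) solves for $ \rho(1)$ and then applies the recursion:

import numpy as np

def ar2_acf(a1, a2, hmax):
    """Autocorrelations of a stationary AR(2) from the Yule-Walker
    equations: rho(1) = a1/(1 - a2), then the recursion for h >= 2."""
    rho = np.empty(hmax + 1)
    rho[0] = 1.0
    rho[1] = a1 / (1.0 - a2)
    for h in range(2, hmax + 1):
        rho[h] = a1 * rho[h - 1] + a2 * rho[h - 2]
    return rho

print(np.round(ar2_acf(0.5, 0.3, 8), 3))   # decays to 0 at a geometric rate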

Now look at the characteristic polynomial $ \phi(x)$. When $ a_2=1$ we have

$\displaystyle \phi(x) = 1 - a_1 x -x^2 = (1-\alpha_1 x)(1-\alpha_2 x)
$

where $ 1/\alpha_i$, $ i=1,2$, are the two roots. Multiplying out we find that $ \alpha_1\alpha_2 = -1$, so either one of the two $ \alpha_i$ has modulus more than 1 (and then the root $ 1/\alpha_i$ has modulus less than 1) or both have modulus 1. In the latter case the $ \alpha_i$ must be real, because a complex conjugate pair would have product $ \vert\alpha_1\vert^2 > 0$ rather than $ -1$; so they would have to be $ \pm 1$. Since $ \alpha_1+\alpha_2 = a_1$ (again from multiplying out and examining the coefficient of $ x$) we would then know $ a_1=0$. In either case there is no stationary solution.

Qualitative features: it is possible to prove that the solutions of these Yule-Walker equations decay to 0 at a geometric rate, meaning that they satisfy $ \vert\rho_X(h)\vert \le a^{\vert h\vert}$ for some $ a\in (0,1)$. However, for general $ p$ the explicit formulas are not simple.

Periodic Processes

If $ Z_1,Z_2$ are iid $ N(0,\sigma^2)$ then we saw that

$\displaystyle X_t = Z_1 \cos(\omega t) + Z_2 \sin(\omega t)
$

is a strictly stationary process with mean 0 and autocorrelation $ \rho(h) = \cos(\omega h)$. Thus the autocorrelation is perfectly periodic.
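A short simulation sketch (with an arbitrary choice of $ \omega$) shows the sample autocorrelation of a single trajectory tracking $ \cos(\omega h)$ up to end effects:

import numpy as np

rng = np.random.default_rng(1)
omega, T = 0.5, 5_000
t = np.arange(T)
z1, z2 = rng.standard_normal(2)                      # Z_1, Z_2 iid N(0, 1)
x = z1 * np.cos(omega * t) + z2 * np.sin(omega * t)

xc = x - x.mean()
c0 = np.dot(xc, xc) / T
for h in (0, 3, 6, 9, 12):
    ch = np.dot(xc[:T - h], xc[h:]) / T
    print(h, round(ch / c0, 3), round(np.cos(omega * h), 3))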

Linear Superposition

If $ X$ and $ Y$ are jointly stationary then $ Z=aX+bY$ is stationary and

$\displaystyle C_Z(h) = a^2 C_X(h)+b^2 C_Y(h) +ab(C_{XY}(h)+C_{YX}(h))
$

Thus you could hope, for example, to recognize a periodic component of a series by looking for a periodic component in the plotted autocorrelation.
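For example, here is a sketch (all parameters arbitrary) in which an independent periodic component is added to an AR(1); independence makes the cross-covariances vanish, so the sample ACF shows a cosine wave superposed on a geometric decay:

import numpy as np

rng = np.random.default_rng(2)
T, omega, rho = 5_000, 0.5, 0.6
t = np.arange(T)
z1, z2 = rng.standard_normal(2)
x = z1 * np.cos(omega * t) + z2 * np.sin(omega * t)   # periodic component
eps = rng.standard_normal(T)
y = np.zeros(T)
for s in range(1, T):
    y[s] = rho * y[s - 1] + eps[s]                    # independent AR(1)
z = x + y                                             # superposition, a = b = 1

zc = z - z.mean()
c0 = np.dot(zc, zc) / T
acf = [np.dot(zc[:T - h], zc[h:]) / T / c0 for h in range(25)]
print(np.round(acf, 2))   # cosine wave riding on a geometric decay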

Periodic versus AR processes

In fact you can make AR processes which behave very much like periodic processes. Consider the process

$\displaystyle X_t = X_{t-1} -aX_{t-2}+\epsilon_t
$

[Graphs of trajectories and autocorrelations for $ a=0.3$, $ 0.6$, $ 0.9$, and $ 0.99$ appeared here.]
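A sketch along the following lines regenerates the essential feature numerically (plotting is omitted and sample ACFs are printed instead; the sample size and seed are arbitrary):

import numpy as np

rng = np.random.default_rng(3)
T = 4_000

def sample_acf(x, hmax):
    """Sample autocorrelations with divisor T (see the estimation section)."""
    xc = x - x.mean()
    c0 = np.dot(xc, xc) / len(x)
    return np.array([np.dot(xc[:len(x) - h], xc[h:]) / len(x) / c0
                     for h in range(hmax + 1)])

for a in (0.3, 0.6, 0.9, 0.99):
    eps = rng.standard_normal(T)
    x = np.zeros(T)
    for t in range(2, T):
        x[t] = x[t - 1] - a * x[t - 2] + eps[t]   # X_t = X_{t-1} - a X_{t-2} + eps_t
    print(a, np.round(sample_acf(x, 10), 2))      # waves damp more slowly as a -> 1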

You should observe the slow decay of the waves in the autocovariances, particularly for $ a$ near 1. When $ a=1$ the characteristic polynomial is $ 1-x+x^2$ which has roots

$\displaystyle \frac{1 \pm i\sqrt{3}}{2}
$

Both these roots have modulus $ 1$ so there is no stationary trajectory with $ a=1$. The point is that some $ AR$ processes have nearly periodic components.

To get more insight consider the differential equation describing a sine wave:

$\displaystyle \frac{d^2}{dx^2} f(x) = -\omega^2 f(x) \, ;
$

the general solution is $ f(x) = a\sin(\omega x + \phi)$. If we replace the derivative by differences we get the approximation

$\displaystyle \frac{d^2}{dx^2} f(x) \approx \frac{f(x+h) - 2f(x)+f(x-h)}{h^2}
$

so that

$\displaystyle \frac{f(x+h) - 2f(x)+f(x-h)}{h^2} \approx -\omega^2 f(x)
$

Take $ h=1$ in the approximation and reorganize to get

$\displaystyle f(x+1) = (2-\omega^2) f(x) -f(x-1)
$

If we add noise, change notation to $ t=x+1$ and replace the letter $ f$ by $ X$ we get

$\displaystyle X_t = (2-\omega^2) X_{t-1} - X_{t-2} +\epsilon_t
$

This is formalism only; there is no stationary solution of this equation. However, we see that $ AR(2)$ processes are at least analogous to the solutions of second order differential equations with added noise.
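Simulating the equation anyway (a sketch; the values of $ \omega$ and the noise scale are arbitrary) illustrates both points: the path oscillates at roughly frequency $ \omega$ while its amplitude wanders, since the roots lie on the unit circle:

import numpy as np

rng = np.random.default_rng(4)
omega, T = 0.2, 2_000
eps = 0.05 * rng.standard_normal(T)
x = np.zeros(T)
for t in range(2, T):
    x[t] = (2 - omega**2) * x[t - 1] - x[t - 2] + eps[t]

# Rough frequency check: count sign changes to estimate the period.
crossings = np.count_nonzero(np.diff(np.sign(x)) != 0)
print("observed period ~", 2 * T / max(crossings, 1),
      " vs 2*pi/omega =", 2 * np.pi / omega)
# No stationary solution: Var(X_t) grows with t (unit-modulus roots).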

Estimates of $ C$ and $ \rho$

In order to identify suitable $ ARMA$ models using data we need estimates of $ C$ and $ \rho$. If we knew that $ \mu=0$ we would see that

$\displaystyle C_X(h) =$   Cov$\displaystyle (X_0,X_h) =$   Cov$\displaystyle (X_1,X_{h+1}) = \cdots
$

We would then be motivated to use

$\displaystyle \hat{C}(h) = \sum_{t=0}^{T-1-h} X_tX_{t+h} / T
$

simply averaging products over all pairs which are $ h$ time units apart. When $ \mu$ is unknown we will often simply use $ \hat\mu=\bar{X}$ and then take

$\displaystyle \hat{C}(h) = \sum_{t=0}^{T-1-h}(X_t - \hat\mu)(X_{t+h}-\hat\mu)/T
$

or, noting that there are only $ T-h$ terms in the sum

$\displaystyle \hat{C}(h) = \sum_{t=0}^{T-1-h}(X_t - \hat\mu)(X_{t+h}-\hat\mu)/(T-h)
$

We then take

$\displaystyle \hat\rho(h) = \hat{C}(h)/\hat{C}(0)
$

(Note, however, that when $ T-h$ is used in the divisor it is technically possible to get a $ \hat\rho$ value which exceeds 1.)
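For reference, here is a direct Python implementation of these estimates (a minimal sketch; the function names are mine, not from the notes):

import numpy as np

def acov(x, h, divisor="T"):
    """Sample autocovariance C-hat(h), with mu-hat = X-bar.
    divisor is "T" or "T-h", matching the two versions above."""
    x = np.asarray(x, dtype=float)
    T = len(x)
    xc = x - x.mean()
    s = np.dot(xc[:T - h], xc[h:])        # sum over the T - h available pairs
    return s / (T if divisor == "T" else T - h)

def acorr(x, h, divisor="T"):
    """Sample autocorrelation rho-hat(h) = C-hat(h) / C-hat(0).
    With divisor "T-h" this can exceed 1 in absolute value."""
    return acov(x, h, divisor) / acov(x, 0, divisor)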


Richard Lockhart
2001-09-30