University of Pavia
Introduction to the ML Estimation of ARMA Processes

Eduardo Rossi
Introduction

We consider the AR(p) model:

Y_t = c + \phi_1 Y_{t-1} + \dots + \phi_p Y_{t-p} + \varepsilon_t, \qquad t = 1, \dots, T, \qquad \varepsilon_t \sim WN(0, \sigma^2)

where y_0, y_{-1}, \dots, y_{1-p} are given.

Notation as a regression model: y_t = z_t' \theta + \varepsilon_t with \theta = (c, \phi_1, \dots, \phi_p)' and z_t = (1, y_{t-1}, \dots, y_{t-p})':

\begin{pmatrix} y_1 \\ \vdots \\ y_T \end{pmatrix}
=
\begin{pmatrix}
1 & y_0 & \cdots & y_{1-p} \\
\vdots & \vdots & & \vdots \\
1 & y_{T-1} & \cdots & y_{T-p}
\end{pmatrix}
\begin{pmatrix} c \\ \phi_1 \\ \vdots \\ \phi_p \end{pmatrix}
+
\begin{pmatrix} \varepsilon_1 \\ \vdots \\ \varepsilon_T \end{pmatrix}

© Eduardo Rossi - Time series econometrics 2011
OLS Estimation of AR(p)

The model is y = Z\theta + \varepsilon. The OLS estimator:

\hat{\theta} = (Z'Z)^{-1} Z'y
= (Z'Z)^{-1} Z'(Z\theta + \varepsilon)
= \theta + (Z'Z)^{-1} Z'\varepsilon
= \theta + \left( \frac{1}{T} Z'Z \right)^{-1} \frac{1}{T} Z'\varepsilon

• OLS is no longer linear in y.
• Hence it cannot be BLUE. In general, OLS is no longer unbiased.
• Small-sample properties are analytically difficult to derive.
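As an illustration (not from the slides), a minimal numpy sketch of the regression form y = Z\theta + \varepsilon: simulate a stable AR(2) and estimate (c, \phi_1, \phi_2) by OLS. All parameter values are my own choices.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulate a stable AR(2): Y_t = c + phi1*Y_{t-1} + phi2*Y_{t-2} + eps_t
c, phi1, phi2, T = 1.0, 0.5, 0.3, 20000
y = np.zeros(T)
for t in range(2, T):
    y[t] = c + phi1 * y[t - 1] + phi2 * y[t - 2] + rng.standard_normal()

# Stack the regressors z_t = (1, y_{t-1}, y_{t-2})' row by row
Z = np.column_stack([np.ones(T - 2), y[1:-1], y[:-2]])
theta_hat = np.linalg.lstsq(Z, y[2:], rcond=None)[0]  # OLS: (Z'Z)^{-1} Z'y
```

In large samples theta_hat is close to (c, \phi_1, \phi_2), consistent with the asymptotic results on the next slide.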
OLS Estimation of AR(p)

If Y_t is a stable AR(p) process and \varepsilon_t is a standard white noise, then the following results hold (Mann and Wald, 1943):

\frac{1}{T} Z'Z \xrightarrow{p} \Gamma

\frac{1}{\sqrt{T}} Z'\varepsilon \xrightarrow{d} N(0, \sigma^2 \Gamma)

Consistency and asymptotic normality then follow from Cramér's theorem:

\sqrt{T}(\hat{\theta} - \theta) \xrightarrow{d} N(0, \sigma^2 \Gamma^{-1})
Impact of autocorrelation on regression results

A necessary condition for the consistency of the OLS estimator with stochastic (but stationary) regressors is that z_t is asymptotically uncorrelated with \varepsilon_t, i.e. \operatorname{plim} \frac{1}{T} Z'\varepsilon = 0:

\operatorname{plim} \hat{\theta} - \theta = \operatorname{plim}\left( \frac{1}{T} Z'Z \right)^{-1} \operatorname{plim} \frac{1}{T} Z'\varepsilon = \Gamma^{-1} \operatorname{plim} \frac{1}{T} Z'\varepsilon

OLS is no longer consistent under autocorrelation of the regression error, since

\operatorname{plim} \frac{1}{T} Z'\varepsilon \neq 0
OLS Estimation - Example

Consider an AR(1) model with first-order autocorrelation of its errors:

Y_t = \phi Y_{t-1} + u_t
u_t = \rho u_{t-1} + \varepsilon_t
\varepsilon_t \sim WN(0, \sigma^2)

such that Z' = [Y_0, \dots, Y_{T-1}]. Then

E\left[ \frac{1}{T} Z'u \right] = \frac{1}{T} \sum_{t=1}^{T} E[Y_{t-1} u_t] = \frac{1}{T} \sum_{t=1}^{T} E\left[ Y_{t-1} \left( \rho (Y_{t-1} - \phi Y_{t-2}) + \varepsilon_t \right) \right]

since u_t = \rho(Y_{t-1} - \phi Y_{t-2}) + \varepsilon_t.
OLS Estimation - Example

E\left[ \frac{1}{T} Z'u \right]
= \rho \left( \frac{1}{T} \sum_{t=1}^{T} E[Y_{t-1}^2] \right)
- \phi\rho \left( \frac{1}{T} \sum_{t=1}^{T} E[Y_{t-1} Y_{t-2}] \right)
+ \left( \frac{1}{T} \sum_{t=1}^{T} E[Y_{t-1} \varepsilon_t] \right)
= \rho \left[ \gamma_y(0) - \phi \gamma_y(1) \right]

where the last term vanishes because Y_{t-1} is uncorrelated with \varepsilon_t, and \gamma_y(h) is the autocovariance function of \{Y_t\}, which can be represented as an AR(2) process.
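A quick simulation (illustrative, not part of the slides) makes the inconsistency visible: with \rho > 0, the OLS estimate of \phi settles well above the true value. Parameter values are my own choices.

```python
import numpy as np

rng = np.random.default_rng(1)

# Y_t = phi*Y_{t-1} + u_t with AR(1) errors u_t = rho*u_{t-1} + eps_t
phi, rho, T = 0.5, 0.6, 100000
eps = rng.standard_normal(T)
u = np.zeros(T)
y = np.zeros(T)
for t in range(1, T):
    u[t] = rho * u[t - 1] + eps[t]
    y[t] = phi * y[t - 1] + u[t]

# OLS of y_t on y_{t-1}: inconsistent here, since plim (1/T) Z'u != 0
phi_hat = (y[:-1] @ y[1:]) / (y[:-1] @ y[:-1])
print(phi_hat)  # well above the true phi = 0.5
```

Setting rho = 0 in the same script restores consistency of phi_hat.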
MLE AR(1)

For the Gaussian AR(1) process,

Y_t = c + \phi Y_{t-1} + \varepsilon_t, \qquad |\phi| < 1, \qquad \varepsilon_t \sim NID(0, \sigma^2)

the joint distribution of Y_T = (Y_1, \dots, Y_T)' is Y_T \sim N(\mu, \Sigma). The observations y \equiv (y_1, y_2, \dots, y_T)' are a single realization of Y_T.
MLE AR(1)

\begin{pmatrix} Y_1 \\ \vdots \\ Y_T \end{pmatrix} \sim N(\mu, \Sigma), \qquad
\mu = \begin{pmatrix} \mu \\ \vdots \\ \mu \end{pmatrix}, \qquad
\Sigma = \begin{pmatrix}
\gamma_0 & \cdots & \gamma_{T-1} \\
\vdots & \ddots & \vdots \\
\gamma_{T-1} & \cdots & \gamma_0
\end{pmatrix}
MLE AR(1)

The p.d.f. of the sample y = (y_1, y_2, \dots, y_T)' is given by the multivariate normal density:

f_Y(y; \mu, \Sigma) = (2\pi)^{-T/2} |\Sigma|^{-1/2} \exp\left\{ -\frac{1}{2} (y - \mu)' \Sigma^{-1} (y - \mu) \right\}

Denoting \Sigma = \sigma_y^2 \Omega with \Omega_{ij} = \phi^{|i-j|}:

\Sigma = \begin{pmatrix}
\gamma_0 & \cdots & \gamma_{T-1} \\
\vdots & \ddots & \vdots \\
\gamma_{T-1} & \cdots & \gamma_0
\end{pmatrix}
= \gamma_0 \begin{pmatrix}
1 & \cdots & \frac{\gamma_{T-1}}{\gamma_0} \\
\vdots & \ddots & \vdots \\
\frac{\gamma_{T-1}}{\gamma_0} & \cdots & 1
\end{pmatrix}
= \sigma_y^2 \begin{pmatrix}
1 & \cdots & \rho(T-1) \\
\vdots & \ddots & \vdots \\
\rho(T-1) & \cdots & 1
\end{pmatrix}
MLE AR(1)

\rho(j) = \phi^j

Collecting the parameters of the model in \theta = (c, \phi, \sigma^2)', the joint p.d.f. becomes:

f_Y(y; \theta) = (2\pi\sigma_y^2)^{-T/2} |\Omega|^{-1/2} \exp\left\{ -\frac{1}{2\sigma_y^2} (y - \mu)' \Omega^{-1} (y - \mu) \right\}

and the sample log-likelihood function is given by:

L(\theta) = -\frac{T}{2}\log(2\pi) - \frac{T}{2}\log(\sigma_y^2) - \frac{1}{2}\log|\Omega| - \frac{1}{2\sigma_y^2} (y - \mu)' \Omega^{-1} (y - \mu)
MLE AR(1): Sequential Factorization

The prediction-error decomposition uses the fact that the \varepsilon_t are independent and identically distributed:

f(\varepsilon_2, \dots, \varepsilon_T) = \prod_{t=2}^{T} f_\varepsilon(\varepsilon_t)

and, by the Markov property:

g_Y(y_T, \dots, y_1) = \left[ \prod_{t=2}^{T} g_{Y_t|Y_{t-1}}(y_t|y_{t-1}) \right] \times g_{Y_1}(y_1)

We assume that the marginal density of Y_1 is Gaussian (like that of \varepsilon_1), with

E[Y_1] = \mu = \frac{c}{1-\phi}, \qquad E[(Y_1 - \mu)^2] = \sigma_y^2 = \frac{\sigma^2}{1-\phi^2}
MLE AR(1)

Since \varepsilon_t = Y_t - (c + \phi Y_{t-1}), then

g_{Y_t|Y_{t-1}}(y_t|y_{t-1}) = f_\varepsilon(y_t|y_{t-1}) = f_\varepsilon(\varepsilon_t), \qquad t = 2, \dots, T
MLE AR(1)

Hence:

g_Y(y_T, \dots, y_1) = \left[ \prod_{t=2}^{T} f_\varepsilon(y_t|y_{t-1}) \right] f_\varepsilon(y_1)

For \varepsilon_t \sim NID(0, \sigma^2), the log-likelihood is given by:

L(\theta) = \sum_{t=2}^{T} \log f_\varepsilon(y_t|y_{t-1}; \theta) + \log f_y(y_1; \theta)
= -\frac{T}{2}\log(2\pi) - \frac{T-1}{2}\log(\sigma^2) - \frac{1}{2\sigma^2} \sum_{t=2}^{T} \varepsilon_t^2
- \frac{1}{2}\log(\sigma_y^2) - \frac{1}{2\sigma_y^2}(y_1 - \mu)^2

with \varepsilon_t = y_t - c - \phi y_{t-1}.
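The joint-density form (with the T x T covariance matrix \Sigma = \sigma_y^2 \Omega) and the prediction-error form above are two ways of writing the same likelihood, which can be checked numerically. A sketch (function names are mine):

```python
import numpy as np

def loglik_matrix(y, c, phi, sigma2):
    # Joint-density form: Y ~ N(mu*1, Sigma), Sigma_ij = gamma0 * phi^|i-j|
    T = len(y)
    mu = c / (1 - phi)
    gamma0 = sigma2 / (1 - phi ** 2)
    idx = np.arange(T)
    Sigma = gamma0 * phi ** np.abs(idx[:, None] - idx[None, :])
    dev = y - mu
    _, logdet = np.linalg.slogdet(Sigma)
    return -0.5 * (T * np.log(2 * np.pi) + logdet + dev @ np.linalg.solve(Sigma, dev))

def loglik_sequential(y, c, phi, sigma2):
    # Prediction-error form: marginal of y_1 plus T-1 Gaussian conditionals
    T = len(y)
    mu = c / (1 - phi)
    s2y = sigma2 / (1 - phi ** 2)
    eps = y[1:] - c - phi * y[:-1]
    ll = -0.5 * (np.log(2 * np.pi * s2y) + (y[0] - mu) ** 2 / s2y)
    return ll - 0.5 * ((T - 1) * np.log(2 * np.pi * sigma2) + eps @ eps / sigma2)

y = np.cos(np.arange(25)) + 2.0  # any sample: the factorization is an identity
```

Both functions return the same value for any data vector and any stationary parameter values.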
MLE AR(1)

where Y_1 \sim N(\mu, \sigma_y^2) with \mu = \frac{c}{1-\phi} and \sigma_y^2 = \frac{\sigma^2}{1-\phi^2}.
Maximization of the exact log likelihood for an AR(1) process must be accomplished numerically.
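A sketch of that numerical maximization, using scipy's Nelder-Mead on the exact log-likelihood above. The simulated data, the starting values, and the log-variance reparameterization are my own choices, not prescriptions from the slides.

```python
import numpy as np
from scipy.optimize import minimize

def neg_exact_loglik(params, y):
    c, phi, log_s2 = params
    if abs(phi) >= 1:          # enforce stationarity
        return np.inf
    s2 = np.exp(log_s2)        # optimize log(sigma^2) to keep it positive
    T = len(y)
    mu = c / (1 - phi)
    s2y = s2 / (1 - phi ** 2)
    eps = y[1:] - c - phi * y[:-1]
    ll = -0.5 * (np.log(2 * np.pi * s2y) + (y[0] - mu) ** 2 / s2y)
    ll -= 0.5 * ((T - 1) * np.log(2 * np.pi * s2) + eps @ eps / s2)
    return -ll

# Simulate a Gaussian AR(1), drawing y_1 from its stationary distribution
rng = np.random.default_rng(3)
c0, phi0, T = 1.0, 0.7, 5000
y = np.empty(T)
y[0] = c0 / (1 - phi0) + rng.standard_normal() / np.sqrt(1 - phi0 ** 2)
for t in range(1, T):
    y[t] = c0 + phi0 * y[t - 1] + rng.standard_normal()

res = minimize(neg_exact_loglik, x0=np.array([0.5, 0.5, 0.1]), args=(y,),
               method="Nelder-Mead")
c_hat, phi_hat, s2_hat = res.x[0], res.x[1], np.exp(res.x[2])
```

With 5000 observations the estimates land close to the true (c, \phi, \sigma^2) = (1.0, 0.7, 1.0).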
MLE AR(p)

Gaussian AR(p):

Y_t = c + \phi_1 Y_{t-1} + \dots + \phi_p Y_{t-p} + \varepsilon_t, \qquad \varepsilon_t \sim NID(0, \sigma^2), \qquad \theta = (c, \phi_1, \dots, \phi_p, \sigma^2)'

Exact MLE. Using the prediction-error decomposition, the joint p.d.f. is given by:

f_Y(y_1, y_2, \dots, y_T; \theta) = \left[ \prod_{t=p+1}^{T} f_\varepsilon(y_t|y_{t-1}; \theta) \right] f_{Y_1,\dots,Y_p}(y_1, \dots, y_p; \theta)

but only the p most recent observations matter:

f_\varepsilon(y_t|y_{t-1}; \theta) = f_\varepsilon(y_t|y_{t-1}, \dots, y_{t-p}; \theta)
MLE AR(p)

The likelihood function for the complete sample is:

f_Y(y_1, y_2, \dots, y_T; \theta) = \left[ \prod_{t=p+1}^{T} f_\varepsilon(y_t|y_{t-1}, \dots, y_{t-p}; \theta) \right] f_y(y_1, \dots, y_p; \theta)

With \varepsilon_t \sim NID(0, \sigma^2),

f_\varepsilon(y_t|y_{t-1}, \dots, y_{t-p}; \theta) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left\{ -\frac{(y_t - c - \phi_1 y_{t-1} - \dots - \phi_p y_{t-p})^2}{2\sigma^2} \right\}

The first p observations are viewed as the realization of a p-dimensional Gaussian variable with moments:

E(Y_p) = \mu_p, \qquad E\left[ (Y_p - \mu_p)(Y_p - \mu_p)' \right] = \Sigma_p
MLE AR(p)

\Sigma_p = \sigma^2 V_p = \begin{pmatrix}
\gamma_0 & \gamma_1 & \cdots & \gamma_{p-1} \\
\gamma_1 & \gamma_0 & \cdots & \gamma_{p-2} \\
\vdots & \vdots & \ddots & \vdots \\
\gamma_{p-1} & \gamma_{p-2} & \cdots & \gamma_0
\end{pmatrix}

f_y(y_1, \dots, y_p; \theta) = (2\pi)^{-p/2} \left| \sigma^{-2} V_p^{-1} \right|^{1/2} \exp\left[ -\frac{(Y_p - \mu_p)' V_p^{-1} (Y_p - \mu_p)}{2\sigma^2} \right]
MLE AR(p)

The log-likelihood is:

L(\theta) = \log f_Y(y_1, y_2, \dots, y_T; \theta)
= \sum_{t=p+1}^{T} \log f_\varepsilon(y_t|y_{t-1}, \dots, y_{t-p}; \theta) + \log f_y(y_1, \dots, y_p; \theta)
= -\frac{T}{2}\log(2\pi) - \frac{T}{2}\log(\sigma^2) + \frac{1}{2}\log\left| V_p^{-1} \right|
- \frac{1}{2\sigma^2}(Y_p - \mu_p)' V_p^{-1} (Y_p - \mu_p)
- \sum_{t=p+1}^{T} \frac{(y_t - c - \phi_1 y_{t-1} - \dots - \phi_p y_{t-p})^2}{2\sigma^2}

The exact MLE follows from:

\hat{\theta} = \arg\max_\theta L(\theta)
MLE AR(p): Conditional MLE = OLS

Take y_p = (y_1, \dots, y_p)' as fixed pre-sample values:

\hat{\theta} = \arg\max_\theta f_{Y_{p+1},\dots,Y_T|Y_1,\dots,Y_p}(y_{p+1}, \dots, y_T|y_p; \theta)
= \arg\max_\theta \prod_{t=p+1}^{T} f_\varepsilon(y_t|y_{t-1}, \dots, y_{t-p}; \theta)

Conditioning on Y_p:

L(\theta) = \log f_{Y_{p+1},\dots,Y_T|Y_1,\dots,Y_p}(y_{p+1}, \dots, y_T|y_p; \theta)
= \sum_{t=p+1}^{T} \log f_\varepsilon(\varepsilon_t|Y_{t-1}; \theta)
= -\frac{T-p}{2}\log(2\pi) - \frac{T-p}{2}\log(\sigma^2) - \frac{1}{2\sigma^2} \sum_{t=p+1}^{T} \varepsilon_t^2
MLE AR(p)

where \varepsilon_t = Y_t - (c + \phi_1 Y_{t-1} + \dots + \phi_p Y_{t-p}). Thus the MLE of (c, \phi_1, \dots, \phi_p) results from minimizing the sum of squared residuals:

\arg\max_{(c,\phi_1,\dots,\phi_p)} L(c, \phi_1, \dots, \phi_p) = \arg\min_{(c,\phi_1,\dots,\phi_p)} \sum_{t=p+1}^{T} \varepsilon_t^2(c, \phi_1, \dots, \phi_p)

The conditional ML estimate of \sigma^2 turns out to be:

\hat{\sigma}^2 = \frac{1}{T-p} \sum_{t=p+1}^{T} \hat{\varepsilon}_t^2
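The equivalence can be verified numerically; a sketch for an AR(1) (the simulated setup and starting values are mine), comparing the numerical maximizer of the conditional log-likelihood with OLS:

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(4)
c0, phi0, T = 0.5, 0.6, 2000
y = np.zeros(T)
for t in range(1, T):
    y[t] = c0 + phi0 * y[t - 1] + rng.standard_normal()

# Conditional log-likelihood, treating y_1 as a fixed pre-sample value
def neg_cond_loglik(params):
    c, phi, log_s2 = params
    s2 = np.exp(log_s2)
    eps = y[1:] - c - phi * y[:-1]
    return 0.5 * ((T - 1) * np.log(2 * np.pi * s2) + eps @ eps / s2)

res = minimize(neg_cond_loglik, x0=np.array([0.1, 0.1, 0.1]), method="Nelder-Mead")

# OLS on y_t = c + phi*y_{t-1} + eps_t gives the same (c, phi)
Z = np.column_stack([np.ones(T - 1), y[:-1]])
theta_ols = np.linalg.lstsq(Z, y[1:], rcond=None)[0]
resid = y[1:] - Z @ theta_ols
s2_ml = resid @ resid / (T - 1)   # conditional MLE of sigma^2, divisor T - p
```

Up to optimizer tolerance, res.x[:2] matches theta_ols, and exp(res.x[2]) matches s2_ml.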
MLE AR(p)

• The ML estimates \tilde{\gamma} = (\tilde{c}, \tilde{\phi}_1, \dots, \tilde{\phi}_p)' are equivalent to the OLS estimates.
• (\tilde{c}, \tilde{\phi}_1, \dots, \tilde{\phi}_p) are consistent estimators if \{Y_t\} is stationary, and \sqrt{T}(\tilde{\gamma} - \gamma) is asymptotically normally distributed.
• The exact ML estimates and the conditional ML estimates have the same large-sample distribution.
MLE AR(p)

Asymptotically equivalent:

• MLE of the mean-adjusted model

Y_t - \mu = \phi_1 (Y_{t-1} - \mu) + \dots + \phi_p (Y_{t-p} - \mu) + \varepsilon_t

where \mu = (1 - \phi_1 - \dots - \phi_p)^{-1} c.

• OLS of (\phi_1, \dots, \phi_p) in the mean-adjusted model, where

\hat{\mu} = \frac{1}{T} \sum_{t=1}^{T} Y_t
MLE AR(p)

• Yule-Walker estimation of (\phi_1, \dots, \phi_p):

\begin{pmatrix} \hat{\phi}_1 \\ \vdots \\ \hat{\phi}_p \end{pmatrix}
=
\begin{pmatrix}
\hat{\gamma}_0 & \cdots & \hat{\gamma}_{p-1} \\
\vdots & \ddots & \vdots \\
\hat{\gamma}_{p-1} & \cdots & \hat{\gamma}_0
\end{pmatrix}^{-1}
\begin{pmatrix} \hat{\gamma}_1 \\ \vdots \\ \hat{\gamma}_p \end{pmatrix}

where

\hat{\gamma}_h = (T-h)^{-1} \sum_{t=h+1}^{T} (y_t - \bar{y})(y_{t-h} - \bar{y})

and

\hat{\mu} = \bar{y} = \frac{1}{T} \sum_{t=1}^{T} y_t
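A minimal sketch of Yule-Walker estimation for a simulated AR(2) (function and variable names are mine), using the autocovariance estimator with divisor T - h as above:

```python
import numpy as np

rng = np.random.default_rng(5)

# Simulate a zero-mean AR(2) and estimate (phi1, phi2) by Yule-Walker
phi1, phi2, T = 0.5, 0.25, 50000
y = np.zeros(T)
for t in range(2, T):
    y[t] = phi1 * y[t - 1] + phi2 * y[t - 2] + rng.standard_normal()

def acov(y, h):
    # gamma_hat(h) with divisor T - h, deviations from the sample mean
    ybar = y.mean()
    return (y[h:] - ybar) @ (y[: len(y) - h] - ybar) / (len(y) - h)

p = 2
g = np.array([acov(y, h) for h in range(p + 1)])
G = np.array([[g[abs(i - j)] for j in range(p)] for i in range(p)])  # Toeplitz
phi_hat = np.linalg.solve(G, g[1:])   # solve the Yule-Walker system
```

In large samples phi_hat is close to (\phi_1, \phi_2) and to the OLS estimates.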
MLE MA(q)

Gaussian MA(q):

Y_t = \mu + \varepsilon_t + \theta_1 \varepsilon_{t-1} + \dots + \theta_q \varepsilon_{t-q}, \qquad \varepsilon_t \sim NID(0, \sigma^2)

Conditional MLE = NLLS. Conditioning on \varepsilon_0 = (\varepsilon_0, \varepsilon_{-1}, \dots, \varepsilon_{1-q})' = 0, we can iterate on:

\varepsilon_t = Y_t - \mu - (\theta_1 \varepsilon_{t-1} + \dots + \theta_q \varepsilon_{t-q})

for t = 1, \dots, T. The conditional log-likelihood is:

L(\theta) = \log f_{Y_T|\varepsilon_0=0}(y_T|\varepsilon_0 = 0; \theta)
= -\frac{T}{2}\log(2\pi) - \frac{T}{2}\log(\sigma^2) - \frac{1}{2\sigma^2} \sum_{t=1}^{T} \varepsilon_t^2

where \theta = (\mu, \theta_1, \dots, \theta_q, \sigma^2)'.
MLE MA(q)

• The MLE of (\mu, \theta_1, \dots, \theta_q) results from minimizing the sum of squared residuals.
• Analytical expressions for the MLE are usually not available due to the highly non-linear first-order conditions.
• MLE requires numerical optimization techniques.
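For an MA(1), the residual recursion plus a generic numerical optimizer gives the conditional MLE; a sketch under an assumed simulation setup (parameter values and starting point are mine):

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(6)

# Simulate an invertible MA(1): Y_t = mu + eps_t + theta1*eps_{t-1}
mu0, th0, T = 2.0, 0.5, 5000
eps = rng.standard_normal(T + 1)
y = mu0 + eps[1:] + th0 * eps[:-1]

def neg_cond_loglik(params):
    mu, th, log_s2 = params
    if abs(th) >= 1:           # conditioning requires invertibility
        return np.inf
    s2 = np.exp(log_s2)
    # Iterate eps_t = y_t - mu - th*eps_{t-1}, conditioning on eps_0 = 0
    e = np.empty(T)
    prev = 0.0
    for t in range(T):
        prev = y[t] - mu - th * prev
        e[t] = prev
    return 0.5 * (T * np.log(2 * np.pi * s2) + e @ e / s2)

res = minimize(neg_cond_loglik, x0=np.array([0.1, 0.1, 0.1]), method="Nelder-Mead")
mu_hat, th_hat, s2_hat = res.x[0], res.x[1], np.exp(res.x[2])
```

The estimates land close to (\mu, \theta_1, \sigma^2) = (2.0, 0.5, 1.0); the invertibility guard mirrors the requirement on the next slide.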
MLE MA(q)

• Conditioning requires invertibility, i.e. the roots of 1 + \theta_1 z + \theta_2 z^2 + \dots + \theta_q z^q = 0 must lie outside the unit circle. For an MA(1) process:

\varepsilon_t = Y_t - \mu - \theta_1 \varepsilon_{t-1} = (-\theta_1)^t \varepsilon_0 + \sum_{j=0}^{t-1} (-\theta_1)^j \left[ Y_{t-j} - \mu \right]

so the influence of \varepsilon_0 dies out only when |\theta_1| < 1.
MLE ARMA(p,q)

Y_t = c + \phi_1 Y_{t-1} + \dots + \phi_p Y_{t-p} + \varepsilon_t + \theta_1 \varepsilon_{t-1} + \dots + \theta_q \varepsilon_{t-q}, \qquad \varepsilon_t \sim NID(0, \sigma^2)

Conditional MLE = NLLS. Conditioning on Y_0 = (Y_0, Y_{-1}, \dots, Y_{-p+1})' and \varepsilon_0 = (\varepsilon_0, \varepsilon_{-1}, \dots, \varepsilon_{-q+1})' = 0, the sequence \{\varepsilon_1, \varepsilon_2, \dots, \varepsilon_T\} can be calculated from \{Y_1, Y_2, \dots, Y_T\} by iterating on:

\varepsilon_t = Y_t - (c + \phi_1 Y_{t-1} + \dots + \phi_p Y_{t-p}) - (\theta_1 \varepsilon_{t-1} + \dots + \theta_q \varepsilon_{t-q})

for t = 1, \dots, T.
MLE ARMA(p,q)

The conditional log-likelihood is:

L(\theta) = \log f_{Y_T|Y_0,\varepsilon_0}(y_T|y_0, \varepsilon_0; \theta)
= -\frac{T}{2}\log(2\pi) - \frac{T}{2}\log(\sigma^2) - \frac{1}{2\sigma^2} \sum_{t=1}^{T} \varepsilon_t^2

• One option is to set the initial values equal to their expected values:

Y_s = (1 - \phi_1 - \dots - \phi_p)^{-1} c, \qquad s = 0, -1, \dots, -p+1
\varepsilon_s = 0, \qquad s = 0, -1, \dots, -q+1
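The recursion with expected-value initialization can be sketched for an ARMA(1,1); the simulation setup is my own. At the true parameters the recovered residuals behave like the true shocks:

```python
import numpy as np

rng = np.random.default_rng(7)

# Simulate an ARMA(1,1): Y_t = c + phi*Y_{t-1} + eps_t + theta*eps_{t-1}
c0, phi0, th0, T = 1.0, 0.6, 0.3, 10000
eps = rng.standard_normal(T + 1)
y = np.empty(T + 1)
y[0] = c0 / (1 - phi0)        # start at the unconditional mean
for t in range(1, T + 1):
    y[t] = c0 + phi0 * y[t - 1] + eps[t] + th0 * eps[t - 1]
y = y[1:]

def arma11_residuals(y, c, phi, theta):
    # eps_t = y_t - c - phi*y_{t-1} - theta*eps_{t-1},
    # with y_0 = c/(1 - phi) (its expected value) and eps_0 = 0
    e = np.empty(len(y))
    y_prev, e_prev = c / (1 - phi), 0.0
    for t in range(len(y)):
        e[t] = y[t] - c - phi * y_prev - theta * e_prev
        y_prev, e_prev = y[t], e[t]
    return e

e = arma11_residuals(y, c0, phi0, th0)
```

Nesting this function inside a Gaussian log-likelihood and optimizing numerically, as in the MA(q) sketch, gives the conditional MLE.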
MLE ARMA(p,q)

• Box and Jenkins (1976) recommended setting the \varepsilon's to zero but the y's equal to their actual values: the iteration is started at date t = p+1, with Y_1, Y_2, \dots, Y_p set to the observed values and

\varepsilon_p = \varepsilon_{p-1} = \dots = \varepsilon_{p-q+1} = 0