Existence, Uniqueness, and Computational Theory for Time Consistent Equilibria: A Hyperbolic Discounting Example

Kenneth L. Judd
Hoover Institution, Stanford, CA 94305
[email protected]
National Bureau of Economic Research
December, 2003∗

Abstract

We present asymptotically valid analyses of a simple optimal growth model with hyperbolic discounting. We use the implicit function theorem for Banach spaces to show that for small hyperbolicity there is a unique solution in the Banach space of consumption functions with bounded derivatives. The proof is constructive and produces both an infinite series characterization and a perturbation method for solving these problems. The solution uses only the contraction properties of the exponential discounting case, suggesting that the techniques can be used for a wide variety of time consistency problems. We also compare the computational procedure implied by our asymptotic analysis to previous methods. In particular, our procedure produces a locally unique solution.

∗I would like to thank Mordecai Kurz, conference participants at the 2003 meeting of the Society for Computational Economics, and seminar participants at the University of Wisconsin for their comments, and Paul Klein and Tony Smith for useful discussions on the KKS procedure.

1 Introduction

Many dynamic decision problems lead to problems of time inconsistency. These include problems of government policy as well as sales of durable goods and consumption decisions under hyperbolic discounting. In general, such problems are dynamic games with special structure. This paper uses the model in Krusell, Kuruscu, and Smith (2002) (KKS) to address issues of existence and uniqueness of time consistent consumption under hyperbolic discounting. The analysis is constructive, leading directly to a perturbation method of solution. While we analyze only a hyperbolic discounting problem, the analysis uses few properties of the KKS problem and revolves around an abstract formulation of the problem. This indicates that the solution technique is applicable in a variety of dynamic strategic contexts.

Multiplicity of equilibria is a common problem in dynamic games. One strategy has been to focus on equilibria with continuous strategies. This has proven particularly powerful in at least one problem of time consistency. Stokey (1981) showed that there exists a continuum of time consistent solutions to the problem of the durable good monopolist. However, she showed that there is a unique solution, the Coase solution, with continuous strategies. More generally, Stanford (1986) and Samuelson and Friedman (1991) (and many others) have explicitly used continuity as a selection criterion. Furthermore, many others have implicitly made continuity restrictions. In particular, numerical solutions to time consistency problems typically examine only continuous strategies. We will use the hyperbolic discounting model


as a laboratory for examining theoretical and computational properties of this selection criterion.

The multiplicity problem arises with hyperbolic discounting. Krusell and Smith (2003) showed that the hyperbolic discounting model of growth often has a continuum of solutions with discontinuous consumption functions. In fact, even the steady state is often indeterminate. The focus on equilibria with continuous consumption functions rules out step function solutions, but there is no reason to believe that it results in uniqueness. This paper will show that continuity will select a unique equilibrium for small deviations from exponential discounting. Furthermore, we will show that this unique equilibrium is as differentiable as the underlying tastes and technology.

Our analysis has two implications for computational approaches to dynamic equilibria. First, our results justify the typical numerical focus on continuous solutions for at least an open set of problems. Second, the constructive nature of our analysis itself suggests a computational approach.

The basic approach of this paper is familiar. We begin with a particular case, exponential discounting, where we know there exists a unique solution. We then examine how the solution changes as we change a parameter representing the hyperbolic deviation from exponential discounting. This approach is the same as that used in comparative statics, comparative dynamics, and determinacy theory for general equilibrium (see Debreu, 1976, and Shannon, 1999). Differentiability plays a key role in those analyses, and will be equally important here. However, we have an infinite-dimensional problem since we must compute savings functions. The key tools in this paper come from calculus in Banach spaces. The major mathematical challenges involve finding an appropriate topology for the analysis and then checking the conditions for the implicit function theorem. Along the way we must solve an unfamiliar equation with variable arguments.
However, the tools are very general. The key fact is that the problem with exponential discounting reduces to analysis of a contraction map. We provide a condition

which implies that a modified contraction property is inherited by problems with nearly exponential discounting. Since the key elements of the analysis are common features of dynamic economic problems, we suspect that the ideas are directly applicable to a wide range of time consistency problems.

Time consistency problems present special numerical challenges, particularly in the context of hyperbolic discounting. For example, many of the solutions in Laibson and Harris (2002) appear to have only discontinuous solutions. We will examine the standard solution methods that have been used by public finance and agricultural economists to find time consistent equilibria of policy games, as well as a recent procedure proposed in KKS. We will show that these methods, all of which are essentially projection methods as defined in Judd (1992), have difficulties that point either to multiplicity of true solutions or to the presence of extraneous solutions to the numerical approximations. These problems with projection methods indicate that computational approaches to solving dynamic strategic problems need to be very careful.

We use our asymptotic theory to present a perturbation method for solving the hyperbolic discounting problem that addresses both the existence and uniqueness issues. This procedure is limited in its applicability, but is promising since it is based on more solid mathematical foundations. Furthermore, we show that it can be used to solve a wide range of hyperbolic discounting problems and, presumably, many other dynamic strategic problems.


2 A Model of Growth with Hyperbolic Discounting

We will examine an optimal growth problem where the planner discounts future utility in a hyperbolic fashion.¹ Suppose that c_t is consumption in period t. The planner at time t = 0 values the future stream of utility according to the infinite sum

U_0 = u(c_0) + β(δu(c_1) + δ²u(c_2) + δ³u(c_3) + · · ·)

whereas the agent at t = 1 values future utility according to the sum

U_1 = u(c_1) + β(δu(c_2) + δ²u(c_3) + · · ·).

In general, the planner at time t discounts utility between t + 1 and t + s + 1 at rate δ^s < 1 but discounts utility between time t and t + s at rate βδ^s. If β = 1 we have the standard discounted utility function.

We will examine only Markov equilibria; that is, we assume that the time t planner believes that future savings follow the process

k_{t+1} = h(k_t)    (1)

for some function h. We will frequently use the corresponding consumption function, defined by C(k) ≡ f(k) − h(k), to economize on notation. By the Markov assumption, we need only consider the problem of the time t = 0 personality. At time t = 0, the time t = 0 self chooses current consumption to solve

h(k) ≡ arg max_x u(f(k) − x) + βδV(x)    (2)

where V(k) is the value to the time t = 0 self of the utility flow of consumption from time t = 1, . . . if the capital stock at time t = 1 is k. Under the

¹See Kuruscu, Krusell, and Smith (2002, 2003) for a more complete description of this model, and Laibson ?? for a more general discussion of hyperbolic discounting problems.


assumption that future selves will follow (1), the value function V(k) is the solution to the equation

V(k) = u(f(k) − h(k)) + δV(h(k))    (3)

which, for any h(k), has a unique solution since the right-hand side of (3) is a contraction operator on value functions V. Furthermore, the solution to (2) satisfies the first-order condition u′(c) = βδV′(f(k) − c). However, in a Markov equilibrium, when capital is k, gross savings must equal h(k) = f(k) − c. We use these equations to define our concept of equilibrium.

Definition 1 A continuously differentiable Markov equilibrium will be a pair of C¹ functions V(k) and h(k) that satisfy the value function equation

V(k) = u(f(k) − h(k)) + δV(h(k)),    (4)

the first-order condition

u′(f(k) − h(k)) = βδV′(h(k)),    (5)

and the global optimality condition

h(k) ≡ arg max_x u(f(k) − x) + βδV(x)    (6)
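For a fixed savings rule h, the right-hand side of (4) is a contraction in V, so V can be computed by successive approximation on a grid. A minimal sketch; the grid, log utility, technology, and linear savings rule below are our illustrative choices, not the paper's calibration:

```python
import numpy as np

def value_given_h(u, f, h, delta, grid, tol=1e-10, max_iter=10_000):
    """Iterate V <- u(f(k) - h(k)) + delta * V(h(k)) on a grid,
    using linear interpolation to evaluate V off-grid."""
    V = np.zeros_like(grid)
    flow = u(f(grid) - h(grid))          # one-period payoff, fixed across iterations
    knext = h(grid)                      # next period's capital at each grid point
    for _ in range(max_iter):
        V_new = flow + delta * np.interp(knext, grid, V)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new
    return V

# Illustrative primitives (assumptions for this sketch only)
delta = 0.95
grid = np.linspace(0.3, 1.5, 241)
u = np.log
f = lambda k: k**0.3
h = lambda k: 0.5 * f(k)                 # save half of output; maps grid into itself
V = value_given_h(u, f, h, delta, grid)
```

The contraction modulus is δ, so roughly log(tol)/log(δ) iterations suffice regardless of the starting guess.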

There may be multiple Markov equilibria, but we will ignore Markov equilibria with discontinuous h and nondifferentiable V. This definition precisely formulates our equilibrium selection criterion by focusing on smooth value functions and savings functions. This is the assumption explicitly made in Stokey (1981) and implicitly by many other analyses of time consistent equilibria and dynamic games in general.

We will follow KKS and reduce the analysis to a single equation in h(k). This will simplify the exposition but will not affect any substantive result

since we could proceed in the same manner with the pair of equations (4, 5). Differentiating (4) with respect to k implies

V′(k) = u′(f(k) − h(k))(f′(k) − h′(k)) + δV′(h(k))h′(k)    (7)

which also, by substituting h(k) for k, implies

V′(h(k)) = u′(C(h(k)))(f′(h(k)) − h′(h(k))) + δV′(h(h(k)))h′(h(k))    (8)

The first-order condition (5), when applied when the capital stock is h(k), implies

u′(C(h(k))) = βδV′(h(h(k))).    (9)

Combining (7) and (8), using (9) to eliminate V′(h(h(k))), implies the single equation²

u′(f(k) − h(k)) = βδu′(f(h(k)) − h(h(k)))[f′(h(k)) + (1/β − 1)h′(h(k))]    (10)

KKS call equation (10) the Generalized Euler Equation since it eliminates the value function. Note that if β = 1, the case of exponential discounting, (10) does reduce to the usual Euler equation. We will rearrange the terms and define the critical function G for our purposes:

0 = u′(C(k)) − βδu′(C(h(k)))f′(h(k)) − βδu′(C(h(k)))(1/β − 1)h′(h(k))    (11)
  = G(k, h(k), h(h(k)), εh′(h(k)))

²A more complete derivation is

u′(f(k) − h(k)) = δβV′(h(k))
  = δβ[u′(C(h(k)))(f′(h(k)) − h′(h(k))) + δV′(h(h(k)))h′(h(k))]
  = δβ[u′(C(h(k)))(f′(h(k)) − h′(h(k))) + (1/β)u′(C(h(k)))h′(h(k))]
  = βδu′(C(h(k)))[f′(h(k)) + (1/β − 1)h′(h(k))]

where

ε = β⁻¹ − 1

represents the deviation from exponential discounting. When ε = 0 we have ordinary exponential discounting and the unique solution is the conventional optimal consumption function.

I shall work with the Generalized Euler equation. It reduces the equilibrium conditions for the dynamic game to a single equation in a single unknown function and helps keep our exposition simple. However, one could proceed with our analysis using the value function formulation; therefore, the methods below apply even when there is no Generalized Euler equation formulation.

Existence and uniqueness problems arise in this model as they typically do in dynamic games, even when we restrict ourselves to Markov equilibria. Krusell and Smith (2003) prove that there is a continuum of distinct solutions to the equilibrium pair (2, 4). This is similar to Stokey's finding that there is a continuum of solutions to the durable goods monopoly problem. Stokey argues that a continuous solution is a more plausible description of behavior. KKS also implicitly take the view that a continuous solution is more sensible. Stokey proves that there is a unique continuous solution, but KKS provides no such proof of either existence of a continuous solution or uniqueness.

Harris and Laibson (2001) examine a similar savings problem with hyperbolic discounting and prove existence of smooth solutions for small amounts of hyperbolic discounting. However, there are substantial differences between their analysis and the analysis presented below. First, their existence result assumes income uncertainty. This uncertainty is critical to smoothing out the problem and avoiding mathematical difficulties. Since deterministic problems are of substantial interest in general in time consistency problems, we will proceed with developing the tools necessary to analyze this deterministic problem. Also, they prove only that the set of solutions is a semicontinuous correspondence in hyperbolic discounting, whereas we construct a smooth manifold of solutions, one for each value of hyperbolic discounting. The techniques are also different: Harris and Laibson use techniques from the theory of functions of bounded variation, whereas we use calculus methods in Banach spaces.
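The Generalized Euler Equation (10) is easy to check numerically for a candidate savings rule. A hedged sketch: we use log utility with f(k) = k^α and full depreciation, a textbook special case (our choice, not the paper's model) whose exponential-discounting solution h(k) = αδk^α is known in closed form, so the residual should vanish at β = 1:

```python
def gee_residual(k, h, beta, delta, alpha):
    """Residual of the Generalized Euler Equation (10) for u = log, f(k) = k**alpha.
    The candidate rule h(k) must return the pair (h(k), h'(k))."""
    up = lambda c: 1.0 / c                       # u'(c) for log utility
    f  = lambda k: k**alpha
    fp = lambda k: alpha * k**(alpha - 1.0)
    hk, _ = h(k)
    hhk, hp_hk = h(hk)                           # h(h(k)) and h'(h(k))
    lhs = up(f(k) - hk)
    rhs = beta * delta * up(f(hk) - hhk) * (fp(hk) + (1.0 / beta - 1.0) * hp_hk)
    return lhs - rhs

alpha, delta = 0.3, 0.95
# closed-form exponential-discounting rule and its derivative
h_exp = lambda k: (alpha * delta * k**alpha, alpha**2 * delta * k**(alpha - 1.0))
print(gee_residual(1.0, h_exp, 1.0, delta, alpha))   # ~0: exact solution at beta = 1
print(gee_residual(1.0, h_exp, 0.8, delta, alpha))   # nonzero: not an equilibrium at beta < 1
```

The nonzero residual at β < 1 is exactly the deviation the perturbation analysis below expands around.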

3 Mathematical Preliminaries

We will need to use some nonlinear functional analysis to analyze equilibrium in the hyperbolic discounting problem. This section will review the basic definitions and theorems we will use.³ We will work with Banach spaces of functions h : I → R where I = (a, b) is some interval including the steady state of the ε = 0 case, which we denote k∗.

We need to specify an appropriate norm for our purposes. We want to focus on continuous solutions for h, but the presence of h′(h(k)) in (10) implies that we also require differentiability. This implies that conventional norms such as L¹, L², or L^∞ are not appropriate for this problem. The conventional approach in applied mathematics for dealing with the presence of h′ is to work in a Sobolev space, where the notion of a generalized derivative is used. We will not take that approach since we do not want to burden this paper with generalized derivatives. Furthermore, we probably would not be able to get strong uniqueness results, since the step function solutions found in Krusell-Smith (2003) lie in the standard Sobolev spaces.

We will use a generalization of the supremum norm. Let C^m(U, V) denote the space of C^m functions f with domain U ⊂ R and range in V ⊂ R. On this space, we define the norm ‖·‖_m to be

‖f‖_m = max_{0≤i≤m} sup_{x∈U} |D^i f(x)|.    (12)

³We take many of the critical definitions and theorems from Abraham et al. (1983). See Abraham et al. (1983) and Joshi and Bose (1985) for a more thorough discussion of the relevant theorems from calculus on Banach spaces.
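The norm (12) can be approximated on a grid by differencing for the derivatives; a rough numerical sketch, with a helper name of our own, for intuition only:

```python
import numpy as np

def cm_norm(f, a, b, m, n=10_001):
    """Approximate ||f||_m = max_{0<=i<=m} sup_{x in [a,b]} |D^i f(x)|
    by repeated forward differences on a uniform grid."""
    x = np.linspace(a, b, n)
    vals = f(x)
    dx = x[1] - x[0]
    worst = np.max(np.abs(vals))
    for _ in range(m):
        vals = np.diff(vals) / dx        # finite-difference derivative of order i
        worst = max(worst, np.max(np.abs(vals)))
    return worst

# ||sin||_1 on [0, pi]: sup|sin| = 1 and sup|cos| = 1, so the norm is ~1
print(cm_norm(np.sin, 0.0, np.pi, m=1))
```

Note why this norm, and not L², is the right one here: a step function has a small L² distance to a smooth function but an infinite (or huge finite-difference) ‖·‖₁ distance.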

C^m(U, V) is a Banach space with the norm ‖f‖_m. However, it is not a Hilbert space. A Hilbert space approach would replace the supremum norm in (12) with a norm defined by an inner product in an L^p space, and would lead to a Sobolev space. Since the Hilbert structure of a Sobolev space is not needed here, we stay with the Banach space defined by ‖f‖_m. Our space also differs from the space used in Harris and Laibson (2001), who assumed the presence of some uncertainty in the endowment. Our formulation also differs from that in Krusell and Smith (2003), who allow discontinuous consumption functions.

The notion of tangency is essential.

Definition 2 Suppose f and g are functions f, g : U → F where U is an open subset of E, a Banach space with norm ‖·‖. The functions f and g are tangent at x₀ ∈ U if

lim_{x→x₀} ‖f(x) − g(x)‖ / ‖x − x₀‖ = 0.

This notion of tangency implies an important uniqueness property.

Definition 3 Let L(E, F) denote the space of linear maps from E to F with the norm topology. Also, the spaces of linear maps L^m(E, F) are defined inductively by the identities L^m(E, F) = L(E, L^{m−1}(E, F)), m = 2, 3, . . . .

The following fact allows us to define differentiation. See Abraham et al. for a proof.

Lemma 4 For f : U ⊂ E → F and x₀ ∈ U there is at most one linear map L ∈ L(E, F) such that the map g(x) = f(x₀) + L(x − x₀) is tangent to f at x₀.

We now use tangents to define differentiation.

Definition 5 If there is an L ∈ L(E, F) such that f(x₀) + L(x − x₀) is tangent to f at x₀, then we say f is differentiable (a.k.a. Fréchet differentiable) at x₀, and define the derivative of f at x₀ to be Df(x₀) = L.

Definition 6 If f is differentiable at each x₀ ∈ U, then the derivative of f is a map from U to the space of linear maps:

Df : U → L(E, F)
x ↦ Df(x)

Definition 7 If Df : U → L(E, F) is a continuous map then f is C¹(U, F) (i.e., continuously differentiable). As long as the derivatives exist, we define higher derivatives by the inductive formula

D^m f = D(D^{m−1} f) : U ⊂ E → L^m(E, F)

If D^m f exists and is norm continuous we say f is C^m(U, F).

The directional derivative is a related concept.

Definition 8 Let f : U ⊂ E → F and let x ∈ U. We say that f has a derivative in the direction e ∈ E at x if

lim_{t→0} (d/dt) f(x + te)

exists, in which case it is called the directional derivative.

Sometimes a function may have a directional derivative for all directions (that is, it is Gâteaux differentiable) but may not be differentiable. The key fact is that the directional derivative is the intuitive way to compute derivatives of differentiable functions.

Lemma 9 If f is differentiable at x, then the directional derivatives of f exist at x and are given by

lim_{t→0} (d/dt) f(x + te) = Df(x) · e.
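Lemma 9 is also how one computes derivatives in practice: perturb in a direction e and difference. A finite-difference sketch for a simple map on R² (our own example, not from the paper):

```python
import numpy as np

def directional_derivative(f, x, e, t=1e-6):
    """Central-difference estimate of lim_{t->0} (d/dt) f(x + t*e) = Df(x).e"""
    return (f(x + t * e) - f(x - t * e)) / (2 * t)

# f(x, y) = (x*y, x**2); the Jacobian at x is computed by hand for comparison
f = lambda v: np.array([v[0] * v[1], v[0]**2])
x = np.array([1.0, 2.0])
e = np.array([0.3, -0.1])
J = np.array([[x[1], x[0]],
              [2 * x[0], 0.0]])            # Df(x) as a matrix
print(directional_derivative(f, x, e), J @ e)
```

The same recipe applies verbatim when x is a function and f is an operator between function spaces, which is how the derivative of the operator N is computed in Section 4.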


In general, we will just use the Gâteaux approach to compute our derivatives, but our theorems will guarantee that the operators are Fréchet differentiable.

The chain rule is a critical property of our application. It follows from the general result on composite maps.

Theorem 10 (C^m Composite Mapping Theorem) Suppose f : U ⊂ E → V ⊂ F and g : V ⊂ F → G are C^m maps. Then the composite g ◦ f : U ⊂ E → G is also C^m and

D(g ◦ f)(x) · e = Dg(f(x)) · (Df(x) · e)

See Abraham et al. (Box 2.4.A) for the formula for higher-order derivatives of g ◦ f.

The main advantage of the C^m(U, V) norm is the differentiability of the derivative map.

Lemma 11 (Differentiability of Derivative Map) The map

D : C^{m+1}(I, E) → C^m(I, E)
D(f)(x) = f′(x)

is C^m.

One novel feature of the operator we will encounter is the presence of the evaluation map. The evaluation map is a map ev : C^m(I, R) × R → R defined by ev(f, t) = f(t).


Lemma 12 (Evaluation Map Lemma) The evaluation map ev(f, t) defined on C^m(I, R) × R is C^m, and the derivatives are defined by the chain rule and equal

D^k ev(f, t) · ((g₁, s₁), . . . , (g_k, s_k)) = D^k f(t) · (s₁, . . . , s_k) + Σ_{i=1}^{k} D^{k−1} g_i(t) · (s₁, . . . , s_{i−1}, s_{i+1}, . . . , s_k)

for (g_i, s_i) ∈ C^m(I, R) × R, i = 1, . . . , k.

We will use the following lemma on compositions. It is proved by applying the converse to the Taylor theorem (see Abraham et al.).

Lemma 13 (Composition Map Lemma) The map

T : C^m(I, E) × C^m(I, I) → C^m(I, E)
T(f, g)(x) = f(g(x))

is C^m.

The final tool we need is the implicit function theorem. This states that if the linearization of the equation f(x) = y is uniquely invertible then locally so is f; i.e., we can uniquely solve f(x) = y for x as a function of y.

Theorem 14 (Implicit Function Theorem) Let U ⊂ E, V ⊂ F be open and f : U × V → G be C^m, m ≥ 1. For some x₀ ∈ U, y₀ ∈ V assume D₂f(x₀, y₀) : F → G is an isomorphism. Then there are neighborhoods U₀ of x₀ and W₀ of f(x₀, y₀) and a unique C^m map g : U₀ × W₀ → V such that for all (x, w) ∈ U₀ × W₀,

f(x, g(x, w)) = w.


Applying the chain rule to the relation f(x, g(x, w)) = w, one can explicitly compute the derivatives of g:

D₁g(x, w) = −[D₂f(x, g(x, w))]⁻¹ ◦ D₁f(x, g(x, w))    (13)
D₂g(x, w) = [D₂f(x, g(x, w))]⁻¹.

These formulas look familiar from ordinary calculus. However, they may be quite different in practice. In particular, the derivatives in (13) are linear operators in function space, not just Jacobian matrices, and the inversions involve solutions to linear functional equations, not just inversion of Jacobian matrices. The exact details for our hyperbolic discounting problem will be presented below.
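In finite dimensions (13) is just the implicit-differentiation rule; a quick numerical sanity check on the unit circle f(x, y) = x² + y² = 1 (our example, not the paper's):

```python
import math

def implicit_dy_dx(fx, fy):
    """D1 g = -[D2 f]^{-1} D1 f, the scalar case of (13)."""
    return -fx / fy

# f(x, y) = x**2 + y**2 = 1 defines y = g(x) near (0.6, 0.8)
x, y = 0.6, 0.8
slope = implicit_dy_dx(fx=2 * x, fy=2 * y)

# compare with the explicit solution g(x) = sqrt(1 - x**2)
g = lambda x: math.sqrt(1 - x**2)
fd = (g(x + 1e-7) - g(x - 1e-7)) / 2e-7
print(slope, fd)   # both approximately -0.75
```

In the paper's setting the role of [D₂f]⁻¹ is played by the inverse of a linear functional-equation operator, constructed explicitly in the next section.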

4 Local Analysis of the Hyperbolic Discounting Problem

We now establish some critical mathematical facts about the hyperbolic discounting problem. We saw that any equilibrium savings function h satisfies the functional equation

0 = G(k, h(k), h(h(k)), εh′(h(k)))

where G : R⁴ → R was defined in (11). We restate the problem as a functional one. Let I ⊂ R be an open, convex set containing the steady state k∗. Furthermore, choose I so that h̄(I) ⊂ I. We assume that the deterministic equilibrium h̄(k) is locally asymptotically stable. Therefore, such an I exists, since stability implies that h̄(k) is a strict contraction mapping near k∗.

Define the operator

N : X × E → C^m(I, R)
N(h, ε)(k) = G(k, h(k), h(h(k)), εh′(h(k)))

where X ⊂ C^m(I, I) and E = (−ε₀, ε₀) for some ε₀. N is the critical operator for us. We view N as a mapping taking a continuous function h and a scalar ε to another function of k.

The operator N is not defined for all functions h ∈ C^m(I, I). For example, if h(k) > f(k) then the current period's consumption is negative, rendering the Euler equation undefined. However, if ‖h − h̄‖ is sufficiently small, f(k) − h(k) will always be positive. More specifically, the subset X^r ⊂ C^m(I, I) will be a ball of radius r for some r > 0:

X^r = {h : ‖h − h̄‖_m < r}

Lemma 15 Assume G is C^m and that h̄ is C^{m+1}. Then N : X^r × E → C^m(I, R) for X^r ⊂ C^m(I, I) containing h̄ in the topology C^m(I, R) with sufficiently small r.

Proof. Clearly, G(k, h̄(k), h̄(h̄(k)), εh̄′(h̄(k))) exists since h̄(k) > 0 for k ∈ I, and is C^∞. G(k, h(k), h(h(k)), εh′(h(k))) exists if h and h(h(k)) are positive for all k ∈ I. The order m derivatives of G(k, h(k), h(h(k)), εh′(h(k))) with respect to ε and k exist as long as G is C^m and h is C^{m+1}. Therefore, if ‖h − h̄‖_m is sufficiently small then G(k, h(k), h(h(k)), εh′(h(k))) exists and is C^m in (k, ε).

When ε = 0 the problem in (10) is just the ordinary optimal growth problem with exponential discounting, and there is a locally unique h̄(k) such that N(h̄, 0) = 0. The task is to show that there is a unique map Y : (−ε₀, ε₀) → C^m(I, R) such that for all ε ∈ (−ε₀, ε₀), N(Y(ε), ε) = 0. We also want Y(ε) to be differentiable in ε, thereby allowing us to compute Y(ε) via Taylor series expansions. To accomplish this we must apply the implicit function theorem for Banach spaces of functions to N; we need to show that N satisfies the conditions for the IFT.

We next need to show that N(h, ε) is (Fréchet) differentiable with respect


to h at h = h̄ and ε = 0. Rewrite N as

N(h, ε)(k) = G(k, h(k), h(h(k)), εh′(h(k)))
  = G(k, ev(h, k), ev(h, ev(h, k)), ε ev(h′, ev(h, k)))
  = G(k, ev(h, k), ev(h, ev(h, k)), ε ev(D(h), ev(h, k)))

The chain rule, the composition theorem, the omega lemma, and the smoothness of differentiation in the ‖·‖_m norm prove the following result.

Lemma 16 N is C^m in the ‖·‖_m norm.

We now compute the derivative of N with respect to h.

Lemma 17 N_h(h̄, 0) is the linear operator N_h(h̄, 0) : C^{m+1}(I, I) × {0} → C^m(I, R) defined by

(N_h(h̄, 0) · ψ)(k) = A(k)ψ(k) + B(k)ψ(h̄(k))    (14)
A(k) ≡ G₂(k, h̄(k), h̄(h̄(k)), 0) + G₃(k, h̄(k), h̄(h̄(k)), 0) h̄′(h̄(k))
B(k) ≡ G₃(k, h̄(k), h̄(h̄(k)), 0)

and N_ε(h̄, 0) is the linear operator N_ε(h̄, 0) : {h̄} × E → C^m(I, R) defined by

(N_ε(h̄, 0) · ε)(k) = εG₄(k, h̄(k), h̄(h̄(k)), 0) h̄′(h̄(k)) ≡ εC(k)

The last step is to show that the derivative of N(h, ε) with respect to h is invertible in a neighborhood of (h̄, 0). That is, we want to solve the linear operator equation

0 = N_h(h̄, 0) · h_ε + N_ε(h̄, 0)

for the unknown function h_ε. The formal expression for the solution is

h_ε = −N_h(h̄, 0)⁻¹ N_ε(h̄, 0)

but we need to check that N_h(h̄, 0)⁻¹ exists and is unique. That is, we need to show that for every C^m function C(k) there is a function ψ(k) such that

0 = N_h(h̄, 0) · ψ + C(k)

which is equivalent to

0 = A(k)ψ(k) + B(k)ψ(h̄(k)) + C(k)    (15)

This equation looks unusual at first. However, it is really quite familiar. It is linear in the function ψ. To see this define the operator

S(ψ)(k) = A(k)ψ(k) + B(k)ψ(h̄(k)) + C(k)

Then it is clear that

S(α₁ψ₁ + α₂ψ₂) = α₁S(ψ₁) + α₂S(ψ₂)

Let us assume that A(k) is invertible and define D = A⁻¹B. Then the equation has the form

ψ(k) = D(k)ψ(h̄(k)) + C(k)

where C(k) has absorbed the A(k)⁻¹ term, since C(k) is an arbitrary function. This form reveals an iterative nature to the problem and has a natural infinite series solution. By definition

ψ(k) = D(k)ψ(h̄(k)) + C(k)
ψ(h̄(k)) = D(h̄(k))ψ(h̄(h̄(k))) + C(h̄(k))

and so on for ψ(h̄(h̄(k))), . . . . Consider the recursion

ψ(k) = D(k)ψ(h̄(k)) + C(k)
ψ(k) = D(k)[D(h̄(k))ψ(h̄(h̄(k))) + C(h̄(k))] + C(k)
     = . . .
     = Σ_{i=1}^{∞} [Π_{j=0}^{i−1} D(h̄^j(k))] C(h̄^i(k)) + C(k)    (16)

where h̄^i(k) is defined inductively by

h̄⁰(k) = k
h̄¹(k) = h̄(k)
h̄^{i+1}(k) = h̄(h̄^i(k))

This shows that our problem has a natural recursive structure and suggests an infinite series solution. The critical issue is whether D and h̄ interact in a manner which produces a convergent series in (16). We now state the critical theorem.

Theorem 18 Consider the functions A, B, and C in (14). If (i) A(k) is positive for all k, and (ii) the magnitude of A(k)⁻¹B(k) is uniformly less than one for all k, then N_h(h̄, 0) : X → C^{m−1}(I, R) is an invertible C^m operator.

Proof. Consider the equation

A(k)ψ(k) + B(k)ψ(h̄(k)) + C(k) = 0.

We transform this to the equivalent equation

ψ(k) = D(k)ψ(h̄(k)) + C(k).    (17)

where D(k) = A(k)⁻¹B(k) and, without loss of generality, we have replaced C(k) with −A(k)⁻¹C(k). By assumption A(k)⁻¹B(k) exists and has magnitude less than 1.

Assume (i) and (ii). We will show that there is a unique solution in C^m(I, R) to (17). We first show that there is a unique solution in C⁰(I, R). Define

(Tψ)(k) = D(k)ψ(h̄(k)) + C(k)

Furthermore, since h̄(k) is a monotone map, h̄(I) ⊂ I and

max_{k∈I} |ψ₁(h̄(k)) − ψ₂(h̄(k))| ≤ max_{k∈I} |ψ₁(k) − ψ₂(k)|.


Therefore, T is a contraction mapping and the iterates ψ₀ = 0, ψ_{i+1} = Tψ_i converge uniformly to the solution ψ_∞.

Consider the derivative equation (where we assume that ψ′(k) exists) implied by (17):

ψ′(k) = D′(k)ψ(h̄(k)) + D(k)ψ′(h̄(k))h̄′(k) + C′(k)
      = D(k)h̄′(k)ψ′(h̄(k)) + [D′(k)ψ(h̄(k)) + C′(k)]
      = D(k)h̄′(k)ψ′(h̄(k)) + C̃(k)

where C̃(k) is C⁰(I, R). We need to prove that ψ′(k) exists. Define the operator

(T¹φ)(k) = D(k)h̄′(k)φ(h̄(k)) + C̃(k).

Note that φ(h̄(k)) has a coefficient

D(k)h̄′(k)

which has magnitude less than 1 since |D(k)|, |h̄′(k)| < 1. Furthermore, D(k)h̄′(k) is C⁰(I, R) by assumption. Therefore, the sequence φ₀ = 0, φ_{i+1} = T¹φ_i converges to the unique fixed point φ_∞(k) of the derivative equation. Furthermore, since φ_i = (d/dk)ψ_i and the convergence of the φ_i is uniform, we can conclude that φ_∞ = (d/dk)ψ_∞ = (d/dk)ψ(k), proving that ψ′(k) exists and satisfies the derivative equation. This step can be repeated as long as h̄, D, and C have the necessary derivatives. Therefore, the solution ψ_∞ is C^m.

The global contraction properties assumed in Theorem 18 are strong. We next prove a local version of the same result.

Corollary 19 If (i) A(k∗) is nonsingular, and (ii) the magnitude of A(k∗)⁻¹B(k∗) is less than one, then N_h(h̄, 0) : X → C^m(I, R) is an invertible C^m operator for some neighborhood of h̄ in C^m(I, I).


Proof. Since A, B, and D are m-times differentiable, there is a neighborhood of k∗ such that the assumptions of Theorem 18 hold. The conclusions then follow from Theorem 18.

The last result is the infinite series representation of the asymptotic terms.

Corollary 20 Under the assumptions of Theorem 18 or Corollary 19, the solution to (15) has the infinite series representation

ψ(k) = Σ_{i=1}^{∞} [Π_{j=0}^{i−1} D(h̄^j(k))] C(h̄^i(k)) + C(k)    (18)

Proof. This follows directly from the contraction map arguments in the proof of Theorem 18. The infinite series representation holds globally since h̄ is a strictly increasing function with fixed point at k∗. The series also holds for small neighborhoods around k∗ under the assumptions of Corollary 19.

We will also state the multidimensional version of the theorem since that will be important in future generalizations. The proof is the same as above.

Corollary 21 Consider the equation

A(k)ψ(k) + B(k)ψ(h̄(k)) + C(k) = 0    (19)

for C^m functions A, B, C : I^n → I^n. If (i) k∗ ∈ I^n and h̄(k∗) = k∗, (ii) h̄(I^n) ⊂ I^n, (iii) A(k∗) is nonsingular, and (iv) the spectral radius of A(k∗)⁻¹B(k∗) is less than one, then (19) has a unique C^m solution ψ : I^n → R^n.
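The series (18) is directly computable once D, C, and h̄ are in hand, and truncating at modest depth is accurate because each term is damped by a product of factors of magnitude less than one. A sketch with illustrative choices of D, C, and h̄ (ours, not derived from the model), verifying that the truncated series satisfies ψ(k) = D(k)ψ(h̄(k)) + C(k):

```python
def series_solution(D, C, hbar, k, depth=60):
    """psi(k) = C(k) + sum_{i>=1} (prod_{j<i} D(hbar^j(k))) * C(hbar^i(k)),
    i.e. the series (18), truncated at `depth` terms."""
    psi = C(k)
    prod, kj = 1.0, k
    for _ in range(depth):
        prod *= D(kj)          # accumulate D(hbar^j(k)), j = 0, 1, ...
        kj = hbar(kj)          # advance to hbar^i(k)
        psi += prod * C(kj)
    return psi

# Illustrative functions: hbar contracts toward k* = 1 and |D| < 1 uniformly
hbar = lambda k: 0.5 * (k + 1.0)
D = lambda k: 0.6
C = lambda k: k**2

k = 0.7
psi_k = series_solution(D, C, hbar, k)
residual = psi_k - (D(k) * series_solution(D, C, hbar, hbar(k)) + C(k))
print(abs(residual))   # tiny: the truncation error is O(0.6**depth)
```

This is exactly the contraction iteration from the proof of Theorem 18 unrolled into a sum, which is why condition (ii) (uniform |D| < 1) is what guarantees convergence.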

5 Previous Computational Approaches

Computing equilibrium savings functions in the hyperbolic discounting model presents some difficulties. One can use value function iteration, but that will often take a long time to converge. Convergence of value function iteration is not assured since the problem does not have a contraction property. One would like to linearize around the steady state, a common approach in dynamic economics. Unfortunately, we do not know what the steady state is except for the special case of ε = 0. In this section, we review some previous methods and their strengths and weaknesses.

For specificity, we will examine one particular case of the hyperbolic savings model. We assume

u(c) = log c
f(k) = k + (1/δ − 1)k^.25
δ = .95
β = 1, .95, .9, .85, .8

which has five possibilities for the strength of hyperbolic discounting. We focus on changes in β since that is the hyperbolic discounting parameter. The results are similar for other choices of utility and production functions.
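Before turning to solution methods, it is worth seeing how β distorts the discount weights relative to the exponential case; a small sketch (the helper name is ours):

```python
def discount_weights(beta, delta, horizon):
    """Quasi-hyperbolic weights: 1 on current utility, beta*delta**s on utility
    s >= 1 periods ahead; beta = 1 recovers exponential discounting."""
    return [1.0] + [beta * delta**s for s in range(1, horizon)]

delta = 0.95
for beta in (1.0, 0.95, 0.9, 0.85, 0.8):
    w = discount_weights(beta, delta, 4)
    print(beta, [round(x, 4) for x in w])
```

The kink is entirely between the current period and the next: beyond s = 1 all weights decline at the common rate δ, which is the source of the time inconsistency.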

5.1 Polynomial Approximations

There have been many papers in the public finance and resource economics literature which have solved for time consistent equilibria and for Nash equilibrium policy games. For example, Wright and Williams (1984) computed the impact of the strategic oil reserve when a government is known to impose price controls when oil prices get high. Kotlikoff et al. (1988) compute equilibrium bequest policies. Ha and Sibert (1997) compute Nash equilibrium tax policies between competing countries. Rui and Miranda (1996) compute Nash equilibrium commodity stockpiling policies. Judd (1998) examines a simple problem of time consistent tax policy. These papers all used flexible polynomial methods for computing equilibrium policies. Since they use polynomial approximations, they were searching only for continuous equilibria. Our approach shares that objective.

The problem with these methods is that they are subject to a curse of dimensionality. Our perturbation method does not suffer from as bad a curse of dimensionality. On the other hand, our approach will be local, in contrast to the more global approach in many previous papers.

The polynomial approach can be easily and (apparently) reliably applied to the hyperbolic savings problem. More specifically, we first hypothesize that the solution is approximated by

ĥ(k) = Σ_{i=0}^{n} a_i k^i

where the a_i are unknown coefficients. We then fix the n + 1 coefficients by solving the system of equations

∫ G(k, ĥ(k), ĥ(ĥ(k)), εĥ′(ĥ(k))) φ_j(k) dk = 0,  j = 0, . . . , n    (20)

where the φ_j(k) are linearly independent functions. Essentially, we fix a by projecting the Generalized Euler Equation in n + 1 directions, and the φ_j(k) represent those directions. Specifically, we let φ_j(k) be Chebyshev polynomials and the integral in (20) used Chebyshev quadrature, producing a Chebyshev collocation method (see Judd, 1992, for details). For each problem, we easily found⁴ a degree 31 polynomial for which the maximum Euler equation error was 10⁻¹³ for capital stocks between .25 and 1.75.

For the case of exponential discounting, the steady state capital stock is k∗ = 1. The deviations from k∗ = 1 in the other cases give us some idea about the economic significance of the hyperbolic discounting. The steady state for each problem is listed in Table 1. We see that a value of β = .8 produces very different long-run dynamics.

A Mathematica program on a 1 GHz Pentium machine found a solution for each problem in less than five seconds. 4

22

Table 1: Steady State Capital Stock from Projection Method

    β               1.00    .95     .90     .85     .80
    steady state k  1.00    .858    .727    .607    .499

5.2  The KKS Procedure and the Projection Method

KKS propose a procedure which searches for a steady state implying a reasonable Taylor expansion at that point. The KKS procedure begins with the Generalized Euler equation

    0 = G(k, h(k), h(h(k)), εh'(h(k)))                                      (21)

They want to solve for the steady state k*, which must solve

    0 = G(k*, k*, k*, εh'(k*))                                              (22)

Unfortunately, (22) has two unknowns: k* and h'(k*). They need another equation to pin down the unknowns. They differentiate (21) with respect to k and impose the steady state conditions to arrive at

    0 = G^(1)(k*, k*, k*, εh'(k*), εh''(k*))                                (23)

The new equation (23) does add a condition, but it also produces a new unknown, h''(k*). They continue this differentiation until they arrive at a list of n + 1 equations

    0 = G_0 = G(k*, k*, k*, εh'(k*))
    0 = G_1 = G^(1)(k*, k*, k*, εh'(k*), εh''(k*))
    ...
    0 = G_n = G^(n)(k*, k*, k*, εh'(k*), εh''(k*), ..., εh^(n+1)(k*))       (24)

with n + 2 unknowns, whereupon they append the condition

    0 = h^(n+1)(k*)                                                          (25)

This produces a system of n + 2 equations in n + 2 unknowns. The equations are, however, nonlinear. To solve this system they form the least squares criterion

    KKS = (h^(n+1)(k*))² + Σ_{i=0}^{n} G_i²

and then choose k* and the various derivatives of h(k) at k* to minimize KKS.

There is a problem with the KKS procedure: there may be multiple solutions. Table 2 displays solutions for k* for our specific problem and various orders of approximation. For example, if the order is 1, then we set h''(k*) = 0 to fix the steady state. The multiplicity of solutions in Table 2 is not due to numerical error. We reduced (24, 25) to one equation in the unknown k*, which was then solved with 192 digits of decimal precision. Each of the results in Table 2 can be proven to lie within 10^−4 of a different root by application of the intermediate value theorem. Since each solution in Table 2 is at least 10^−4 away from the others, each reported solution represents a distinct solution to the KKS equations. While the number of solutions to the KKS procedure grows with the order of approximation, there is one, k* = 0.728, which is always present and is close to the k* = 0.727 solution found for this case in Table 1. Perhaps the persistence of the k* = 0.728 solution is evidence of its superiority, but I know of no mathematical reason to support that logic.
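The root-isolation argument used here — bracket each candidate in a narrow interval over which the reduced equation changes sign, then invoke the intermediate value theorem — can be sketched as follows. The polynomial F below is purely an illustrative stand-in with roots planted at the order-5 values reported in Table 2; the actual reduced KKS equation in k* is model-specific and not reproduced here:

```python
import numpy as np

# Hypothetical stand-in for the reduced KKS equation in the single unknown k*
roots_true = [0.7281, 0.7518, 0.8187]
F = lambda k: np.prod([k - r for r in roots_true])

def isolate_roots(F, lo, hi, step=1e-4):
    """Scan [lo, hi]; a sign change over a width-`step` cell brackets a root
    by the intermediate value theorem."""
    grid = np.arange(lo, hi, step)
    vals = np.array([F(k) for k in grid])
    brackets = []
    for a, b, fa, fb in zip(grid, grid[1:], vals, vals[1:]):
        if fa == 0.0:
            brackets.append((a, a))
        elif fa * fb < 0:
            brackets.append((a, b))   # F changes sign: a root lies in (a, b)
    return brackets

brackets = isolate_roots(F, 0.7, 0.9)
assert len(brackets) == 3             # three distinct roots, each isolated
```

Since the candidate solutions are separated by more than the scan width, each sign change certifies a distinct root, which is the logic behind the claim that the reported solutions are genuinely different.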

Table 2: KKS Solutions — stable solutions for steady state k

    Approx. Order   Solutions
    2               0.7278
    3               0.7281, 0.7975
    5               0.7281, 0.7518, 0.8187
    10              0.7281, 0.7342, 0.7488, 0.7721, 0.8084, 0.8587

Parameters: α = 1/4; γ = −.01; δ = .95; β = 0.9.

The KKS procedure is an appealing one. Initially, it does not appear to fit into any particular family of methods. It resembles a perturbation method since it creates a Taylor series expansion around the steady state k*, but k* is unknown, whereas perturbation methods based on an implicit function theorem are expansions around a known point. A closer examination of (24, 25) shows that it is basically a projection method. This is because we can rewrite (24, 25) as the system

    0 = ∫ G(k, h(k), h(h(k)), εh'(h(k))) δ_{k*}(k) dk
    0 = ∫ G(k, h(k), h(h(k)), εh'(h(k))) δ'_{k*}(k) dk
    ...
    0 = ∫ G(k, h(k), h(h(k)), εh'(h(k))) δ^(n−1)_{k*}(k) dk
    0 = ∫_{k*−ε}^{k*+ε} G(k, h(k), h(h(k)), εh'(h(k))) dk                   (26)

which is a collection of projections. The first n + 1 projection functions are the Dirac delta function and its derivatives, and the last projection is a simple projection over a small interval around the unknown steady state k*. Given the other projections, this last projection is arbitrarily close to the condition that G_n = 0 for small ε when we restrict the potential solutions to degree n polynomials. This is an unusual set of projection conditions, but there is nothing obviously objectionable. The Dirac delta function and its derivatives may appear to be strange choices for projection directions, but remember that

they are “close” to perfectly smooth functions (recall that a Dirac delta function is like a Normal density with a very small variance, and the derivatives of a Dirac delta function are like the corresponding derivatives of a Normal density), implying that the projections in (26) are close to projections using smooth test functions. Therefore, the difficulties of the KKS procedure are really examples of difficulties that may arise with a projection method. The difficulties encountered by the KKS method give us pause in applying any projection method, including those used in Wright and Williams (1984), Kotlikoff et al. (1988), Ha and Sibert (1997), Rui and Miranda (1996), and Judd (1998). Perhaps the multiplicity reflects multiple solutions to the underlying problem. Perhaps the multiplicity is extraneous, in that the numerical method may produce solutions unrelated to any true solution. It is unclear how common these problems are. Authors who have used projection methods do not report multiplicity problems, but it is difficult to search for all solutions when there are dozens of unknown coefficients to find. Since the KKS procedure reduces to one unknown, it is feasible to search for all its solutions. It thereby gives us a tool to explore multiplicity problems in projection methods more generally. Projection methods, and the related KKS procedure, may often produce good answers. However, the difficulties with these methods give us reason to use a computational implementation of Theorem 18. We next pursue this idea.
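The observation that a Dirac delta behaves like a Normal density with very small variance is easy to verify numerically: projecting a function against an increasingly narrow Gaussian converges to evaluating the function at the center. The test integrand g below is arbitrary:

```python
import numpy as np

k_star = 0.728                       # evaluation point (a steady-state value)
g = lambda k: k ** 3                 # arbitrary smooth test integrand

def gaussian_projection(g, k_star, sigma, n=20001):
    # integral of g(k) * N(k; k_star, sigma^2) over k_star +/- 8 sigma,
    # approximated by a Riemann sum on an even grid
    k = np.linspace(k_star - 8 * sigma, k_star + 8 * sigma, n)
    w = np.exp(-0.5 * ((k - k_star) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))
    return np.sum(g(k) * w) * (k[1] - k[0])

# As sigma shrinks, the projection approaches the point value g(k_star):
# the Gaussian acts as a Dirac delta in the limit.
errs = [abs(gaussian_projection(g, k_star, s) - g(k_star))
        for s in (0.05, 0.005, 0.0005)]
assert errs[0] > errs[1] > errs[2] and errs[2] < 1e-5
```

The error shrinks like σ² (it is governed by g''(k*)σ²/2), which is the sense in which the delta-function projections in (26) are "close" to projections against smooth test functions.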

6  Banach-Space Perturbation Method for Problems with Small (and Large) Hyperbolic Deviations

We next use our asymptotic results to construct a perturbation method for solving the hyperbolic discounting problem. More precisely, we define the function h(k, ε) to satisfy the Generalized Euler equation

    u'(f(k) − h(k, ε)) = βδ u'(f(h(k, ε)) − h(h(k, ε), ε)) (f'(h(k, ε)) + ε h_k(h(k, ε), ε))

We first use standard perturbation methods to compute the Taylor series approximation of the exponential discounting problem

    h(k, 0) ≈ h(k*, 0) + h_k(k*, 0)(k − k*) + h_kk(k*, 0)(k − k*)²/2 + h_kkk(k*, 0)(k − k*)³/6 + ...

up to degree 12. We then move to the hyperbolic discounting terms, h_ε(k*, 0), h_εε(k*, 0), etc. Before we proceed, we must check the stability condition from our theory. For our model, the D(k) term reduces to

    (δ/(1 + δ))² u''(c*)/u'(c*)

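The degree-1 term h_k(k*, 0) of the expansion above is, in the exponential case, the stable root of the quadratic obtained by differentiating the Euler equation at the steady state. A sketch of that computation, assuming the log-utility specification and the f(k) = k + ((1/δ − 1)/α)k^α normalization with steady state k* = 1 from Section 5:

```python
import math

delta, alpha = 0.95, 0.25
k_star = 1.0                                  # exponential-case steady state
c_star = (1.0 / delta - 1.0) / alpha          # c* = f(k*) - k*

up  = 1.0 / c_star                            # u'(c*)  for log utility
upp = -1.0 / c_star ** 2                      # u''(c*)
fpp = (1.0 / delta - 1.0) * (alpha - 1.0)     # f''(k*)

# Differentiating u'(f(k) - h(k)) = delta u'(f(h) - h(h)) f'(h) at k* gives
#   upp*a^2 - (upp*(1 + 1/delta) + delta*up*fpp)*a + upp/delta = 0
# where a = h_k(k*, 0).  The product of the roots equals 1/delta > 1, so
# exactly one root lies inside the unit interval; that root is h_k(k*, 0).
A = upp
B = -(upp * (1.0 + 1.0 / delta) + delta * up * fpp)
Cc = upp / delta
disc = math.sqrt(B * B - 4.0 * A * Cc)
roots = sorted([(-B - disc) / (2.0 * A), (-B + disc) / (2.0 * A)])
a_stable = roots[0]
assert 0.0 < a_stable < 1.0 < roots[1]
print("h_k(k*, 0) =", a_stable)
```

Higher-order terms h_kk, h_kkk, ... then follow recursively by further differentiation, each step solving a linear equation given the lower-order coefficients; the hyperbolic terms h_ε, h_εε, ... are computed the same way once the ε = 0 expansion is in hand.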