Convergence of an Adaptive Newton Algorithm

Int. Journal of Math. Analysis, Vol. 1, 2007, no. 6, 279 - 284

Sanjay Kumar Khattri
Stord/Haugesund University College, Norway
[email protected]

Abstract. The Newton-Krylov iteration is the most prominent iterative method for solving nonlinear systems of equations F(x) = 0. Roughly speaking, the Newton-Krylov iteration consists of solving a series of linear systems (Jacobian systems) of the form J x = b. Solving a nonlinear system of equations is very costly due to the time involved in solving the large Jacobian systems. We define the tolerance of the linear systems J x = b adaptively, based on the accuracy of the global system F(x). We prove the convergence of the method. The reported numerical work shows that the new approach is computationally very efficient.

Mathematics Subject Classification: 90C53, 65B99, 34A34

Keywords: Newton; Krylov; Nonlinear Iteration; Symmetric Jacobian

1 Introduction

This research is concerned with the efficient solution of nonlinear systems of equations with a symmetric Jacobian. Let us consider the nonlinear system $F(x) = 0$. Here, $F$ is a vector function, $F = [F_1, F_2, \ldots, F_n]^T$, and $x = (x_1, x_2, \ldots, x_n)^T$ is the unknown vector. A Newton iteration for solving $F(x) = 0$ is given as

$$J(x_k)\,\Delta x_k = -F(x_k), \qquad (1)$$

$$x_{k+1} = x_k + \Delta x_k, \qquad k = 0, 1, 2, \ldots, m. \qquad (2)$$

Here, equation (1) is referred to as the Newton correction step, and $J = [\partial F_i / \partial x_j]$ is the Jacobian [6, 8]. We assume that the Jacobian is symmetric. To start the above Newton iteration, we need an initial guess $x_0$ for the solution vector $x$. It is known that if the initial guess $x_0$ is close to the exact solution and the Jacobian is invertible, then the above Newton iteration converges quadratically, that is, $\|F(x_{k+1})\| \le C\,\|F(x_k)\|^2$. The most costly part of a Newton iteration is solving the Newton correction step (1). Roughly speaking, the Newton method consists of solving a series of Newton correction steps [7]. Solving equation (1) to a fixed tolerance can be computationally very expensive.
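As an illustration of iteration (1)-(2), the following Python sketch (ours, not from the paper) solves a small nonlinear system with a symmetric Jacobian, using the conjugate gradient method as the inner Krylov solver with a fixed, tight tolerance. The 2x2 test function is hypothetical.

```python
import numpy as np
from scipy.sparse.linalg import cg

# Hypothetical test problem: F(x) = A x + sin(x), whose Jacobian
# J(x) = A + diag(cos(x)) is symmetric positive definite here.
A = np.array([[4.0, 1.0],
              [1.0, 3.0]])

def F(x):
    return A @ x + np.sin(x)

def J(x):
    return A + np.diag(np.cos(x))

def newton(x0, tol=1e-12, max_steps=20):
    """Newton iteration (1)-(2) with CG as the inner linear solver."""
    x = x0.copy()
    for k in range(max_steps):
        Fx = F(x)
        if np.linalg.norm(Fx) < tol:
            break
        # Newton correction step (1), solved to a fixed tight tolerance
        # (the rtol keyword needs SciPy >= 1.12; older versions use tol=).
        dx, info = cg(J(x), -Fx, rtol=1e-12)
        x = x + dx  # update step (2)
    return x

x = newton(np.array([1.0, -1.0]))
print(x, np.linalg.norm(F(x)))  # x is near the root, residual near zero
```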


Let us define the tolerance of the Newton correction steps adaptively. A Newton iteration in which the tolerance of the correction step is defined adaptively is called an adaptive Newton method. Let the residual of the $k$th Newton correction step be $r_k$. Thus, at the $k$th step, we solve

$$J(x_k)\,\Delta x_k = -F(x_k) + r_k. \qquad (3)$$

Let us further assume that after $k$ Newton iterations the residual $r_k$ and the norms of the vectors $F(x_k)$ (residual vector) and $\Delta x_k$ (difference vector) are related as

$$\|r_k\| \le C_1\,\|F(x_k)\|^2 \quad\text{and}\quad \|r_k\| \le C_2\,\|\Delta x_k\|^2. \qquad (4)$$

Then, we prove the following quadratic convergence results for the Newton iteration:

$$\|F(x_{k+1})\| \le C\,\|F(x_k)\|^2 \quad\text{and}\quad \|x_{k+1} - x^\star\| \le C\,\|x_k - x^\star\|^2.$$

Here, $x^\star$ is the exact solution of the nonlinear system $F(x) = 0$.
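A minimal sketch of how condition (4) might drive the inner solver: the CG stopping tolerance at step $k$ is tied to $\|F(x_k)\|^2$, so early steps are solved loosely and cheaply while later steps become accurate enough to preserve quadratic convergence. The constant $C_1$, the safeguard, and the test system are our illustrative choices, not from the paper.

```python
import numpy as np
from scipy.sparse.linalg import cg

A = np.array([[4.0, 1.0],
              [1.0, 3.0]])
F = lambda x: A @ x + np.sin(x)          # hypothetical symmetric-Jacobian system
J = lambda x: A + np.diag(np.cos(x))

def adaptive_newton(x0, C1=1.0, tol=1e-12, max_steps=20):
    x = x0.copy()
    for k in range(max_steps):
        Fx = F(x)
        nF = np.linalg.norm(Fx)
        if nF < tol:
            break
        # Enforce ||r_k|| <= C1 ||F(x_k)||^2 from (4): stop CG once the
        # absolute residual of J dx = -F drops below C1 * nF**2.  The
        # min(...) safeguard keeps the inner solve from exiting with a
        # zero correction while nF is still large.
        dx, _ = cg(J(x), -Fx, atol=min(0.5 * nF, C1 * nF**2), rtol=0.0)
        x = x + dx
    return x

print(adaptive_newton(np.array([1.0, -1.0])))
```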

Let us first show that if the Jacobian matrix is symmetric and Lipschitz continuous, then its inverse is bounded. For a symmetric matrix $A$, there exists a number $k > 0$ such that the following two inequalities are equivalent:

$$\|A^{-1}\| \le \frac{1}{k} \quad\text{and}\quad \|A\,v\| \ge k\,\|v\|, \qquad (5)$$

see [9]. Here, $A^{-1}$ is the inverse of the matrix $A$. For a Lipschitz continuous matrix function $B$, there exists a number $L > 0$ such that

$$\|B(y) - B(x)\| \le L\,\|y - x\|. \qquad (6)$$

Now let us bound the inverse of the Jacobian matrix. For a vector $v$, we can write

$$J(x_k)\,v = J(x_{k+1})\,v + \bigl(J(x_k) - J(x_{k+1})\bigr)\,v. \qquad (7)$$

Using the inequality $\|a + b\| \ge \|a\| - \|b\|$, we get

$$\|J(x_k)\,v\| \ge \|J(x_{k+1})\,v\| - \bigl\|\bigl(J(x_k) - J(x_{k+1})\bigr)\,v\bigr\|. \qquad (8)$$

Using the inequality (5), and also the matrix norm inequality $\|A\,x\| \le \|A\|\,\|x\|$,

$$\|J(x_k)\,v\| \ge k\,\|v\| - \|J(x_k) - J(x_{k+1})\|\,\|v\|. \qquad (9)$$

Using the Lipschitz continuity of the Jacobian given by equation (6), we get

$$\|J(x_k)\,v\| \ge k\,\|v\| - L\,\|x_k - x_{k+1}\|\,\|v\| \qquad (10)$$

$$\ge \bigl(k - L\,\|x_k - x_{k+1}\|\bigr)\,\|v\|. \qquad (11)$$

Since the Jacobian is symmetric, using the equivalence (5) the inverse of the Jacobian is bounded as

$$\|J(x_k)^{-1}\| \le \frac{1}{k - L\,\|x_k - x_{k+1}\|}, \qquad (12)$$

provided that $k > L\,\|x_k - x_{k+1}\|$.
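The equivalence (5) is easy to check numerically: for a symmetric nonsingular matrix, the spectral norm of the inverse equals the reciprocal of the smallest eigenvalue magnitude. A small sanity check of our own, with an arbitrary symmetric test matrix:

```python
import numpy as np

rng = np.random.default_rng(0)
B = rng.standard_normal((5, 5))
A = B + B.T + 10.0 * np.eye(5)  # symmetric and safely nonsingular

k = np.abs(np.linalg.eigvalsh(A)).min()         # k = min |eigenvalue of A|
inv_norm = np.linalg.norm(np.linalg.inv(A), 2)  # spectral norm of A^{-1}
print(np.isclose(inv_norm, 1.0 / k))            # (5): ||A^{-1}|| = 1/k

# ...and ||A v|| >= k ||v|| holds for any vector v:
v = rng.standard_normal(5)
print(np.linalg.norm(A @ v) >= k * np.linalg.norm(v) - 1e-12)
```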


2 Convergence of the Adaptive Newton Method

From the multidimensional mean value lemma,

$$\|F(x) - F(y) - J(y)\,(x - y)\| \le \frac{l}{2}\,\|x - y\|^2. \qquad (13)$$

By the equations (2) and (3),

$$x_{k+1} = x_k - J(x_k)^{-1}\,\bigl[F(x_k) + r_k\bigr].$$

Combining the mean value lemma (13) and the above equation,

$$\bigl\|F(x_{k+1}) - F(x_k) + J(x_k)\,J(x_k)^{-1}\,\bigl(F(x_k) + r_k\bigr)\bigr\| \le \frac{l}{2}\,\bigl\|J(x_k)^{-1}\,\bigl(F(x_k) + r_k\bigr)\bigr\|^2,$$

and since $J\,J^{-1} = I$,

$$\|F(x_{k+1}) + r_k\| \le \frac{l}{2}\,\bigl\|J(x_k)^{-1}\,\bigl(F(x_k) + r_k\bigr)\bigr\|^2.$$

Using the triangle inequality $\|x\| \le \|x + y\| + \|y\|$, and then the assumption $\|r_k\| \le C_1\,\|F(x_k)\|^2$ from (4),

$$\begin{aligned}
\|F(x_{k+1})\| &\le \frac{l}{2}\,\bigl\|J(x_k)^{-1}\,(F(x_k) + r_k)\bigr\|^2 + \|r_k\|\\
&\le \frac{l}{2}\,\|J(x_k)^{-1}\|^2\,\|F(x_k) + r_k\|^2 + \|r_k\|\\
&\le \frac{l}{2}\,\|J(x_k)^{-1}\|^2\,\bigl(\|F(x_k)\| + \|r_k\|\bigr)^2 + \|r_k\|\\
&\le \frac{l}{2}\,\|J(x_k)^{-1}\|^2\,\bigl(\|F(x_k)\|^2 + \|r_k\|^2 + 2\,\|F(x_k)\|\,\|r_k\|\bigr) + \|r_k\|\\
&\le \frac{l}{2}\,\|J(x_k)^{-1}\|^2\,\bigl(\|F(x_k)\|^2 + C_1^2\,\|F(x_k)\|^4 + 2\,C_1\,\|F(x_k)\|^3\bigr) + C_1\,\|F(x_k)\|^2\\
&\le \|F(x_k)\|^2\,\Bigl[\frac{l}{2}\,\|J(x_k)^{-1}\|^2\,\bigl(1 + C_1^2\,\|F(x_k)\|^2 + 2\,C_1\,\|F(x_k)\|\bigr) + C_1\Bigr].
\end{aligned}$$

Thus,

$$\|F(x_{k+1})\| \le C\,\|F(x_k)\|^2.$$

This is our first main result. By the fundamental theorem of calculus,

$$F(z) - F(x) = \int_0^1 J\bigl[x + t\,(z - x)\bigr]\,(z - x)\,dt. \qquad (14)$$

By the equations (2) and (3),

$$x_{k+1} - x^\star = x_k - J(x_k)^{-1}\,\bigl[F(x_k) + r_k\bigr] - x^\star,$$


where $x^\star$ is the exact solution of the system $F(x) = 0$. Since $F(x^\star) = 0$,

$$x_{k+1} - x^\star = (x_k - x^\star) + J(x_k)^{-1}\,\bigl[F(x^\star) - F(x_k)\bigr] - J(x_k)^{-1}\,r_k.$$

Using equation (14),

$$\begin{aligned}
x_{k+1} - x^\star &= (x_k - x^\star) + J(x_k)^{-1} \int_0^1 J\bigl[x_k + t\,(x^\star - x_k)\bigr]\,(x^\star - x_k)\,dt - J(x_k)^{-1}\,r_k\\
&= J(x_k)^{-1} \int_0^1 \bigl[J\bigl(x_k + t\,(x^\star - x_k)\bigr) - J(x_k)\bigr]\,(x^\star - x_k)\,dt - J(x_k)^{-1}\,r_k.
\end{aligned}$$

Taking norms on both sides of the above equation and using the inequalities $\|x + y\| \le \|x\| + \|y\|$ and $\|A\,x\| \le \|A\|\,\|x\|$,

$$\|x_{k+1} - x^\star\| \le \|J(x_k)^{-1}\| \int_0^1 \bigl\|J\bigl(x_k + t\,(x^\star - x_k)\bigr) - J(x_k)\bigr\|\,\|x^\star - x_k\|\,dt + \|J(x_k)^{-1}\|\,\|r_k\|.$$

By the Lipschitz continuity of the Jacobian, $\|J(x_k + t\,(x^\star - x_k)) - J(x_k)\| \le L\,t\,\|x^\star - x_k\|$. We get

$$\begin{aligned}
\|x_{k+1} - x^\star\| &\le \|x^\star - x_k\|\,\|J(x_k)^{-1}\| \int_0^1 L\,t\,\|x^\star - x_k\|\,dt + \|J(x_k)^{-1}\|\,\|r_k\|\\
&\le \frac{L}{2}\,\|x^\star - x_k\|^2\,\|J(x_k)^{-1}\| + \|J(x_k)^{-1}\|\,\|r_k\|. \qquad (15)
\end{aligned}$$

Since $\|r_k\| \le C_2\,\|x^\star - x_k\|^2$,

$$\|x_{k+1} - x^\star\| \le \frac{L}{2}\,\|x^\star - x_k\|^2\,\|J(x_k)^{-1}\| + C_2\,\|J(x_k)^{-1}\|\,\|x^\star - x_k\|^2.$$

Thus,

$$\|x_{k+1} - x^\star\| \le \|x_k - x^\star\|^2\,\|J(x_k)^{-1}\|\,\Bigl(\frac{L}{2} + C_2\Bigr).$$

This is our second main result.
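Both bounds can be observed empirically by tracking the ratios $\|F(x_{k+1})\| / \|F(x_k)\|^2$ and $\|x_{k+1} - x^\star\| / \|x_k - x^\star\|^2$, which should remain bounded as the iteration converges. A sketch on the same hypothetical test system used above, whose exact solution is $x^\star = 0$:

```python
import numpy as np
from scipy.sparse.linalg import cg

A = np.array([[4.0, 1.0],
              [1.0, 3.0]])
F = lambda x: A @ x + np.sin(x)      # F(0) = 0, so x* = 0 exactly
J = lambda x: A + np.diag(np.cos(x))

x, x_star = np.array([1.0, -1.0]), np.zeros(2)
for k in range(8):
    nF, err = np.linalg.norm(F(x)), np.linalg.norm(x - x_star)
    if nF < 1e-14:
        break
    # inner tolerance per (4) with C1 = 1, plus a progress safeguard
    dx, _ = cg(J(x), -F(x), atol=min(0.5 * nF, nF**2), rtol=0.0)
    x = x + dx
    # bounded ratios indicate quadratic convergence
    print(k, np.linalg.norm(F(x)) / nF**2,
          np.linalg.norm(x - x_star) / err**2)
```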

3 Numerical Work

We solve the simplified Poisson-Boltzmann equation (16) on $\Omega = [-1, 1] \times [-1, 1]$ with $k = 1.0$ [3, 4, 5]. Problems with a discontinuity in $\epsilon$ arise in practical applications [4]. The domain $\Omega$ is divided into four equal sub-domains, as shown in Figure 1, based on the medium property $\epsilon$. It should be noted that elliptic problems with discontinuous coefficients can produce very ill-conditioned linear systems.

$$-\operatorname{div}(\epsilon\,\operatorname{grad}\,p) + k\,\sinh(p) = f \quad\text{in}\quad \Omega \quad\text{and}\quad p(x, y) = x^3 + y^3 \quad\text{on}\quad \partial\Omega_D. \qquad (16)$$
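As a rough illustration of how a discretization of (16) couples the outer Newton iteration with inner CG solves, the sketch below uses a uniform finite-difference grid with harmonic averaging of $\epsilon$ at cell faces; the paper itself uses finite volumes [1, 2], and the grid size, zero source term, and stopping tolerances here are our simplifications.

```python
import numpy as np
from scipy.sparse import lil_matrix, diags
from scipy.sparse.linalg import cg

n = 33                                   # hypothetical grid resolution
h = 2.0 / (n - 1)
xs = np.linspace(-1.0, 1.0, n)
X, Y = np.meshgrid(xs, xs, indexing="ij")
eps = np.where(X * Y > 0, 100.0, 1.0)    # eps by quadrant (cf. Figure 1)
kpb = 1.0
f = np.zeros((n, n))                     # placeholder source term

p = X**3 + Y**3                          # initial guess = Dirichlet data x^3 + y^3
idx = -np.ones((n, n), dtype=int)        # numbering of interior unknowns
idx[1:-1, 1:-1] = np.arange((n - 2) ** 2).reshape(n - 2, n - 2)
N = (n - 2) ** 2

def havg(a, b):                          # harmonic average of eps at a face
    return 2.0 * a * b / (a + b)

# Assemble the (linear, symmetric) diffusion operator -div(eps grad .) once.
L = lil_matrix((N, N))
b_bc = np.zeros(N)                       # contribution of fixed boundary values
for i in range(1, n - 1):
    for j in range(1, n - 1):
        r = idx[i, j]
        for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            w = havg(eps[i, j], eps[i + di, j + dj]) / h**2
            L[r, r] += w
            if idx[i + di, j + dj] >= 0:
                L[r, idx[i + di, j + dj]] = -w
            else:
                b_bc[r] += w * p[i + di, j + dj]
L = L.tocsr()

def residual(u):
    """Discrete F(p) on the interior unknowns u."""
    return L @ u - b_bc + kpb * np.sinh(u) - f[1:-1, 1:-1].ravel()

# Outer Newton iteration; J = L + diag(kpb cosh(u)) is symmetric positive
# definite, so CG is an appropriate inner solver.
u = p[1:-1, 1:-1].ravel().copy()
for it in range(25):
    Fu = residual(u)
    nF = np.linalg.norm(Fu)
    if nF < 1e-8:
        break
    Jac = L + diags(kpb * np.cosh(u))
    du, _ = cg(Jac, -Fu, atol=min(0.5 * nF, nF**2), rtol=0.0)  # adaptive inner tolerance
    u = u + du
print("Newton steps:", it, " final ||F||:", nF)
```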


[Figure 1: In the sub-domain $\Omega_i$, $\epsilon = \epsilon_i$, $i = 1, \ldots, 4$; $\epsilon_1 = \epsilon_3 = 1000.0$ and $\epsilon_2 = \epsilon_4 = 1.0$.]

[Figure 2: Computational efficiency of the Quasi-Newton and Newton methods; number of CG iterations versus Newton step.]

[Figure 3: Convergence of the residual vector; $\|F(p_k)\|_{L^2} / \|F(p_0)\|_{L^2}$ versus Newton step $k$, for the Quasi-Newton and Newton methods.]

[Figure 4: Convergence of the difference vector $\Delta p$; $\|\Delta p_k\|_{L^2} / \|\Delta p_0\|_{L^2}$ versus Newton step $k$, for the Quasi-Newton and Newton methods.]

Here, the source function $f$ is

$$f = 2\,y\,(y - 1) + 2\,x\,(x - 1) - 100\,(x - 1)\,y\,(y - 1)\,\exp\bigl[x\,(x - 1)\,y\,(y - 1)\bigr].$$

For solving the linear systems, formed by the method of finite volumes [1, 2], we use the ILU-preconditioned Conjugate Gradient (CG) solver. For the Newton algorithm the tolerance of the CG method is $1.0 \times 10^{-15}$, while for the quasi-Newton method the tolerance of the CG solver varies with the Newton iteration $k$ as $1.0 \times 10^{-(k+1)}$, $k = 0, 1, 2, \ldots, 14$. The distribution of $\epsilon$ is given in Figure 1: in the first and third quadrants of the domain $\epsilon = 100.0$, and in the second and fourth quadrants $\epsilon = 1.0$. Figures 2, 3 and 4 report the outcome of our numerical work. Figures 3 and 4 compare the convergence of the Quasi-Newton and Newton methods, while Figure 2 compares their computational efficiency. In Figures 3 and 4, it is interesting to note that the convergence rate of the two methods is the same. Figure 2 presents the computational work required


by the Quasi-Newton and Newton methods. We observe that our approach requires approximately half the work needed by the Newton method. Thus, even if the initial iterations of the Newton-Krylov algorithm are solved only approximately, the convergence rate of the algorithm remains unaffected, and such an approximation saves a substantial amount of computational effort.
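For reference, the two inner-solver tolerance policies compared in this section can be written down directly from the text; a trivial sketch:

```python
def newton_cg_tol(k: int) -> float:
    """Newton algorithm: fixed, tight CG tolerance at every step."""
    return 1.0e-15

def quasi_newton_cg_tol(k: int) -> float:
    """Quasi-Newton variant: CG tolerance loosened to 10^-(k+1)."""
    return 10.0 ** -(k + 1)

print([quasi_newton_cg_tol(k) for k in range(4)])  # [0.1, 0.01, 0.001, 0.0001]
```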

References

[1] S. K. Khattri, Nonlinear elliptic problems with the method of finite volumes, Differential Equations and Nonlinear Mechanics, (2006).

[2] S. K. Khattri, Analyzing Finite Volume for Single Phase Flow in Porous Media, Journal of Porous Media, (2006).

[3] B. Aksoylu, Adaptive Multilevel Numerical Methods with Applications in Diffusive Biomolecular Reactions, PhD Thesis, The University of California, San Diego, 2001.

[4] F. Fogolari, A. Brigo and H. Molinari, The Poisson-Boltzmann equation for Biomolecular electrostatics: A Tool for Structural Biology, Journal of Molecular Recognition, 15 (2002), 377-392.

[5] M. Holst, R. E. Kozack, F. Saied and S. Subramaniam, Treatment of Electrostatic Effects in Proteins: Multigrid-based Newton Iterative Method for Solution of the Full Nonlinear Poisson-Boltzmann Equation, Proteins: Structure, Function, and Genetics, 18 (1994), 231-245.

[6] C. G. Broyden, J. E. Dennis and J. J. Moré, On the local and superlinear convergence of quasi-Newton methods, J. Inst. Math. Appl., 12 (1973), 223-245.

[7] D.-H. Li and M. Fukushima, A globally and superlinearly convergent Gauss-Newton-based BFGS method for symmetric nonlinear equations, SIAM J. Numer. Anal., 37 (1999), 152-172.

[8] J. E. Dennis and J. J. Moré, Quasi-Newton methods, motivation and theory, SIAM Rev., 19 (1977), 46-89.

[9] G. Strang, Linear Algebra and Its Applications (4th Ed.), Thomson Learning, 2005.

Received: August 31, 2006
