Online publication date: 18 April 2011

This article was downloaded by: [Adelfio, Giada] On: 19 April 2011 Access details: Access Details: [subscription number 936584859] Publisher Taylor & ...

Author: Doreen Nash

0 downloads 0 Views 975KB Size

Report

Download PDF

Recommend Documents

Online publication date: 13 April 2011

Online publication date: 01 April 2010

Online publication date: 18 March 2011 PLEASE SCROLL DOWN FOR ARTICLE

Online publication date: 22 April 2011 PLEASE SCROLL DOWN FOR ARTICLE

Online publication date: 14 April 2011 PLEASE SCROLL DOWN FOR ARTICLE

Online publication date: 11 April 2011 PLEASE SCROLL DOWN FOR ARTICLE

Online publication date: 05 April 2011 PLEASE SCROLL DOWN FOR ARTICLE

Recherchebericht. 18. April 2011

Online publication date: 21 October 2010

Online publication date: 13 December 2010

Online publication date: 10 December 2010

Online publication date: 04 October 2010

Online publication date: 26 November 2010

Online publication date: 28 June 2010

Online publication date: 16 March 2010

Online publication date: 15 October 2010

Online publication date: 02 March 2010

Online publication date: 01 February 2010

Online publication date: 17 May 2010

Online publication date: 14 January 2010

Online publication date: 08 December 2010

Online publication date: 16 December 2010

Online publication date: 11 March 2010

Online publication date: 02 December 2010

This article was downloaded by: [Adelfio, Giada] On: 19 April 2011 Access details: Access Details: [subscription number 936584859] Publisher Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 3741 Mortimer Street, London W1T 3JH, UK

Communications in Statistics - Simulation and Computation

Publication details, including instructions for authors and subscription information: http://www.informaworld.com/smpp/title~content=t713597237

Gamma Kernel Intensity Estimation in Temporal Point Processes

G. Marcona; G. Adelfioa; M. Chiodia a Dipartimento di Scienze Statistiche e Matematiche “S. Vianelli”, Università degli Studi di Palermo, Palermo, Italy Online publication date: 18 April 2011

To cite this Article Marcon, G. , Adelfio, G. and Chiodi, M.(2011) 'Gamma Kernel Intensity Estimation in Temporal Point

Processes', Communications in Statistics - Simulation and Computation, 40: 8, 1146 — 1162 To link to this Article: DOI: 10.1080/03610918.2011.563158 URL: http://dx.doi.org/10.1080/03610918.2011.563158

PLEASE SCROLL DOWN FOR ARTICLE Full terms and conditions of use: http://www.informaworld.com/terms-and-conditions-of-access.pdf This article may be used for research, teaching and private study purposes. Any substantial or systematic reproduction, re-distribution, re-selling, loan or sub-licensing, systematic supply or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The accuracy of any instructions, formulae and drug doses should be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material.

Communications in Statistics—Simulation and Computation® , 40: 1146–1162, 2011 Copyright © Taylor & Francis Group, LLC ISSN: 0361-0918 print/1532-4141 online DOI: 10.1080/03610918.2011.563158

Gamma Kernel Intensity Estimation in Temporal Point Processes

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

G. MARCON, G. ADELFIO, AND M. CHIODI Dipartimento di Scienze Statistiche e Matematiche “S. Vianelli”, Università degli Studi di Palermo, Palermo, Italy In this article, we propose a nonparametric approach for estimating the intensity function of temporal point processes based on kernel estimators. In particular, we use asymmetric kernel estimators characterized by the gamma distribution, in order to describe features of observed point patterns adequately. Some characteristics of these estimators are analyzed and discussed both through simulated results and applications to real data from different seismic catalogs. Keywords Bounded domain; Gamma kernel estimator; Intensity function; Temporal point processes. Mathematics Subject Classiﬁcation 60G55; 62G07.

1. Introduction Nonparametric estimation based on kernel approach is often used to get a functional form of unknown functions in absence of any guidance or constraints from theory. In seismological context Normal (i.e., Gaussian) kernel estimators have been used mostly to describe the spatial component of the seismicity (Choi and Hall, 1998; Jackson and Kagan, 1999). Zhuang et al. (2002) used kernel approaches for the spatial distribution of the background component of ETAS model. A spacetime nonparametric approach based on Gaussian kernel estimators was introduced by Adelﬁo et al. (2006) to deﬁne the intensity function of a clustering point-process for seismic data; some extensions were provided in Adelﬁo et al. (2010) and Adelﬁo (2010). In this article, we provide estimates of the intensity function of temporal point processes based on kernel estimators. In particular, we focus on situations in which the use of symmetric kernel functions could not be appropriate to describe the features of the observed point patterns. For instance, to describe temporal distribution of the triggered seismicity, the use of symmetric kernels (such as the Received July 27, 2010; Accepted February 4, 2011 Address correspondence to G. Adelﬁo, Dipartimento di Scienze Statistiche e Matematiche “S. Vianelli”, Università degli Studi di Palermo, viale delle Scienze, ed. 13, 90128, Palermo, Italy; E-mail: adelﬁ[email protected]

1146

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

Gamma Kernel Intensity In Temporal Point Processes

1147

Normal one) might be not appropriate since strong earthquakes are usually followed by sequences of further events, close both in time and in space, reﬂecting a gradual release of the seismic stress in the source fault area and producing an increasing of the seismic rate. In general, when the natural domain of deﬁnition of a density to be estimated is not the whole real line but a bounded interval, a Gaussian kernel estimator might be biased, although consistency of kernel estimators is well documented when the support of the underlying function is unbounded (Rosenblatt, 1956; Silverman, 1986). In this article, we propose the use of the Gamma kernel estimators introduced by Chen (2000) to describe the intensity function of temporal point processes whose supports are bounded from one end only. We focus on the estimation of the intensity function of an inhomogeneous time point process by using Gamma kernel estimator and comparing results with Normal kernel estimator. In Sec. 2, the deﬁnition of the intensity function for temporal point processes is reviewed, introducing the Normal kernel intensity estimator ( ) and the Gamma kernel intensity estimators ( ) for intensity functions and bandwidth selection procedures for both estimators. In Sec. 3, simulation results concerning ﬁnite sample properties of the estimators for various distributions and parameter values are reported. Empirical studies of the shape of highly skewed seismic distributions based on real datasets are provided in Sec. 4. Some conclusions are ﬁnally provided in Sec. 5.

2. Temporal Intensity Function and Kernel Estimators Point processes can be speciﬁed mathematically in several ways, for instance by considering joint distributions of the counts of points in arbitrary sets or deﬁning a complete intensity function, that is a function of the points history that generalizes the rate function of a Poisson process. More formally, a time point process is a random collection of points, each one representing the time of a single event. Any analytic temporal point process is uniquely characterized by its associated conditional intensity function t t (Daley and Vere-Jones, 2003) deﬁned as the frequency with which events are expected to occur around a particular location in time, conditional on the prior history t of the point process up to time t: t t = lim

t→0

ENt t + t t t

(1)

where t is time increment of time t and ENt t + t t is the history-dependent expected value of occurrence in t t + t . When · does not depend on the past history but just on the current time t, the expression in (1) identiﬁes an inhomogeneous Poisson process (as analyzed in this article), while a constant intensity characterizes homogeneous Poisson processes. Kernel estimators provide a simple and effective tool of ﬁnding structure in datasets without the imposition of a parametric model, with the advantage of being very intuitive and relatively simple to analyze mathematically. Let t1 t2 tN be the N ordered observations on the ﬁxed interval 0 T of an inhomogeneous Poisson process with intensity function as in (1), for which the conditional intensity is independent on the past history, such that t t = t, and t can be regarded as a mean intensity.

1148

Marcon et al.

The natural kernel estimate of t is simply an extension of kernel density estimator, except that there is no division by N , since t is based on an expected number rather than an expected proportion. Therefore, the standard kernel estimator for intensity function is: ˆ = t

N

t ti h

(2)

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

i=1

where is a kernel function which determines the shape of the bumps such that tdt = 1 and h is a positive number usually called the smoothing parameter or bandwidth. The kernel function is usually symmetric and its choice is often considered a problem of less importance than the smoothing parameters. ˆ as an estimator · the mean integrated To measure the global accuracy of · squared error (MISE) and the Kullback–Leibler distance (KLD) are widely used. They are deﬁned by: ˆ = E t ˆ − t 2 dt MISE (3) ˆ = KLD

t log

t dt ˆ t

(4)

As a particular case of (2), the Normal intensity kernel estimator is deﬁned by: N t − t 1 i ˆ t = h i=1 h

(5)

where is a standard Normal density function. The use of Normal kernel functions in some application ﬁelds is actually not the best solution, since in case of a bounded support and asymmetric functions boundary bias could be observed. This problem is due to the use of a ﬁxed kernel which assigns weight outside the support when smoothing is carried out near the boundary (Bouezmarni and Scaillet, 2005). To solve this issue, Gamma kernel estimators for intensity function based on asymmetric kernels with ﬂexible form and location on the non-negative real line are introduced. 2.1. The Gamma Kernel Intensity Estimators The Gamma kernel estimators for density functions have been introduced by Chen (2000); they are based on an asymmetric kernel function deﬁned on + with varying shape and location parameters, since depending on points location. These estimators are not subject to boundary bias and achieve the optimal rate of convergence for the MISE as for functions of the class of positive kernel estimators. The ﬁrst Gamma kernel intensity estimator is deﬁned as in (5) by using the gamma kernel deﬁned by: tti /h exp−t/h (6) hti /h+1 t/h + 1 that is a Gamma density function with thi + 1 and h the shape and scale parameters respectively, such that the ith kernel function has its mode corresponding on the ∗ t ti h =

Gamma Kernel Intensity In Temporal Point Processes

1149

observation ti . This leads to the Gamma kernel intensity estimator ˆ ∗ t, deﬁned by: ˆ ∗ t =

N

∗ t ti h

(7)

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

i=1

It is similar to the normal estimator deﬁned in (5), replacing the symmetric kernel function with the gamma kernel ∗ . To reduce bias effects of the estimator ˆ ∗ t, due to the fact that each observation ti corresponds to the mode of the kernel function rather than its mean, Chen (2000) suggested the use of the kernel t ti h, that is a gamma kernel function characterized by a shape parameter h ti , such that: if ti ≥ 2h t /h h ti = 1i 2 t /h + 1 if 0 ≤ ti < 2h 4 i This provides the following Gamma kernel intensity estimator: ˆ t =

N

t ti h

(8)

i=1

with t ti h =

h ti exp−t/h hh ti h ti

Extending the results for densities by Chen (2000), the estimator (8) has usually better global performance (in terms of MISE) and uses a smaller bandwidth than (7) when h is ﬁxed. Therefore, throughout this article, the Gamma kernel estimator refers to the one deﬁned in (8) unless otherwise speciﬁed. Denoting with f = /N the univariate distribution probability density function which we are trying to estimate, Chen (2000) derived the asymptotic MISE for optimal choice of h for the corresponding Gamma kernel density estimator: 5 MISEopt fˆ ∼ 4/5 4

4/5

2 1/5 −4/5 1 −1/2 xf x dx x fxdx N √ 2 0 0

(9)

In this article, we use an approximated version of the asymptotic formulation of MISE for densities, since we deal with intensities of a inhomogeneous Poisson Process and N is a Poisson random variable with expected value T (see Sec. 3). Therefore, from the fundamental relationships and using the standard ﬁrstorder Taylor approximation for the variance of a product: VarX1 X2 ≈ E 2 X1 VarX2 + E 2 X2 VarX1 then: 2 ˆ ˆ ˆ MSEx = Varx + Biasx

≈ E 2 NVarfˆ x + E 2 fˆ xVarN + T2 Bias2 fˆ x

1150

Marcon et al. = T2 Varfˆ x + E 2 fˆ xT + T2 Bias2 fˆ x = T2 MSEfˆ x + E 2 fˆ xT

Finally, replacing Efˆ x by fx in the second term and integrating on x, we obtain the following approximation: ˆ ≈ 2 MISEfˆ + T MISE T

f 2 xdx

(10)

0

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

2.2. Bandwidth Selection Kernel density estimator requires the speciﬁcation of a bandwidth h. This choice is very important, as it is shown in Figs. 1 and 2, where the individual bumps are shown as well as the estimates of the intensity functions constructed by adding them up, both for the Normal estimator ( ) and the Gamma one ( ). It should be noted the effect of varying the window width: (a) if h is too small then spurious ﬁne structure becomes visible and ˆ has a higher variance; (b) if h is too large then the real nature of the distribution is obscured and ˆ has a lower variance. Although in this article we focus on the choice of the proper kernel function instead of the selection of smoothing parameters, we brieﬂy summarize the properties of the cross-validation based approach (Hall, 1982), that is the one used throughout the article. A generalization of cross-validation is provided in Adelﬁo and Chiodi (2011), by the deﬁnition of the Forward Likelihood-based Predictive (FLP) approach aimed to prediction and accounting for the temporal ordering of observations. The bandwidth selector based on cross-validation approach is among the earliest fully automatic and consistent ones: cross-validation involves partitioning a

Figure 1. Normal Kernel estimates showing individual kernels. Window widths: (a) h = 01, (b) h = 03.

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

Gamma Kernel Intensity In Temporal Point Processes

1151

Figure 2. Gamma Kernel estimates showing individual kernels. Window widths: (a) h = 01, (b) h = 03.

sample of data into complementary subsets, performing the analysis on one subset (called the training set), and validating the analysis on the other subset (called the validation set or testing set). Let ˆ −i ti be the intensity function computed for observation ti and estimated by using the remaining data; then logLh =

N i=1

log ˆ −i ti −

T

ˆ tdt

(11)

0

The likelihood cross-validated selection of h is the value of h which maximize the function (11). A considerable amount has been written about selection of bandwidth for density estimation, but often the proposed methods depend on the unknown density (e.g., Silverman, 1978), implying restrictive limits to the accuracy of any possible data-driven bandwidth selector. The cross-validation based methods are the most interesting automatic methods of choosing a bandwidth (Hall, 1983; Stone, 1984); in particular, while crossvalidation based on quadratic Kullback–Leibler loss was considered by Bowman et al. (1984) and (Rudemo, 1982), cross-validation for density estimation has been shown to be asymptotically optimal (Hall, 1983) rather then simply consistent (Chow et al., 1983). However, the method is often computationally expensive because of the large number of the training process iterations, and sometimes the theoretical performance of this bandwidth selector is somewhat disappointing (Hall and Marron, 1987); nevertheless, when the loss function is chosen correctly, crossvalidation can be relied on to perform well for large samples (Bowman et al., 1984).

3. Simulation Results A simulation study was designed and carried out to investigate the performance of the Gamma kernel estimators (both (7) and (8)) and the Normal one (5) to

1152

Marcon et al.

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

estimate the intensity functions deﬁned in different scenarios. We simulated an inhomogeneous Poisson process with a smoothed true intensity function, named true t and sample size n,obtained generating a Poisson random variable N with T assigned parameter T = 0 true tdt. Therefore, the intensity function is deﬁned by true t = ftT , where ft is one of the six distributions described below, with either bounded or unbounded support and either one or two modes: (a) Gamma mixture obtained by mixing the distributions 1 1 and 3 2 with mixing proportion 21 ; (b) Gamma density with shape = 1 and scale = 2; (c) Gamma density with shape = 3 and scale = 1; (d) Normal density with mean = 5 and standard deviation = 1; (e) Pareto density with shape = 5 and scale = 1; (f) Normal mixture obtained by mixing the distributions 5 1 and 8 2 with mixing proportion 21 . For each of the above cases for the theoretical intensity function true t we used three values of T (100, 200, 500) and for each combination of true t and T we generated 100 simulated samples. For each simulation, the bandwidth is chosen as the one that maximizes logL(h) in (11). We do not report results relative to the ﬁrst Gamma kernel estimator because simulations conﬁrm the better performance of the estimator (8) than (7), as mentioned in Sec. 2.1; therefore, we focus on the comparison between the second Gamma kernel estimator and the Normal one. Intensities have been simulated for different values of T 100 200 500, showing an improving performance for increasing sample size. To make ﬁgures less crowded, we provide the simulated intensities together with their average just for the case T = 500 (see the plots in Fig. 3). These plots give an idea of the ﬁnite sample bias of the two estimators: it seems that bias is always smaller for the Gamma kernel estimates (solid lines) in comparison with the Normal kernel estimates (dotted lines) when the intensity functions are highly skewed (in Fig. 3, see plots (a), (b), and (e)). At the same time the Normal kernel function performs well when the real intensities are symmetric and deﬁned on boundless support (in Fig. 3, see plots (c), (d), and (f)). Let us see Figs. 4–5 in order to analyze bias and variance of each estimator in detail: in Fig. 4 the averages of the intensity functions estimated over the 100 simulated samples both using the Gamma and the Normal kernel estimators are showed; Fig. 5 reports the corresponding simulated Mean Squared Errors (MSE’s). Both groups of ﬁgures conﬁrm previous results for almost all the sample sizes considered across the entire range. As shown before, behaviors of the Normal kernel estimator and the true intensity are strongly different in cases (a), (b), and (e); on the other hand, the Gamma kernel estimator exhibits the same behavior both at the boundary and elsewhere as the true intensity function. The main difference between the Gamma and the Normal kernel estimators is showed in Fig. 5 plots (a), (b), and (e), where the simulated MSE’s of the Gamma kernel are much lower near t = 0; whereas plots (c), (d), and (f) in Fig. 5 show similar patterns over the entire support. The mean of the cross-validation log-likelihood for the Gamma and the Normal estimators computed over 100 simulated samples with different sizes and corresponding t-tests for paired samples are reported in Table 1; the Gamma kernel

1153

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

Gamma Kernel Intensity In Temporal Point Processes

Figure 3. Simulated intensity functions over 100 samples with T = 500 by using the Gamma (black solid line) and the Normal kernel estimator (dotted line). The true intensity (grey solid line) is: (a) Gamma mixture; (b) Gamma(1, 2); (c) Gamma(3, 1); (d) Normal(5, 1); (e) Pareto(5, 1); and (f) Normal mixture.

Marcon et al.

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

1154

Figure 4. Simulated mean of the 100 intensity functions with T = 500 obtained by using the Gamma kernel estimator (black solid line) and the Normal kernel estimator (dotted line). The true intensity (grey solid line) is: (a) Gamma mixture; (b) Gamma(1, 2); (c) Gamma(3, 1); (d) Normal(5, 1); (e) Pareto(5, 1); and (f) Normal mixture.

1155

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

Gamma Kernel Intensity In Temporal Point Processes

Figure 5. Simulated MSE to compare Gamma (solid line) and Normal kernel estimator (dotted line) over 100 samples with T = 500. The true intensity is: (a) Gamma mixture; (b) Gamma(1, 2); (c) Gamma(3, 1); (d) Normal(5, 1); (e) Pareto(5, 1); and (f) Normal mixture.

1156

Marcon et al.

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

Table 1 Mean cross-validation log-likelihood for the Gamma kernel estimator (Mean logL ) and the Normal one (Mean logL ) and corresponding t-tests computed over 100 simulated samples with T = 100 200 500 for the 6 theoretical intensity functions (a)–(f) described in Sec. 3 Mean logL

Mean logL

t-Test

p-Value

T = 100

(a) (b) (c) (d) (e) (f)

12086 19336 18031 22551 40650 12330

10924 18624 18037 22693 38695 12403

1612 1507 −034 −1308 970 −1454

0.00 0.00 0.74 0.00 0.00 0.00

T = 200

(a) (b) (c) (d) (e) (f)

38358 51769 49230 58837 93506 38409

36387 50347 49232 58991 88855 38468

1661 1384 −013 −2111 973 −698

0.00 0.00 0.90 0.00 0.00 0.00

T = 500

(a) (b) (c) (d) (e) (f)

139743 176970 167661 192169 282539 139943

136207 173855 167615 192395 272682 140011

1171 1188 172 −1481 839 −739

0.00 0.00 0.09 0.00 0.00 0.00

estimator ﬁts signiﬁcantly better than the Normal one for cases (a), (b), and (e), while for the cases (c), (d), and (f) there are not signiﬁcant differences between the two estimators. Table 2 summarizes the MISE’s deﬁned in (10) and the Kullback–Leibler distances, deﬁned in (4), computed for both estimators over the different simulated intensity functions: we can observe that the performance of the Gamma and the Normal smoothers are close when the true intensity is symmetric whereas their difference increases for highly skewed functions. In other words, the Gamma kernel adequately estimates asymmetric intensity functions and its performance improves for greater sample sizes. The similarity between the Gamma empirical and theoretical MISE (obtained replacing (9) in (10)), is a further result validating our estimation procedures, though the approximation of the results. Asymptotic MISE evaluation for the Normal kernel estimator (Silverman, 1986) is not reported since dealing with densities with bounded domains the assumptions required for its applicability (Rosenblatt, 1956) concerning the continuity of derivatives and symmetry around zero are not satisﬁed by the simulated functions. Finally, a simulation study was carried out to investigate the performance of the Gamma kernel estimator (8) and the Normal one (5) in a very extreme situation where the true intensity function of an inhomogeneous Poisson process is

Gamma Kernel Intensity In Temporal Point Processes

1157

Table 2 Empirical Mean Integrated Squared Error (MISE) and Kullback–Leibler distance (KLD) for the Gamma and the Normal kernel estimators computed over 100 simulated samples with T = 100 200 500 for the 6 theoretical intensity functions (a)–(f) described in Sec. 3. The corresponding asymptotic theoretical MISE for the Gamma kernel estimator is also reported MISE

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

T = 100

T = 200

T = 500

(a) (b) (c) (d) (e) (f) (a) (b) (c) (d) (e) (f) (a) (b) (c) (d) (e) (f)

MISE

KLD

Empirical

Theoretical

Empirical

6158 6120 9760 12122 62343 7243 14582 18355 19478 20501 145162 15178 49980 52834 50964 57354 381951 46025

11929 11517 9365 14104 111704 9077 26785 25715 20958 31563 249869 20430 78331 74704 61047 91941 727454 59919

31056 30361 10519 11546 497754 7988 100506 113582 22419 19326 2407073 16717 463890 624272 75146 52404 12574254 51581

7909 6881 67286 97705 45231 32576 8139 9163 45564 26797 49255 29532 9758 12170 32672 30509 48864 15885

18186 24281 83647 42521 286219 406283 24781 34805 102321 27651 653775 155364 37925 62039 120403 27868 1122038 20229

Figure 6. Simulated mean (on the left) and MSE (on the right) over 100 samples with T = 500 for the Gamma kernel estimator (solid line) and the Normal kernel estimator (dotted line), assuming a step intensity function as in Eq. (12).

1158

Marcon et al.

proportional to a step function such that: 1 0 < t < 10 true t = 5 10 ≤ t ≤ 20

(12)

Simulation results over 100 samples for different values of T 100 200 500 show that both the Normal and Gamma kernel estimators provide similarly results, reﬂecting the hardness of this extreme case, mainly related to the presence of a bounded support and discontinuity points, in correspondence of which very high values of the simulated MSE are obtained (see Fig. 6 for T = 500).

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

4. Application to Real Data The empirical illustration concerns the analysis of four different seismic datasets. The estimation of the time intensity function is carried out for four seismic catalogs of different geographical areas. Each seismic catalog is formed by the space-time coordinates of seismic events. In this article, we focus just on temporal values. (a) Colﬁorito (Fig. 7). The Colﬁorito earthquake sequence (n = 2201) occurred on September 26, 1997, at 2:33 am in the area between Umbria and Marche, near Colﬁorito (source: INGV – Istituto Nazionale di Geoﬁsica e Vulcanologia). The seismic sequence was characterized both by the two mainshocks (2:33 am and 11:40 am, latitude 430 N and longitude 129 E) and by several aftershocks until November 3, 1997. (b) Palermo (Fig. 8). On September 6, 2002, at 3:21 am, a M = 56 earthquake, occurring some tens of kilometers offshore from the Northern Sicilian coast (latitude 137 N and longitude 384 E), slightly damaged the city of Palermo and surroundings. The seismic sequence (the mainshock and the several aftershocks, occurred for more than 80 days) is constituted by 810 events occurred in the

Figure 7. sequence.

Gamma (solid line) and Normal (dotted line) kernel estimates for (a) Colﬁorito

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

Gamma Kernel Intensity In Temporal Point Processes

Figure 8. sequence.

1159

Gamma (solid line) and Normal (dotted line) kernel estimates for (b) Palermo

area 13.18–14.45 E and 37.95–38.95 N; the cluster has been drawn from the Southern Tyrrhenian Sea catalog (source INGV – Istituto Nazionale di Geoﬁsica e Vulcanologia). (c) Landers (Fig. 9). The Landers sequence (n = 751) is drawn from the Southern California catalogs (source: U.S.G.S. – U.S. Geological Survey), that represents earthquakes occurred between January 1984 and June 2004 in a rectangular area around Los Angeles, between longitude −112 and −114 E and latitude 32–37 N; the dataset consists of 6796 earthquakes, with magnitude not smaller than 3.0. Landers is the earthquake identiﬁed by latitude 342 N, longitude

Figure 9. sequence.

Gamma (solid line) and Normal (dotted line) kernel estimates for (c) Landers

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

1160

Figure 10. sequence.

Marcon et al.

Gamma (solid line) and Normal (dotted line) kernel estimates for (d) Japan

−1164 E, magnitude M = 73, and time June 28 1992, 4:57:31 am (Adelﬁo et al., 2006; Schoenberg, 2003). (d) Japan (Fig. 10). We use the hypocenter data recorded by the Japan Meteorological Agency, and consider one of several datasets discussed in Ogata (1998) and Ogata and Zhuang (2006). These data cover the area of the east coast of Tohoku District, one of the most active plate boundary zones in and around Japan. The data (n = 4333) consist of earthquakes of magnitude 4.5 and larger that are chosen from the region 36–42 N and 141–145 E for depths down to 100 km and for the time span 1926–1995. During 1938, the Shioya-Oki great swarm, including several large earthquakes, occurred near the coast of Fukusima Prefecture. We examine the intensity rate changes in time at the epicenter of the largest earthquake of M = 75 that occurred on November 5, 1938, with longitude 14218 E and latitude 3733 N (Adelﬁo and Ogata, 2010).

Table 3 Values of the h and of the log-likelihood estimated for (a) Colﬁorito, (b) Palermo, (c) Landers, and (d) Japan data by cross-validation Gamma Kernel

(a) Colﬁorito (b) Palermo (c) Landers (d) Japan

Normal Kernel

h

logL

h

logL

0024 0019 0193 0174

720202 175955 97415 −13643

0113 0678 10059 3044

733212 171882 61944 −11325

Gamma Kernel Intensity In Temporal Point Processes

1161

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

The Gamma and the Normal kernel estimators are applied to each seismic catalog. The cross-validation approach is used to select the bandwidth parameters. Figures 8 and 9 show that for the Palermo and Landers sequences, the Gamma kernel estimator catches the highest intensity near the origin, whereas the Normal kernel estimator does not. On the other hand, Figs. 7 and 10 show similar ﬁtting of the Gamma and the Normal kernel estimators for Colﬁorito and Japanese data. From these ﬁgures, we can observe that, for highly skewed distributions, the Gamma kernel estimator performs better than the Normal one mostly near the origin, suggesting that its choice could be crucial in those cases where bias effects are relevant. These results are summarized in Table 3, reporting the logL and h values estimated for each sequence.

5. Summary and Conclusion We compared the Gamma and the Normal kernel intensity estimators both when the intensity of the generating process is asymmetric, and deﬁned on a limited support 0 +, and when it is a symmetric function with unbounded domain. For simulated cases, results show that the Gamma kernel estimator is less biased than the Normal one near the origin for limited support 0 +, while the choice between the two competitive estimators is irrelevant when the true intensity is symmetric. This point was also illustrated through a nonparametric estimation of the seismic activity of four real different catalogs, observing again that the Gamma kernel estimator performs better when the time behavior of the sequences is described by a skewed distribution, while both the Gamma and the Normal kernel provide good results in other cases. Therefore, although it is usual to consider the choice of the kernel function less important than the bandwidth selection, there are some situations in which the choice of an appropriate kernel function is a crucial issue. As a direction for future work, we are considering methods for simultaneous choice of bandwidth parameters and kernel functions in space-time point patterns.

Acknowledgment The authors thank the reviewers for their useful comments and suggestions which improved the quality of the article.

References Adelﬁo, G. (2010). Kernel estimation and display of a ﬁve-dimensional conditional intensity function. Nonlinear Processes in Geophysics 17:237–244. Adelﬁo, G., Chiodi, M. (2009). Second-order diagnostics for space-time point processes with application to seismic events. Environmetrics 20:895–911. Adelﬁo, G., Chiodi, M. (2011). Kernel intensity for space-time point processes with application to seismological problems. In: Classiﬁcation and Multivariate Analysis for Complex Data Structures. Series: Studies in Classiﬁcation, Data Analysis, and Knowledge Organization. Heidelberg: Springer-Verlag, pp. 401–408. Adelﬁo, G., Ogata, Y. (2010). Hybrid kernel estimates of space-time earthquake occurrance rates using the etas model. Annals of the Institute of Statistical Mathematics 62:127–143.

Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011

1162

Marcon et al.

Adelﬁo, G., Chiodi, M., De Luca, L., Luzio, D. (2006). Nonparametric clustering of seismic events. In: Data Analysis, Classiﬁcation and the Forward Search. Series: Studies in Classiﬁcation, Data Analysis, and Knowledge Organization. Heidelberg: Springer-Verlag, pp. 397–404. Adelﬁo, G., Chiodi, M., De Luca, L., Luzio, D. (2010). An algorithm for earthquake clustering based on maximum likelihood. In: Data Analysis and Classiﬁcation. Series: Studies in Classiﬁcation, Data Analysis, and Knowledge Organization. New York: Springer, pp. 25–32. Bouezmarni, T., Scaillet, O. (2005). Consistency of asymmetric kernel density estimators and smoothed histograms with application to income data. Technical Report 0306, Interuniversity Attraction Pole. Economic Theory 21:390–412. Bowman, A. W., Hall, P., Titterington, D. M. (1984). Cross-validation in nonparametric estimation of probabilities and probability density. Biometrika Trust 71:341–351. Chen, S. X. (2000). Probability density function estimating using gamma kernels. Annals of the Institute of Statistical Mathematics 52:471–480. Choi, E., Hall, P. (1998). On bias reduction in local linear smoothing. Biometrika 85(2): 333–345. Chow, Y. S., German, S., Wu, L. D. (1983). Consistent cross-validated density estimation. The Annals of Statistics 11:25–38. Daley, D. J., Vere-Jones, D. (2003). An Introduction to the Theory of Point Processes. 2nd ed. New York: Springer-Verlag. Hall, P. (1982). Cross-validation in density estimation. Biometrika 69(2):383–390. Hall, P. (1983). Large samples optimality of least squares cross-validation in density estimation. The Annals of Statistics 11:1156–74. Hall, P., Marron, J. S. (1987). Extent to which least-squares cross-validation minimizes integrated squared error in nonparametric density estimation. Probability Theory Related Fields 74:567–581. Jackson, D. D., Kagan, Y. Y. (1999). Testable earthquake forecasts for 1999. Seismlogical Research Letters 70:393–403. Ogata, Y. (1998). Space-time point-process models for earthquake occurrences. Annals of the Institute of Statistical Mathematics 50(2):379–402. Ogata, Y., Zhuang, J. (2006). Space-time etas model and an improved extension. Tectonophysics 413:13–23. Rosenblatt, M. (21956). Remarks on some nonparametric estimates of a density function. Annals of Mathematical Statistics 27:832–837. Rudemo, M. (1982). Empirical choice of histograms and kernel density estimators. Scandinavian Journal of Statistics 9:65–78. Schoenberg, F. P. (2003). Multi-dimensional residual analysis of point process models for earthquake occurrences. Journal American Statistical Association 98(464):789–795. Silverman, B. W. (1978). Choosing the window width when estimating a density. Biometrika Trust 65:1–11. Silverman, B. W. (1986). Density Estimation for Statistics and Data Analysis. London: Chapman and Hall. Stone, C. J. (1984). An asymptotically optimal window selection rule for kernel density estimates. The Annals of Statistics 12:1285–1297. Zhuang, J., Ogata, Y., Vere-Jones, D. (2002). Stochastic declustering of space-time earthquake occurrences. Journal of the American Statistical Association 97(458):369–379.