This article was downloaded by: [Adelfio, Giada] On: 19 April 2011 Access details: Access Details: [subscription number 936584859] Publisher Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 3741 Mortimer Street, London W1T 3JH, UK
Communications in Statistics - Simulation and Computation
Publication details, including instructions for authors and subscription information: http://www.informaworld.com/smpp/title~content=t713597237
Gamma Kernel Intensity Estimation in Temporal Point Processes
G. Marcona; G. Adelfioa; M. Chiodia a Dipartimento di Scienze Statistiche e Matematiche “S. Vianelli”, Università degli Studi di Palermo, Palermo, Italy Online publication date: 18 April 2011
To cite this Article Marcon, G. , Adelfio, G. and Chiodi, M.(2011) 'Gamma Kernel Intensity Estimation in Temporal Point
Processes', Communications in Statistics - Simulation and Computation, 40: 8, 1146 — 1162 To link to this Article: DOI: 10.1080/03610918.2011.563158 URL: http://dx.doi.org/10.1080/03610918.2011.563158
PLEASE SCROLL DOWN FOR ARTICLE Full terms and conditions of use: http://www.informaworld.com/terms-and-conditions-of-access.pdf This article may be used for research, teaching and private study purposes. Any substantial or systematic reproduction, re-distribution, re-selling, loan or sub-licensing, systematic supply or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The accuracy of any instructions, formulae and drug doses should be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material.
Communications in Statistics—Simulation and Computation® , 40: 1146–1162, 2011 Copyright © Taylor & Francis Group, LLC ISSN: 0361-0918 print/1532-4141 online DOI: 10.1080/03610918.2011.563158
Gamma Kernel Intensity Estimation in Temporal Point Processes
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
G. MARCON, G. ADELFIO, AND M. CHIODI Dipartimento di Scienze Statistiche e Matematiche “S. Vianelli”, Università degli Studi di Palermo, Palermo, Italy In this article, we propose a nonparametric approach for estimating the intensity function of temporal point processes based on kernel estimators. In particular, we use asymmetric kernel estimators characterized by the gamma distribution, in order to describe features of observed point patterns adequately. Some characteristics of these estimators are analyzed and discussed both through simulated results and applications to real data from different seismic catalogs. Keywords Bounded domain; Gamma kernel estimator; Intensity function; Temporal point processes. Mathematics Subject Classification 60G55; 62G07.
1. Introduction Nonparametric estimation based on kernel approach is often used to get a functional form of unknown functions in absence of any guidance or constraints from theory. In seismological context Normal (i.e., Gaussian) kernel estimators have been used mostly to describe the spatial component of the seismicity (Choi and Hall, 1998; Jackson and Kagan, 1999). Zhuang et al. (2002) used kernel approaches for the spatial distribution of the background component of ETAS model. A spacetime nonparametric approach based on Gaussian kernel estimators was introduced by Adelfio et al. (2006) to define the intensity function of a clustering point-process for seismic data; some extensions were provided in Adelfio et al. (2010) and Adelfio (2010). In this article, we provide estimates of the intensity function of temporal point processes based on kernel estimators. In particular, we focus on situations in which the use of symmetric kernel functions could not be appropriate to describe the features of the observed point patterns. For instance, to describe temporal distribution of the triggered seismicity, the use of symmetric kernels (such as the Received July 27, 2010; Accepted February 4, 2011 Address correspondence to G. Adelfio, Dipartimento di Scienze Statistiche e Matematiche “S. Vianelli”, Università degli Studi di Palermo, viale delle Scienze, ed. 13, 90128, Palermo, Italy; E-mail: adelfi
[email protected]
1146
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
Gamma Kernel Intensity In Temporal Point Processes
1147
Normal one) might be not appropriate since strong earthquakes are usually followed by sequences of further events, close both in time and in space, reflecting a gradual release of the seismic stress in the source fault area and producing an increasing of the seismic rate. In general, when the natural domain of definition of a density to be estimated is not the whole real line but a bounded interval, a Gaussian kernel estimator might be biased, although consistency of kernel estimators is well documented when the support of the underlying function is unbounded (Rosenblatt, 1956; Silverman, 1986). In this article, we propose the use of the Gamma kernel estimators introduced by Chen (2000) to describe the intensity function of temporal point processes whose supports are bounded from one end only. We focus on the estimation of the intensity function of an inhomogeneous time point process by using Gamma kernel estimator and comparing results with Normal kernel estimator. In Sec. 2, the definition of the intensity function for temporal point processes is reviewed, introducing the Normal kernel intensity estimator ( ) and the Gamma kernel intensity estimators ( ) for intensity functions and bandwidth selection procedures for both estimators. In Sec. 3, simulation results concerning finite sample properties of the estimators for various distributions and parameter values are reported. Empirical studies of the shape of highly skewed seismic distributions based on real datasets are provided in Sec. 4. Some conclusions are finally provided in Sec. 5.
2. Temporal Intensity Function and Kernel Estimators Point processes can be specified mathematically in several ways, for instance by considering joint distributions of the counts of points in arbitrary sets or defining a complete intensity function, that is a function of the points history that generalizes the rate function of a Poisson process. More formally, a time point process is a random collection of points, each one representing the time of a single event. Any analytic temporal point process is uniquely characterized by its associated conditional intensity function t t (Daley and Vere-Jones, 2003) defined as the frequency with which events are expected to occur around a particular location in time, conditional on the prior history t of the point process up to time t: t t = lim
t→0
ENt t + t t t
(1)
where t is time increment of time t and ENt t + t t is the history-dependent expected value of occurrence in t t + t . When · does not depend on the past history but just on the current time t, the expression in (1) identifies an inhomogeneous Poisson process (as analyzed in this article), while a constant intensity characterizes homogeneous Poisson processes. Kernel estimators provide a simple and effective tool of finding structure in datasets without the imposition of a parametric model, with the advantage of being very intuitive and relatively simple to analyze mathematically. Let t1 t2 tN be the N ordered observations on the fixed interval 0 T of an inhomogeneous Poisson process with intensity function as in (1), for which the conditional intensity is independent on the past history, such that t t = t, and t can be regarded as a mean intensity.
1148
Marcon et al.
The natural kernel estimate of t is simply an extension of kernel density estimator, except that there is no division by N , since t is based on an expected number rather than an expected proportion. Therefore, the standard kernel estimator for intensity function is: ˆ = t
N
t ti h
(2)
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
i=1
where is a kernel function which determines the shape of the bumps such that tdt = 1 and h is a positive number usually called the smoothing parameter or bandwidth. The kernel function is usually symmetric and its choice is often considered a problem of less importance than the smoothing parameters. ˆ as an estimator · the mean integrated To measure the global accuracy of · squared error (MISE) and the Kullback–Leibler distance (KLD) are widely used. They are defined by: ˆ = E t ˆ − t 2 dt MISE (3) ˆ = KLD
t log
t dt ˆ t
(4)
As a particular case of (2), the Normal intensity kernel estimator is defined by: N t − t 1 i ˆ t = h i=1 h
(5)
where is a standard Normal density function. The use of Normal kernel functions in some application fields is actually not the best solution, since in case of a bounded support and asymmetric functions boundary bias could be observed. This problem is due to the use of a fixed kernel which assigns weight outside the support when smoothing is carried out near the boundary (Bouezmarni and Scaillet, 2005). To solve this issue, Gamma kernel estimators for intensity function based on asymmetric kernels with flexible form and location on the non-negative real line are introduced. 2.1. The Gamma Kernel Intensity Estimators The Gamma kernel estimators for density functions have been introduced by Chen (2000); they are based on an asymmetric kernel function defined on + with varying shape and location parameters, since depending on points location. These estimators are not subject to boundary bias and achieve the optimal rate of convergence for the MISE as for functions of the class of positive kernel estimators. The first Gamma kernel intensity estimator is defined as in (5) by using the gamma kernel defined by: tti /h exp−t/h (6) hti /h+1 t/h + 1 that is a Gamma density function with thi + 1 and h the shape and scale parameters respectively, such that the ith kernel function has its mode corresponding on the ∗ t ti h =
Gamma Kernel Intensity In Temporal Point Processes
1149
observation ti . This leads to the Gamma kernel intensity estimator ˆ ∗ t, defined by: ˆ ∗ t =
N
∗ t ti h
(7)
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
i=1
It is similar to the normal estimator defined in (5), replacing the symmetric kernel function with the gamma kernel ∗ . To reduce bias effects of the estimator ˆ ∗ t, due to the fact that each observation ti corresponds to the mode of the kernel function rather than its mean, Chen (2000) suggested the use of the kernel t ti h, that is a gamma kernel function characterized by a shape parameter h ti , such that: if ti ≥ 2h t /h h ti = 1i 2 t /h + 1 if 0 ≤ ti < 2h 4 i This provides the following Gamma kernel intensity estimator: ˆ t =
N
t ti h
(8)
i=1
with t ti h =
h ti exp−t/h hh ti h ti
Extending the results for densities by Chen (2000), the estimator (8) has usually better global performance (in terms of MISE) and uses a smaller bandwidth than (7) when h is fixed. Therefore, throughout this article, the Gamma kernel estimator refers to the one defined in (8) unless otherwise specified. Denoting with f = /N the univariate distribution probability density function which we are trying to estimate, Chen (2000) derived the asymptotic MISE for optimal choice of h for the corresponding Gamma kernel density estimator: 5 MISEopt fˆ ∼ 4/5 4
4/5
2 1/5 −4/5 1 −1/2 xf x dx x fxdx N √ 2 0 0
(9)
In this article, we use an approximated version of the asymptotic formulation of MISE for densities, since we deal with intensities of a inhomogeneous Poisson Process and N is a Poisson random variable with expected value T (see Sec. 3). Therefore, from the fundamental relationships and using the standard firstorder Taylor approximation for the variance of a product: VarX1 X2 ≈ E 2 X1 VarX2 + E 2 X2 VarX1 then: 2 ˆ ˆ ˆ MSEx = Varx + Biasx
≈ E 2 NVarfˆ x + E 2 fˆ xVarN + T2 Bias2 fˆ x
1150
Marcon et al. = T2 Varfˆ x + E 2 fˆ xT + T2 Bias2 fˆ x = T2 MSEfˆ x + E 2 fˆ xT
Finally, replacing Efˆ x by fx in the second term and integrating on x, we obtain the following approximation: ˆ ≈ 2 MISEfˆ + T MISE T
f 2 xdx
(10)
0
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
2.2. Bandwidth Selection Kernel density estimator requires the specification of a bandwidth h. This choice is very important, as it is shown in Figs. 1 and 2, where the individual bumps are shown as well as the estimates of the intensity functions constructed by adding them up, both for the Normal estimator ( ) and the Gamma one ( ). It should be noted the effect of varying the window width: (a) if h is too small then spurious fine structure becomes visible and ˆ has a higher variance; (b) if h is too large then the real nature of the distribution is obscured and ˆ has a lower variance. Although in this article we focus on the choice of the proper kernel function instead of the selection of smoothing parameters, we briefly summarize the properties of the cross-validation based approach (Hall, 1982), that is the one used throughout the article. A generalization of cross-validation is provided in Adelfio and Chiodi (2011), by the definition of the Forward Likelihood-based Predictive (FLP) approach aimed to prediction and accounting for the temporal ordering of observations. The bandwidth selector based on cross-validation approach is among the earliest fully automatic and consistent ones: cross-validation involves partitioning a
Figure 1. Normal Kernel estimates showing individual kernels. Window widths: (a) h = 01, (b) h = 03.
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
Gamma Kernel Intensity In Temporal Point Processes
1151
Figure 2. Gamma Kernel estimates showing individual kernels. Window widths: (a) h = 01, (b) h = 03.
sample of data into complementary subsets, performing the analysis on one subset (called the training set), and validating the analysis on the other subset (called the validation set or testing set). Let ˆ −i ti be the intensity function computed for observation ti and estimated by using the remaining data; then logLh =
N i=1
log ˆ −i ti −
T
ˆ tdt
(11)
0
The likelihood cross-validated selection of h is the value of h which maximize the function (11). A considerable amount has been written about selection of bandwidth for density estimation, but often the proposed methods depend on the unknown density (e.g., Silverman, 1978), implying restrictive limits to the accuracy of any possible data-driven bandwidth selector. The cross-validation based methods are the most interesting automatic methods of choosing a bandwidth (Hall, 1983; Stone, 1984); in particular, while crossvalidation based on quadratic Kullback–Leibler loss was considered by Bowman et al. (1984) and (Rudemo, 1982), cross-validation for density estimation has been shown to be asymptotically optimal (Hall, 1983) rather then simply consistent (Chow et al., 1983). However, the method is often computationally expensive because of the large number of the training process iterations, and sometimes the theoretical performance of this bandwidth selector is somewhat disappointing (Hall and Marron, 1987); nevertheless, when the loss function is chosen correctly, crossvalidation can be relied on to perform well for large samples (Bowman et al., 1984).
3. Simulation Results A simulation study was designed and carried out to investigate the performance of the Gamma kernel estimators (both (7) and (8)) and the Normal one (5) to
1152
Marcon et al.
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
estimate the intensity functions defined in different scenarios. We simulated an inhomogeneous Poisson process with a smoothed true intensity function, named true t and sample size n,obtained generating a Poisson random variable N with T assigned parameter T = 0 true tdt. Therefore, the intensity function is defined by true t = ftT , where ft is one of the six distributions described below, with either bounded or unbounded support and either one or two modes: (a) Gamma mixture obtained by mixing the distributions 1 1 and 3 2 with mixing proportion 21 ; (b) Gamma density with shape = 1 and scale = 2; (c) Gamma density with shape = 3 and scale = 1; (d) Normal density with mean = 5 and standard deviation = 1; (e) Pareto density with shape = 5 and scale = 1; (f) Normal mixture obtained by mixing the distributions 5 1 and 8 2 with mixing proportion 21 . For each of the above cases for the theoretical intensity function true t we used three values of T (100, 200, 500) and for each combination of true t and T we generated 100 simulated samples. For each simulation, the bandwidth is chosen as the one that maximizes logL(h) in (11). We do not report results relative to the first Gamma kernel estimator because simulations confirm the better performance of the estimator (8) than (7), as mentioned in Sec. 2.1; therefore, we focus on the comparison between the second Gamma kernel estimator and the Normal one. Intensities have been simulated for different values of T 100 200 500, showing an improving performance for increasing sample size. To make figures less crowded, we provide the simulated intensities together with their average just for the case T = 500 (see the plots in Fig. 3). These plots give an idea of the finite sample bias of the two estimators: it seems that bias is always smaller for the Gamma kernel estimates (solid lines) in comparison with the Normal kernel estimates (dotted lines) when the intensity functions are highly skewed (in Fig. 3, see plots (a), (b), and (e)). At the same time the Normal kernel function performs well when the real intensities are symmetric and defined on boundless support (in Fig. 3, see plots (c), (d), and (f)). Let us see Figs. 4–5 in order to analyze bias and variance of each estimator in detail: in Fig. 4 the averages of the intensity functions estimated over the 100 simulated samples both using the Gamma and the Normal kernel estimators are showed; Fig. 5 reports the corresponding simulated Mean Squared Errors (MSE’s). Both groups of figures confirm previous results for almost all the sample sizes considered across the entire range. As shown before, behaviors of the Normal kernel estimator and the true intensity are strongly different in cases (a), (b), and (e); on the other hand, the Gamma kernel estimator exhibits the same behavior both at the boundary and elsewhere as the true intensity function. The main difference between the Gamma and the Normal kernel estimators is showed in Fig. 5 plots (a), (b), and (e), where the simulated MSE’s of the Gamma kernel are much lower near t = 0; whereas plots (c), (d), and (f) in Fig. 5 show similar patterns over the entire support. The mean of the cross-validation log-likelihood for the Gamma and the Normal estimators computed over 100 simulated samples with different sizes and corresponding t-tests for paired samples are reported in Table 1; the Gamma kernel
1153
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
Gamma Kernel Intensity In Temporal Point Processes
Figure 3. Simulated intensity functions over 100 samples with T = 500 by using the Gamma (black solid line) and the Normal kernel estimator (dotted line). The true intensity (grey solid line) is: (a) Gamma mixture; (b) Gamma(1, 2); (c) Gamma(3, 1); (d) Normal(5, 1); (e) Pareto(5, 1); and (f) Normal mixture.
Marcon et al.
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
1154
Figure 4. Simulated mean of the 100 intensity functions with T = 500 obtained by using the Gamma kernel estimator (black solid line) and the Normal kernel estimator (dotted line). The true intensity (grey solid line) is: (a) Gamma mixture; (b) Gamma(1, 2); (c) Gamma(3, 1); (d) Normal(5, 1); (e) Pareto(5, 1); and (f) Normal mixture.
1155
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
Gamma Kernel Intensity In Temporal Point Processes
Figure 5. Simulated MSE to compare Gamma (solid line) and Normal kernel estimator (dotted line) over 100 samples with T = 500. The true intensity is: (a) Gamma mixture; (b) Gamma(1, 2); (c) Gamma(3, 1); (d) Normal(5, 1); (e) Pareto(5, 1); and (f) Normal mixture.
1156
Marcon et al.
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
Table 1 Mean cross-validation log-likelihood for the Gamma kernel estimator (Mean logL ) and the Normal one (Mean logL ) and corresponding t-tests computed over 100 simulated samples with T = 100 200 500 for the 6 theoretical intensity functions (a)–(f) described in Sec. 3 Mean logL
Mean logL
t-Test
p-Value
T = 100
(a) (b) (c) (d) (e) (f)
12086 19336 18031 22551 40650 12330
10924 18624 18037 22693 38695 12403
1612 1507 −034 −1308 970 −1454
0.00 0.00 0.74 0.00 0.00 0.00
T = 200
(a) (b) (c) (d) (e) (f)
38358 51769 49230 58837 93506 38409
36387 50347 49232 58991 88855 38468
1661 1384 −013 −2111 973 −698
0.00 0.00 0.90 0.00 0.00 0.00
T = 500
(a) (b) (c) (d) (e) (f)
139743 176970 167661 192169 282539 139943
136207 173855 167615 192395 272682 140011
1171 1188 172 −1481 839 −739
0.00 0.00 0.09 0.00 0.00 0.00
estimator fits significantly better than the Normal one for cases (a), (b), and (e), while for the cases (c), (d), and (f) there are not significant differences between the two estimators. Table 2 summarizes the MISE’s defined in (10) and the Kullback–Leibler distances, defined in (4), computed for both estimators over the different simulated intensity functions: we can observe that the performance of the Gamma and the Normal smoothers are close when the true intensity is symmetric whereas their difference increases for highly skewed functions. In other words, the Gamma kernel adequately estimates asymmetric intensity functions and its performance improves for greater sample sizes. The similarity between the Gamma empirical and theoretical MISE (obtained replacing (9) in (10)), is a further result validating our estimation procedures, though the approximation of the results. Asymptotic MISE evaluation for the Normal kernel estimator (Silverman, 1986) is not reported since dealing with densities with bounded domains the assumptions required for its applicability (Rosenblatt, 1956) concerning the continuity of derivatives and symmetry around zero are not satisfied by the simulated functions. Finally, a simulation study was carried out to investigate the performance of the Gamma kernel estimator (8) and the Normal one (5) in a very extreme situation where the true intensity function of an inhomogeneous Poisson process is
Gamma Kernel Intensity In Temporal Point Processes
1157
Table 2 Empirical Mean Integrated Squared Error (MISE) and Kullback–Leibler distance (KLD) for the Gamma and the Normal kernel estimators computed over 100 simulated samples with T = 100 200 500 for the 6 theoretical intensity functions (a)–(f) described in Sec. 3. The corresponding asymptotic theoretical MISE for the Gamma kernel estimator is also reported MISE
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
T = 100
T = 200
T = 500
(a) (b) (c) (d) (e) (f) (a) (b) (c) (d) (e) (f) (a) (b) (c) (d) (e) (f)
MISE
KLD
Empirical
Theoretical
Empirical
6158 6120 9760 12122 62343 7243 14582 18355 19478 20501 145162 15178 49980 52834 50964 57354 381951 46025
11929 11517 9365 14104 111704 9077 26785 25715 20958 31563 249869 20430 78331 74704 61047 91941 727454 59919
31056 30361 10519 11546 497754 7988 100506 113582 22419 19326 2407073 16717 463890 624272 75146 52404 12574254 51581
7909 6881 67286 97705 45231 32576 8139 9163 45564 26797 49255 29532 9758 12170 32672 30509 48864 15885
18186 24281 83647 42521 286219 406283 24781 34805 102321 27651 653775 155364 37925 62039 120403 27868 1122038 20229
Figure 6. Simulated mean (on the left) and MSE (on the right) over 100 samples with T = 500 for the Gamma kernel estimator (solid line) and the Normal kernel estimator (dotted line), assuming a step intensity function as in Eq. (12).
1158
Marcon et al.
proportional to a step function such that: 1 0 < t < 10 true t = 5 10 ≤ t ≤ 20
(12)
Simulation results over 100 samples for different values of T 100 200 500 show that both the Normal and Gamma kernel estimators provide similarly results, reflecting the hardness of this extreme case, mainly related to the presence of a bounded support and discontinuity points, in correspondence of which very high values of the simulated MSE are obtained (see Fig. 6 for T = 500).
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
4. Application to Real Data The empirical illustration concerns the analysis of four different seismic datasets. The estimation of the time intensity function is carried out for four seismic catalogs of different geographical areas. Each seismic catalog is formed by the space-time coordinates of seismic events. In this article, we focus just on temporal values. (a) Colfiorito (Fig. 7). The Colfiorito earthquake sequence (n = 2201) occurred on September 26, 1997, at 2:33 am in the area between Umbria and Marche, near Colfiorito (source: INGV – Istituto Nazionale di Geofisica e Vulcanologia). The seismic sequence was characterized both by the two mainshocks (2:33 am and 11:40 am, latitude 430 N and longitude 129 E) and by several aftershocks until November 3, 1997. (b) Palermo (Fig. 8). On September 6, 2002, at 3:21 am, a M = 56 earthquake, occurring some tens of kilometers offshore from the Northern Sicilian coast (latitude 137 N and longitude 384 E), slightly damaged the city of Palermo and surroundings. The seismic sequence (the mainshock and the several aftershocks, occurred for more than 80 days) is constituted by 810 events occurred in the
Figure 7. sequence.
Gamma (solid line) and Normal (dotted line) kernel estimates for (a) Colfiorito
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
Gamma Kernel Intensity In Temporal Point Processes
Figure 8. sequence.
1159
Gamma (solid line) and Normal (dotted line) kernel estimates for (b) Palermo
area 13.18–14.45 E and 37.95–38.95 N; the cluster has been drawn from the Southern Tyrrhenian Sea catalog (source INGV – Istituto Nazionale di Geofisica e Vulcanologia). (c) Landers (Fig. 9). The Landers sequence (n = 751) is drawn from the Southern California catalogs (source: U.S.G.S. – U.S. Geological Survey), that represents earthquakes occurred between January 1984 and June 2004 in a rectangular area around Los Angeles, between longitude −112 and −114 E and latitude 32–37 N; the dataset consists of 6796 earthquakes, with magnitude not smaller than 3.0. Landers is the earthquake identified by latitude 342 N, longitude
Figure 9. sequence.
Gamma (solid line) and Normal (dotted line) kernel estimates for (c) Landers
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
1160
Figure 10. sequence.
Marcon et al.
Gamma (solid line) and Normal (dotted line) kernel estimates for (d) Japan
−1164 E, magnitude M = 73, and time June 28 1992, 4:57:31 am (Adelfio et al., 2006; Schoenberg, 2003). (d) Japan (Fig. 10). We use the hypocenter data recorded by the Japan Meteorological Agency, and consider one of several datasets discussed in Ogata (1998) and Ogata and Zhuang (2006). These data cover the area of the east coast of Tohoku District, one of the most active plate boundary zones in and around Japan. The data (n = 4333) consist of earthquakes of magnitude 4.5 and larger that are chosen from the region 36–42 N and 141–145 E for depths down to 100 km and for the time span 1926–1995. During 1938, the Shioya-Oki great swarm, including several large earthquakes, occurred near the coast of Fukusima Prefecture. We examine the intensity rate changes in time at the epicenter of the largest earthquake of M = 75 that occurred on November 5, 1938, with longitude 14218 E and latitude 3733 N (Adelfio and Ogata, 2010).
Table 3 Values of the h and of the log-likelihood estimated for (a) Colfiorito, (b) Palermo, (c) Landers, and (d) Japan data by cross-validation Gamma Kernel
(a) Colfiorito (b) Palermo (c) Landers (d) Japan
Normal Kernel
h
logL
h
logL
0024 0019 0193 0174
720202 175955 97415 −13643
0113 0678 10059 3044
733212 171882 61944 −11325
Gamma Kernel Intensity In Temporal Point Processes
1161
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
The Gamma and the Normal kernel estimators are applied to each seismic catalog. The cross-validation approach is used to select the bandwidth parameters. Figures 8 and 9 show that for the Palermo and Landers sequences, the Gamma kernel estimator catches the highest intensity near the origin, whereas the Normal kernel estimator does not. On the other hand, Figs. 7 and 10 show similar fitting of the Gamma and the Normal kernel estimators for Colfiorito and Japanese data. From these figures, we can observe that, for highly skewed distributions, the Gamma kernel estimator performs better than the Normal one mostly near the origin, suggesting that its choice could be crucial in those cases where bias effects are relevant. These results are summarized in Table 3, reporting the logL and h values estimated for each sequence.
5. Summary and Conclusion We compared the Gamma and the Normal kernel intensity estimators both when the intensity of the generating process is asymmetric, and defined on a limited support 0 +, and when it is a symmetric function with unbounded domain. For simulated cases, results show that the Gamma kernel estimator is less biased than the Normal one near the origin for limited support 0 +, while the choice between the two competitive estimators is irrelevant when the true intensity is symmetric. This point was also illustrated through a nonparametric estimation of the seismic activity of four real different catalogs, observing again that the Gamma kernel estimator performs better when the time behavior of the sequences is described by a skewed distribution, while both the Gamma and the Normal kernel provide good results in other cases. Therefore, although it is usual to consider the choice of the kernel function less important than the bandwidth selection, there are some situations in which the choice of an appropriate kernel function is a crucial issue. As a direction for future work, we are considering methods for simultaneous choice of bandwidth parameters and kernel functions in space-time point patterns.
Acknowledgment The authors thank the reviewers for their useful comments and suggestions which improved the quality of the article.
References Adelfio, G. (2010). Kernel estimation and display of a five-dimensional conditional intensity function. Nonlinear Processes in Geophysics 17:237–244. Adelfio, G., Chiodi, M. (2009). Second-order diagnostics for space-time point processes with application to seismic events. Environmetrics 20:895–911. Adelfio, G., Chiodi, M. (2011). Kernel intensity for space-time point processes with application to seismological problems. In: Classification and Multivariate Analysis for Complex Data Structures. Series: Studies in Classification, Data Analysis, and Knowledge Organization. Heidelberg: Springer-Verlag, pp. 401–408. Adelfio, G., Ogata, Y. (2010). Hybrid kernel estimates of space-time earthquake occurrance rates using the etas model. Annals of the Institute of Statistical Mathematics 62:127–143.
Downloaded By: [Adelfio, Giada] At: 09:44 19 April 2011
1162
Marcon et al.
Adelfio, G., Chiodi, M., De Luca, L., Luzio, D. (2006). Nonparametric clustering of seismic events. In: Data Analysis, Classification and the Forward Search. Series: Studies in Classification, Data Analysis, and Knowledge Organization. Heidelberg: Springer-Verlag, pp. 397–404. Adelfio, G., Chiodi, M., De Luca, L., Luzio, D. (2010). An algorithm for earthquake clustering based on maximum likelihood. In: Data Analysis and Classification. Series: Studies in Classification, Data Analysis, and Knowledge Organization. New York: Springer, pp. 25–32. Bouezmarni, T., Scaillet, O. (2005). Consistency of asymmetric kernel density estimators and smoothed histograms with application to income data. Technical Report 0306, Interuniversity Attraction Pole. Economic Theory 21:390–412. Bowman, A. W., Hall, P., Titterington, D. M. (1984). Cross-validation in nonparametric estimation of probabilities and probability density. Biometrika Trust 71:341–351. Chen, S. X. (2000). Probability density function estimating using gamma kernels. Annals of the Institute of Statistical Mathematics 52:471–480. Choi, E., Hall, P. (1998). On bias reduction in local linear smoothing. Biometrika 85(2): 333–345. Chow, Y. S., German, S., Wu, L. D. (1983). Consistent cross-validated density estimation. The Annals of Statistics 11:25–38. Daley, D. J., Vere-Jones, D. (2003). An Introduction to the Theory of Point Processes. 2nd ed. New York: Springer-Verlag. Hall, P. (1982). Cross-validation in density estimation. Biometrika 69(2):383–390. Hall, P. (1983). Large samples optimality of least squares cross-validation in density estimation. The Annals of Statistics 11:1156–74. Hall, P., Marron, J. S. (1987). Extent to which least-squares cross-validation minimizes integrated squared error in nonparametric density estimation. Probability Theory Related Fields 74:567–581. Jackson, D. D., Kagan, Y. Y. (1999). Testable earthquake forecasts for 1999. Seismlogical Research Letters 70:393–403. Ogata, Y. (1998). Space-time point-process models for earthquake occurrences. Annals of the Institute of Statistical Mathematics 50(2):379–402. Ogata, Y., Zhuang, J. (2006). Space-time etas model and an improved extension. Tectonophysics 413:13–23. Rosenblatt, M. (21956). Remarks on some nonparametric estimates of a density function. Annals of Mathematical Statistics 27:832–837. Rudemo, M. (1982). Empirical choice of histograms and kernel density estimators. Scandinavian Journal of Statistics 9:65–78. Schoenberg, F. P. (2003). Multi-dimensional residual analysis of point process models for earthquake occurrences. Journal American Statistical Association 98(464):789–795. Silverman, B. W. (1978). Choosing the window width when estimating a density. Biometrika Trust 65:1–11. Silverman, B. W. (1986). Density Estimation for Statistics and Data Analysis. London: Chapman and Hall. Stone, C. J. (1984). An asymptotically optimal window selection rule for kernel density estimates. The Annals of Statistics 12:1285–1297. Zhuang, J., Ogata, Y., Vere-Jones, D. (2002). Stochastic declustering of space-time earthquake occurrences. Journal of the American Statistical Association 97(458):369–379.