7.8 The Extremal Index

So far our discussions of extreme values are based on the assumption that the data are iid random variables. However, in reality extremal events tend to occur in clusters because of the serial dependence in the data. For instance, we often observe large returns (both positive and negative) of an asset after some news event. In this section we extend the theory and applications of extreme values to cases in which the data form a strictly stationary time series. The basic concept of the extension is extremal index, which allows one to characterize the relationship between the dependence structure of the data and their extremal behavior. Our discussion will be brief. Interested readers are referred to Beirlant et al. (2004, Chapter 10) and Embrechts et al. (1997).

Let x1, x2, … be a strictly stationary sequence of random variables with marginal distribution function F(x). Consider the case of n observations {xi|i = 1, … , n}. As before, let x(n) be the maximum of the data, that is, x(n) = max{xi}. We seek the limiting distribution of (x(n) − βn)/αn for some suitably chosen normalizing constants αn > 0 and βn. If {xi} were iid, Section 7.5 shows that the only possible nondegenerate limits are the extreme value distributions. What is the limiting distribution when {xi} are serially dependent?

To answer this question, we start with a heuristic argument. Suppose that the serial dependence of the stationary series xi decays quickly so that xi and xi+ℓ are essentially independent when ℓ is sufficiently large. In other words, assume that the long-range dependence of xi vanishes quickly. Now divide the data into disjoint blocks of size k. Specifically, let g = [n/k] be the largest integer less than or equal to n/k. The ith block of the data is then {xj|j = (i − 1) * k + 1, … , i * k}, where it is understood that the (g + 1)th block may contain less than k observations. Let xk, i be the maximum of the ith block, that is, xk, i = max{xj|j = (i − 1) * k + 1, … , i * k}. The collection of block maxima is {xk, i|i = 1, … , g + 1}. From the definitions, it is easy to see that

7.44 7.44

That is, the sample maximum is also the maximum of the block maxima. If the block size k is sufficiently large and the block maximum xk, i does not occur near the end of the ith block, then xk, i and xk, i+1 are sufficiently far apart and essentially independent under the assumption of weak long-range dependence in {xi}. Consequently, {xk, i|i = 1, … , g + 1} can be regarded as a sample of iid random variables, and the limiting distribution of its maximum, which is x(n), should be the extreme value distribution. The prior discussion shows that, under some proper condition, the limiting distribution of the maximum of a strictly stationary time series is also the extreme value distribution.

The proper condition needed for the maximum x(n) of a strictly stationary time series to have the extreme value limiting distribution is obtained by Leadbetter (1974) and known as the D(un) condition. Details are given in the next section. The prior heuristic argument also suggests that, even though the limiting distribution of x(n) is also the extreme value distribution, the parameters associated with the limiting distribution, however, will not be the same as those when {xi} are iid random samples because the limiting distribution depends on the marginal distribution of the underlying sequences. For the iid sequences, the marginal distribution is F(x), but for a stationary series the underlying sequences are the block maxima xk, i whose marginal distribution is not F(x). The marginal distribution of xk, i depends on k and the strength of serial dependence in {xi}.

7.8.1 The D(un) Condition

Consider the sample x1, x2, … , xn. To place limits on the long-range dependence of {xi}, let un be a sequence of thresholds increasing at a rate for which the expected number of exceedances of xi over un remains bounded. Mathematically, this says that Inline, where F( · ) is the marginal cumulative distribution function of xi. For any positive integers p and q, suppose that iv (v = 1, … , p) and jt (t = 1, … , q) are arbitrary integers satisfying

Inline

where j1ip ≥ ℓn, where ℓn is a function of the sample size n such that ℓn/n → 0 as Inline. Let A1 = {i1, i2, … , ip} and A2 = {j1, j2, … , jq} be two sets of time indices. From the prior condition, elements in A1 and A2 are separated by at least ℓn time periods. The condition D(un) is satisfied if

7.45 7.45

where Inline as Inline. This condition says that any two events of the form Inline and Inline can become asymptotically independent as the sample size n increases when the index subsets A1 and A2 of {1, 2, … , n} are separated by a distance ℓn which satisfies ℓn/n → 0 as Inline. The D(un) condition looks complicated, but it is relatively weak. For instance, consider Gaussian sequences with autocorrelation ρn for lag n. The D(un) condition is satisfied if ρnln(n) → 0 as Inline; see Berman (1964).

Leadbetter's Theorem 1

Suppose that {xi|i = 1, … , n} is a strictly stationary time series for which there exist sequences of constants αn > 0 and βn and a nondegenerate distribution function F*( · ) such that

Inline

where → d denotes convergence in distribution. If D(un) holds with un = αnx + βn for each x such that F*(x) > 0, then F*(x) is an extreme value distribution function.

The prior theorem shows that the possible limiting distributions for the maxima of strictly stationary time series satisfying the D(un) condition are also the extreme value distributions. As noted before, the dependence can affect the limiting distribution, however. The effect of the dependence appears in the marginal distribution of the block maxima xk, i. To state the effect more precisely, let Inline be a sequence of iid random variables such that the marginal distribution of Inline is the same as that of the stationary time series xi. Let Inline be the maximum of Inline. Leadbetter (1983) establishes the following result.

Leadbetter's Theorem 2

If there exist sequences of constants αn > 0 and βn and a nondegenerate distribution function Inline such that

Inline

if the condition D(un) holds with un = αnx + βn for each x such that Inline, and if P[(x(n) − βn)/αnx] converges for some x, then

Inline

for some constant θ ∈ (0, 1].

The constant θ is called the extremal index. It plays an important role in determining the limiting distribution F*(x) for the maximum of a strictly stationary time series. To see this, we provide some simple derivations for the case of ξ ≠ 0. From the result of Eq. (7.16), Inline is the generalized extreme value distribution and assumes the form

Inline

where ξ ≠ 0 and 1 + ξ(x − β)/α > 0. In other words, we assume that for the iid sequence Inline, the limiting extreme distribution of Inline has parameters ξ, β and α. Based on Theorem 2 of Leadbetter (1983), we have

7.46 7.46

where ξ* = ξ, α* = αθξ, and β* = β − α(1 − θξ)/ξ. Therefore, for a stationary time series {xi} satisfying the D(un) condition, the limiting distribution of the sample maximum is the generalized extreme value distribution with the shape parameter ξ, which is the same as that of the iid sequences. On the other hand, the location and scale parameters are affected by the extremal index θ. Specifically, α* = αθξ and β* = β − α(1 − θξ)/ξ. Results for the case of ξ = 0 can be derived via the same approach and we have α* = α and β* = β + αln(θ).

A formal definition of the extremal index is as follows: Let {xi} be a strictly stationary time series with marginal cumulative distribution function F(x) and θ a nonnegative number. Assume that for every τ > 0 there exists a sequence of thresholds un such that

7.47 7.47

7.48 7.48

Then θ is called the extremal index of the time series {xi}. See Embrechts et al. (1997). Note that, for the corresponding iid sequence Inline, under the assumption that Eq. (7.47) holds, we have

Inline

where we have used the property Inline. Thus, the definition also highlights the role played by the extremal index θ.

7.8.2 Estimation of the Extremal Index

There are several ways to estimate the extremal index θ of a strictly stationary time series {xi}. Each estimation method is associated with an interpretation of the extremal index. In what follows, we discuss some of the estimation methods.

7.8.2.1 The Blocks Method

From the definition of the extremal index θ, we have, for a large n, that

Inline

provided that n[1 − F(un)] → τ > 0. Hence

7.49 7.49

This limiting relationship suggests a method to estimate θ. The denominator can be estimated by the sample quantile, namely

Inline

where I(C) = 1 if the augment C holds and = 0 otherwise, that is, I(C) is the indicator variable for the statement C, and N(un) denotes the number of exceedances of the sample over the threshold un. The numerator P(x(n)un) is harder to estimate. One possibility is to use the block maxima. Specifically, let k = k(n) be a properly chosen block size that depends on the sample size n and, as before, let g = [n/k] be the integer part of n/k. For simplicity, assume that n = gk. The ith block consists of {xj|j = (i − 1) * k + 1, … , i * k} and let xk, i be the maximum of the ith block. Using Eq. (7.44) and the approximate independence of block maxima, we have

Inline

The probability P(xk, iun) can be estimated from the block maxima, that is,

Inline

where G(un) is the number of blocks such that the block maximum exceeds the threshold un. Combining the estimators for numerator and denominator, we obtain

7.50 7.50

where the subscript b signifies the blocks method. Note that N(un) is the number of exceedances of the sample {xi} over the threshold un and G(un) is the number of blocks with one or more exceedances. Using approximation based on Taylor expansion of ln(1 − x), we obtain a second estimator:

Inline

Based on the results of Hsing et al. (1988), this estimator can also be interpreted as the reciprocal of the mean cluster size of the limiting compound Poisson process N(un).

7.8.2.2 The Runs Method

O'Brien (1987) proved, under certain weak mixing condition, that

Inline

where Inline, where s is a function of the sample size n satisfying some growth conditions, including Inline and s/n → 0 as Inline. See Beirlant et al. (2004) and Embrechts et al. (1997) for details. This result has been used to construct an estimator of θ based on runs:

Inline

where N(un) is the number of exceedances of the sample {xi} over the threshold un, k is a function of n, and Ai, n = {xi > un, xi+1un, … , xi+kun}. Note that Ai, n denotes the event that an exceedance is followed by a run of k observations below the threshold. Since k/n → 0 as Inline, we can write the runs estimator as

Inline

Finally, other estimators of θ are available in the literature. See, for instance, the methods discussed in Beirlant et al. (2004). For demonstration, we consider, again, the negative daily log returns of IBM stock from July 3, 1962, to December 31, 1998. Figure 7.10 shows the estimates of the extremal index for various thresholds when the block size k = 10. We chose k = 10 because the daily log returns have weak serial dependence. The estimates are based on the blocks method, that is, Inline. From the plot, we see that Inline 0.82 for threshold 0.025. Indeed, a simple direct calculation using k = 10 and threshold 0.025 gives Inline. The plot also shows that the estimate Inline of the extremal index might be sensitive to the choices of threshold and block size k.

Figure 7.10 Estimates of extremal index for negative daily log returns of IBM stock from July 3, 1962, to December 31, 1998. Block size is k = 10 and lower horizontal axis of plot K denotes number of blocks whose maximum exceeds threshold.

7.10

7.8.3 Value at Risk for a Stationary Time Series

The relationship between F*(x) of the maximum of a stationary time series and Inline of its iid counterpart established in Theorem 2 of Leadbetter (1983) can be used to calculate the VaR of a financial position when the associated log returns form a stationary time series. Specifically, from P(x(n)un) ≈ [F(x)]nθ, the (1 − p)th quantile of F(x) is the (1 − p)nθth quantile of the limiting extreme value distribution of x(n). Consequently, the VaR of Eq. (7.28) based on the extreme value theory becomes

7.51 7.51

where n is the length of the subperiod. From the formula, we risk underestimating the VaR if the extremal index is overlooked.

As an illustration, again consider the negative daily log returns of IBM stock from July 3, 1962, to December 31, 1998. Using Inline = 0.823, the 1% VaR for the long position of $10 millions on the stock for the next trading day becomes 3.2714 for the case of choosing n = 63 days in parameter estimation. As expected, this is higher than the 3.0497 of Example 7.6 when the extremal index is neglected.

7.8.3.1 R Demonstration

> library(evir)

> help(exindex)

> m1=exindex(nibm,10) %Estimate the extremal index

   of Figure 7.10.

>      % VaR calculation.

> 2.583-(.945/.335)*(1-(-63*.823*log(.99))∧-.335)

[1] 3.271388

Exercises

7.1 Consider the daily returns of GE stock from January 2, 1998, to December 31, 2008. The data can be obtained from CRSP or the file d-ge9808.txt. Convert the simple returns into log returns. Suppose that you hold a long position on the stock valued at $1 million. Use the tail probability 0.01. Compute the value at risk of your position for 1-day horizon and 15-day horizon using the following methods:

a. The RiskMetrics method.

b. A Gaussian ARMA–GARCH model.

c. An ARMA–GARCH model with a Student-t distribution. You should also estimate the degrees of freedom.

d. The traditional extreme value theory with subperiod length n = 21.

7.2 The file d-csco9808.txt contains the daily simple returns of Cisco Systems stock from 1998 to 2008 with 2767 observations. Transform the simple returns to log returns. Suppose that you hold a long position of Cisco stock valued at $1 million. Compute the value at risk of your position for the next trading day using probability p = 0.01.

a. Use the RiskMetrics method.

b. Use a GARCH model with a conditional Gaussian distribution.

c. Use a GARCH model with a Student-t distribution. You may also estimate the degrees of freedom.

d. Use the unconditional sample quantile.

e. Use a two-dimensional homogeneous Poisson process with threshold 2%, that is, focusing on the exceeding times and exceedances that the daily stock price drops 2% or more. Check the fitted model.

f. Use a two-dimensional nonhomogeneous Poisson process with threshold 2%. The explanatory variables are (1) an annual time trend, (2) a dummy variable for October, November, and December, and (3) a fitted volatility based on a Gaussian GARCH(1,1) model. Perform a diagnostic check on the fitted model.

g. Repeat the prior two-dimensional nonhomogeneous Poisson process with threshold 2.5 or 3%. Comment on the selection of threshold.

7.3 Use Hill's estimator and the data d-csco9808.txt to estimate the tail index for daily log returns of Cisco stock.

7.4 The file d-hpq3dx9808.txt contains dates and the daily simple returns of Hewlett-Packard, the CRSP value-weighted index, equal-weighted index, and the S&P 500 index from 1998 to 2008. The returns include dividend distributions. Transform the simple returns to log returns. Assume that the tail probability of interest is 0.01. Calculate value at risk for the following financial positions for the first trading day of year 2009.

a. Long on Hewlett-Packard stock of $1 million and S&P 500 index of $1 million using RiskMetrics. The α coefficient of the IGARCH(1,1) model for each series should be estimated.

b. The same position as part (a) but using a univariate ARMA–GARCH model for each return series.

c. A long position on Hewlett-Packard stock of $1 million using a two-dimensional nonhomogeneous Poisson model with the following explanatory variables: (1) an annual time trend, (2) a fitted volatility based on a Gaussian GARCH model for Hewlett-Packard stock, (3) a fitted volatility based on a Gaussian GARCH model for the S&P 500 index returns, and (4) a fitted volatility based on a Gaussian GARCH model for the value-weighted index return. Perform a diagnostic check for the fitted models. Are the market volatility as measured by the S&P 500 index and value-weighted index returns helpful in determining the tail behavior of stock returns of Hewlett-Packard? You may choose several thresholds.

7.5 Consider the daily returns of Alcoa (AA) stock and the S&P 500 composite index (SPX) from 1998 to 2008. The simple returns and dates are in the file d-aaspx9808.txt. Transform the simple returns to log returns and focus on the daily negative log returns of AA stock.

a. Fit the generalized extreme value distribution to the negative AA log returns, in percentages, with subperiods of 21 trading days. Write down the parameter estimates and their standard errors. Obtain a scatterplot and a QQ plot of the residuals.

b. What is the return level of the prior fitted model when 24 subperiods of 21 days are used?

c. Obtain a QQ plot (against exponential distribution) of the negative log returns with threshold 2.5% and a mean excess plot of the returns.

d. Fit a generalize Pareto distribution to the negative log returns with threshold 3.5%. Write down the parameter estimates and their standard errors.

e. Obtain (i) a plot of excess distribution, (ii) a plot of the tail of the underlying distribution, (iii) a scatterplot of residuals, and (iv) a QQ plot of the residuals for the fitted GPD.

f. Based on the fitted GPD model, compute the VaR and expected shortfall for probabilities q = 0.99 and 0.999.

7.6 Consider, again, the daily log returns of Alcoa (AA) stock in Exercise 7.5. Focus now on the daily positive log returns. Answer the same questions as in Exercise 7.5. However, use threshold 3% in fitting the GPD model.

7.7 Consider the daily returns of SPX in d-aaspx9808.txt. Transform the returns into log returns and focus on the daily negative log returns.

a. Fit the generalized extreme value distribution to the negative SPX log returns, in percentage, with subperiods of 21 trading days. Write down the parameter estimates and their standard errors. Obtain a scatterplot and a QQ plot of the residuals.

b. What is the return level of the prior fitted model when 24 subperiods of 21 days are used?

c. Obtain a QQ plot (against exponential distribution) of the negative log returns with threshold 2.5% and a mean excess plot of the returns.

d. Fit a generalize Pareto distribution to the negative log returns with threshold 2.5%. Write down the parameter estimates and their standard errors.

e. Obtain (i) a plot of excess distribution, (ii) a plot of the tail of the underlying distribution, (iii) a scatterplot of residuals, and (iv) a QQ plot of the residuals for the fitted GPD.

f. Based on the fitted GPD model, compute the VaR and expected shortfall for probabilities q = 0.99 and 0.999.

7.8 Consider the daily log returns of the GE stock of Exercise 7.1. Obtain estimates Inline and Inline of the extremal index of (a) the positive return series and (b) the negative return series, using block sizes k = 5 and 10 and threshold 2.5%.

References

Beirlant, J., Goegebeur, Y., Segers, J., and Teugels, J. (2004). Statistics of Extremes: Theory and Applications. Wiley, Hoboken, NJ.

Berman, S. M. (1964). Limiting theorems for the maximum term in stationary sequences. Annals of Mathematical Statistics 35: 502–516.

Coles, S. (2001). An Introduction to Statistical Modeling of Extreme Values. Springer, New York.

Cox, D. R. and Hinkley, D. V. (1974). Theoretical Statistics. Chapman and Hall, London.

Danielsson, J. and De Vries, C. G. (1997a). Value at risk and extreme returns. Working paper, London School of Economics, London, UK.

Danielsson, J. and De Vries, C. G. (1997b). Tail index and quantile estimation with very high frequency data. Journal of Empirical Finance 4: 241–257.

Davison, A. C. and Smith, R. L. (1990). Models for exceedances over high thresholds (with discussion). Journal of the Royal Statistical Society Series B 52: 393–442.

Dekkers, A. L. M. and De Haan, L. (1989). On the estimation of extreme value index and large quantile estimation. Annals of Statistics 17: 1795–1832.

Duffie, D. and Pan, J. (1997). An overview of value at risk. Journal of Derivatives Spring: 7–48.

Embrechts, P., Kuppelberg, C., and Mikosch, T. (1997). Modelling Extremal Events. Springer, Berlin.

Feller, W. (1971). An Introduction to Probability Theory and Its Applications, Vol. 2. Wiley, New York.

Goldie, C. M. and Smith, R. L. (1987). Slow variation with remainder: Theory and applications. Quarterly Journal of Mathematics 38: 45–71.

Gnedenko, B. V. (1943). Sur la distribution limite du terme maximum of d'une série Aléatorie. Annals of Mathematics 44: 423–453.

Gumbel, E. J. (1958). Statistics of Extremes. Columbia University Press, New York.

Hill, B. M. (1975). A simple general approach to inference about the tail of a distribution. Annals of Statistics 3: 1163–1173.

Hsing, T., Hüsler, J. and Leadbetter, M. R. (1988). On the exceedance point process for a stationary sequence. Probability Theory and Related Fields 78: 97–112.

Jenkinson, A. F. (1955). The frequency distribution of the annual maximum (or minimum) of meteorological elements. Quarterly Journal of the Royal Meteorological Society 81: 158–171.

Jorion, P. (2006). Value at Risk: The New Benchmark for Managing Financial Risk, 3rd ed. McGraw-Hill, Chicago.

Koenker, R. W. and Bassett, G. W. (1978). Regression quantiles. Econometrica 46: 33–50.

Koenker, R. W. and D'Orey, V. (1987). Computing regression quantiles. Applied Statistics 36: 383–393.

Leadbetter, M. R.(1974). On extreme values in stationary sequences. Zeitschrift für Wahrscheinlichkeitsthorie und Verwandte Gebiete 28: 289–303.

Leadbetter, M. R. (1983). Extremes and local dependence in stationary sequences. Zeitschrift für Wahrscheinlichkeitsthorie und Verwandte Gebiete 65: 291–306.

Leadbetter, M. R., Lindgren, G., and Rootzén, H. (1983). Extremes and Related Properties of Random Sequences and Processes. Springer, New York.

Longerstaey, J. and More, L. (1995). Introduction to RiskMetricsTM, 4th ed. Morgan Guaranty Trust Company, New York.

Longin, F. M. (1996). The asymptotic distribution of extreme stock market returns. Journal of Business 69: 383–408.

Longin, F. M. (1999a). Optimal margin level in futures markets: Extreme price movements. Journal of Futures Markets 19: 127–152.

Longin, F. M. (1999b). From value at risk to stress testing: The extreme value approach. Working paper, Centre for Economic Policy Research, London.

O'Brien, G. L.(1987). Extreme values for stationary and Markov sequences. Annals of Probability 15: 281–291.

Pickands, J. (1975). Statistical inference using extreme order statistics. Annals of Statistics 3: 119–131.

Smith, R. L. (1989). Extreme value analysis of environmental time series: An application to trend detection in ground-level ozone (with discussion). Statistical Science 4: 367–393.

Smith, R. L. (1999). Measuring risk with extreme value theory. Working paper, Department of Statistics, University of North Carolina at Chapel Hill.

Smith, R. L. and Shively, T. S. (1995). A point process approach to modeling trends in troposheric ozone. Atmospheric Environment 29: 3489–3499.

Tsay, R. S. (1999). Extreme value analysis of financial data. Working paper, Graduate School of Business, University of Chicago.

Zivot, E. and Wang, J. (2003). Modeling Financial Time Series with S-Plus. Springer, New York.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset