5.4 Models for Price Changes

The discreteness and concentration on “no change” make it difficult to model the intraday price changes. Campbell et al. (1997) discuss several econometric models that have been proposed in the literature. Here we mention two models that have the advantage of employing explanatory variables to study the intraday price movements. The first model is the ordered probit model used by Hauseman, Lo, and MacKinlay (1992) to study the price movements in transactions data. The second model has been considered recently by McCulloch and Tsay (2000) and is a simplified version of the model proposed by Rydberg and Shephard (2003); see also Ghysels (2000).

5.4.1 Ordered Probit Model

Let Inline be the unobservable price change of the asset under study (i.e., Inline), where Inline is the virtual price of the asset at time t. The ordered probit model assumes that Inline is a continuous random variable and follows the model

5.15 5.15

where xi is a p-dimensional row vector of explanatory variables available at time ti−1, β is a p × 1 parameter vector, Ei|xi) = 0, Var(ϵi|xi) = Inline, and Cov(ϵi, ϵj) = 0 for ij. The conditional variance Inline is assumed to be a positive function of the explanatory variable wi, that is,

5.16 5.16

where g( · ) is a positive function. For financial transactions data, wi may contain the time interval titi−1 and some conditional heteroscedastic variables. Typically, one also assumes that the conditional distribution of ϵi given xi and wi is Gaussian.

Suppose that the observed price change yi may assume k possible values. In theory, k can be infinity, but countable. In practice, k is finite and may involve combining several categories into a single value. For example, we have k = 7 in Table 5.1, where the first value “ − 3 ticks” means that the price change is − 3 ticks or lower. We denote the k possible values as {s1, … , sk}. The ordered probit model postulates the relationship between yi and Inline as

5.17 5.17

where αj are real numbers satisfying − ∞ = α0 < α1 < ⋯ < αk−1 < αk = ∞. Under the assumption of conditional Gaussian distribution, we have

5.18 5.18

where Φ(x) is the cumulative distribution function of the standard normal random variable evaluated at x, and we write σi(wi) to denote that Inline is a positive function of wi. From the definition, an ordered probit model is driven by an unobservable continuous random variable. The observed values, which have a natural ordering, can be regarded as categories representing the underlying process.

The ordered probit model contains parameters β, αi (i = 1, … , k − 1), and those in the conditional variance function σi(wi) in Eq. (5.16). These parameters can be estimated by the maximum-likelihood or Markov chain Monte Carlo methods.

Example 5.1

Hauseman et al. (1992) apply the ordered probit model to the 1988 transactions data of more than 100 stocks. Here we only report their result for IBM. There are 206,794 trades. The sample mean (standard deviation) of price change yi, time duration Δti, and bid–ask spread are − 0.0010(0.753), 27.21(34.13), and 1.9470(1.4625), respectively. The bid–ask spread is measured in ticks. The model used has nine categories for price movement, and the functional specifications are

5.19 5.19

5.20 5.20

unnumbered

where Tλ(V) = (Vλ − 1)/λ is the Box–Cox (1964) transformation of V with λ ∈ [0, 1] and the explanatory variables are defined by the following:

  • Inline = (titi−1)/100 is a rescaled time duration between the (i − 1)th and ith trades with time measured in seconds.
  • ABi−1 is the bid–ask spread prevailing at time ti−1 in ticks.
  • yiv (v = 1, 2, 3) is the lagged value of price change at tiv in ticks. With k = 9, the possible values of price changes are { − 4, − 3, − 2, − 1, 0, 1, 2, 3, 4} in ticks.
  • Viv (v = 1, 2, 3) is the lagged value of dollar volume at the (iv)th transaction, defined as the price of the (iv)th transaction in dollars times the number of shares traded (denominated in hundreds of shares). That is, the dollar volume is in hundreds of dollars.
  • SP5i−v (v = 1, 2, 3) is the 5-minute continuously compounded returns of the Standard and Poor's 500 index futures price for the contract maturing in the closest month beyond the month in which transaction (iv) occurred, where the return is computed with the futures price recorded 1 minute before the nearest round minute prior to tiv and the price recorded 5 minutes before this.
  • IBSiv (v = 1, 2, 3) is an indicator variable defined by

Inline

where Inline and Inline are the ask and bid price at time tj.

The parameter estimates and their t ratios are given in Table 5.5. All the t ratios are large except one, indicating that the estimates are highly significant. Such high t ratios are not surprising as the sample size is large. For the heavily traded IBM stock, the estimation results suggest the following conclusions:

1. The boundary partitions are not equally spaced but are almost symmetric with respect to zero.

2. The transaction duration Δti affects both the conditional mean and conditional variance of yi in Eqs. (5.19) and (5.20).

3. The coefficients of lagged price changes are negative and highly significant, indicating price reversals.

4. As expected, the bid–ask spread at time ti−1 significantly affects the conditional variance.

Table 5.5 Parameter Estimates of Ordered Probit Model in Eqs. (5.19) and (5.20) for the 1988 Transaction Data of IBM, Where t Denotes the t Ratio

NumberTable

Source: Reprinted with permission of Elsevier from Journal of Financial Economics (1992, Vol. 31, p. 345)

5.4.2 Decomposition Model

An alternative approach to modeling price change is to decompose it into three components and use conditional specifications for the components; see Rydberg and Shephard (2003). The three components are an indicator for price change, the direction of price movement if there is a change, and the size of price change if a change occurs. Specifically, the price change at the ith transaction can be written as

5.21 5.21

where Ai is a binary variable defined as

5.22 5.22

Di is also a discrete variable signifying the direction of the price change if a change occurs, that is,

5.23 5.23

where Di|(Ai = 1) means that Di is defined under the condition of Ai = 1, and Si is the size of the price change in ticks if there is a change at the ith trade and Si = 0 if there is no price change at the ith trade. When there is a price change, Si is a positive integer-valued random variable.

Note that Di is not needed when Ai = 0, and there is a natural ordering in the decomposition. Di is well defined only when Ai = 1 and Si is meaningful when Ai = 1 and Di is given. Model specification under the decomposition makes use of the ordering.

Let Fi be the information set available at the ith transaction. Examples of elements in Fi are Δtij, Aij, Dij, and Sij for j ≥ 0. The evolution of price change under model (5.21) can then be partitioned as

5.24 5.24

Since Ai is a binary variable, it suffices to consider the evolution of the probability pi = P(Ai = 1) over time. We assume that

5.25 5.25

where xi is a finite-dimensional vector consisting of elements of Fi−1 and β is a parameter vector. Conditioned on Ai = 1, Di is also a binary variable, and we use the following model for δi = P(Di = 1|Ai = 1):

5.26 5.26

where zi is a finite-dimensional vector consisting of elements of Fi−1 and γ is a parameter vector. To allow for asymmetry between positive and negative price changes, we assume that

5.27 5.27

where g(λ) is a geometric distribution with parameter λ and the parameters λj, i evolve over time as

5.28 5.28

where wi is again a finite-dimensional explanatory variable in Fi−1 and θj is a parameter vector.

In Eq. (5.27), the probability mass function of a random variable x, which follows the geometric distribution g(λ), is

Inline

We added 1 to the geometric distribution so that the price change, if it occurs, is at least 1 tick. In Eq. (5.28), we take the logistic transformation to ensure that λj, i ∈ [0, 1].

The previous specification classifies the ith trade, or transaction, into one of three categories:

1. No price change: Ai = 0 and the associated probability is (1 − pi).

2. A price increase: Ai = 1, Di = 1, and the associated probability is piδi. The size of the price increase is governed by 1 + gu, i).

3. A price drop: Ai = 1, Di = − 1, and the associated probability is pi(1 − δi). The size of the price drop is governed by 1 + gd, i).

Let Ii(j) for j = 1, 2, 3 be the indicator variables of the prior three categories. That is, Ii(j) = 1 if the jth category occurs and Ii(j) = 0 otherwise. The log-likelihood function of Eq. (5.24) becomes

Inline

and the overall log-likelihood function is

5.29 5.29

which is a function of parameters β, γ, θu, and θd.

Example 5.2

We illustrate the decomposition model by analyzing the intraday transactions of IBM stock from November 1, 1990, to January 31, 1991. There were 63 trading days and 59,838 intraday transactions in the normal trading hours. The explanatory variables used are:

1. Ai−1: the action indicator of the previous trade [i.e., the (i − 1)th trade within a trading day]

2. Di−1: the direction indicator of the previous trade

3. Si−1: the size of the previous trade

4. Vi−1: the volume of the previous trade, divided by 1000

5. Δti−1: time duration from the (i − 2)th to (i − 1)th trade

6. BAi: the bid–ask spread prevailing at the time of transaction

Because we use lag-1 explanatory variables, the actual sample size is 59,775. It turns out that Vi−1, Δti−1, and BAi are not statistically significant for the model entertained. Thus, only the first three explanatory variables are used. The model employed is

5.30 5.30

The parameter estimates, using the log-likelihood function in Eq. (5.29), are given in Table 5.6. The estimated simple model shows some dynamic dependence in the price change. In particular, the trade-by-trade price changes of IBM stock exhibit some appealing features:

1. The probability of a price change depends on the previous price change. Specifically, we have

Inline

The result indicates that a price change may occur in clusters and, as expected, most transactions are without price change. When no price change occurred at the (i − 1)th trade, then only about one out of four trades in the subsequent transaction has a price change. When there is a price change at the (i − 1)th transaction, the probability of a price change in the ith trade increases to about 0.5.

2. The direction of price change is governed by

Inline

This result says that (a) if no price change occurred at the (i − 1)th trade, then the chances for a price increase or decrease at the ith trade are about even; and (b) the probabilities of consecutive price increases or decreases are very low. The probability of a price increase at the ith trade given that a price change occurs at the ith trade and there was a price increase at the (i − 1)th trade is only 8.6%. However, the probability of a price increase is about 90% given that a price change occurs at the ith trade and there was a price decrease at the (i − 1)th trade. Consequently, this result shows the effect of bid–ask bounce and supports price reversals in high-frequency trading.

3. There is weak evidence suggesting that big price changes have a higher probability to be followed by another big price change. Consider the size of a price increase. We have

Inline

Using the probability mass function of a geometric distribution, we obtain that the probability of a price increase by one tick is 0.827 at the ith trade if the transaction results in a price increase and Si−1 = 1. The probability reduces to 0.709 if Si−1 = 2 and to 0.556 if Si−1 = 3. Consequently, the probability of a large Si is proportional to Si−1 given that there is a price increase at the ith trade.

Table 5.6 Parameter Estimates of ADS Model in Eq. (5.30) for IBM Intraday Transactions from December 1, 1990, to January 31, 1991

NumberTable

A difference between the ADS of Eq. (5.21) and ordered probit models is that the former does not require any truncation or grouping in the size of a price change.

R Demonstration for Logistic Linear Regression

The following output has been edited:

> da=read.table(“ibm91-ads.txt”,header=T)

> da1=read.table(“ibm91-adsx.txt”,header=T)

> Ai=da[,1]  % Select the variables

> Di=da[,2]

> Aim1=da1[,4]

> Dim1=da1[,5]

>

> m1=glm(Ai∼Aim1,family=binomial) %Fit a linear

   logistic model 

> summary(m1)

Call:

glm(formula = Ai ∼ Aim1, family = binomial)


Deviance Residuals:

    Min       1Q   Median       3Q      Max 

-1.1373  -0.7724  -0.7724   1.2180   1.6462 


Coefficients:

            Estimate Std. Error z value Pr(>|z|)   

(Intercept) -1.05667    0.01142  -92.55   <2e-16 ***

Aim1         0.96164    0.01827   52.62   <2e-16 ***

---

>

> di=Di[Ai==1]   % Select the cases in which Ai = 1.

> dim1=Dim1[Ai==1]

> di=(di+abs(di))/2  % Logistic regression works for 1 or 0,

       % but di is coded 1 or -1 so that change is needed.

> m2=glm(di∼dim1,family=binomial)

> summary(m2)

Call:

glm(formula = di ∼ dim1, family = binomial)


Deviance Residuals:

    Min       1Q   Median       3Q      Max 

-2.1640  -1.1493   0.4497   1.2058   2.2193 


Coefficients:

            Estimate Std. Error z value Pr(>|z|)   

(Intercept) -0.06663    0.01728  -3.855 0.000116 ***

dim1        -2.30693    0.03595 -64.171 <  2e-16 ***

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset