Chapter 6
Endogeneity

6.1 Introduction

There is an endogeneity problem when the error is correlated with at least one explanatory variable. This phenomenon is very common in econometrics because, compared to experimental sciences, it is not possible (or it is at least difficult) to control the data‐generating process. Among the possible causes of endogeneity, the three most important are:

simultaneity. In this case, there is an explanatory variable that is set simultaneously with the response: this is, for example, the case when one seeks to estimate a demand equation for a good, which contains the price of the good itself. The demand and the price are simultaneously set by the condition of equality of supply and demand and, therefore, a variation of the error term of the demand equation will shift the demand curve and therefore induce a variation of the quantity and the equilibrium price. The price variable is therefore endogenous.
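covariate measured with error. If the “true” model is $y = \alpha + \beta x^* + \epsilon$ and what is observed is $x = x^* + \nu$, the estimated model writes: $y = \alpha + \beta (x - \nu) + \epsilon$, or $y = \alpha + \beta x + \eta$ with $\eta = \epsilon - \beta \nu$. Hence, $\eta$ is correlated with $x$, which is therefore endogenous.
omitted variable. If the “true” model is $y = \alpha + \beta_x x + \beta_z z + \epsilon$ and $z$ is unobserved, the estimated model is $y = \alpha + \beta_x x + \eta$, with $\eta = \beta_z z + \epsilon$. The error of the estimated model then contains the influence of the omitted variable, and this error is correlated with $x$ if $x$ and $z$ are correlated. Once again, the covariate is then endogenous.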

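Writing the model in matrix form as $y = Z\gamma + \epsilon$, the OLS estimator is:

$$\hat{\gamma} = (Z^\top Z)^{-1} Z^\top y$$

Replacing $y$ by its expression, $y = Z\gamma + \epsilon$, we obtain $\hat{\gamma}$ as a function of the errors of the model:

$$\hat{\gamma} = \gamma + (Z^\top Z)^{-1} Z^\top \epsilon$$

We then have, denoting by $N$ the sample size:

$$\hat{\gamma} = \gamma + \left(\frac{1}{N} Z^\top Z\right)^{-1} \frac{1}{N} Z^\top \epsilon$$

The estimator is consistent ($\operatorname{plim} \hat{\gamma} = \gamma$) if $\operatorname{plim} \frac{1}{N} Z^\top \epsilon = 0$, this expression being the vector of covariances for the population between the covariates and the error. The ordinary least squares estimator is therefore consistent if the covariates and the error are uncorrelated. When this condition is not met, the method of instrumental variables, which will be presented in detail in this chapter, can be used.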
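The inconsistency of OLS can be illustrated with a short simulation. The sketch below is not taken from the source: the data-generating process, the sample size, and all object names are assumptions made for the illustration. It fits a regression that omits a covariate correlated with x and compares the estimated slope with its theoretical probability limit.

```r
# Simulated illustration (assumed data-generating process, not from the source)
set.seed(1)
N <- 10000
z <- rnorm(N)                    # variable that will be omitted
x <- 0.8 * z + rnorm(N)          # observed covariate, correlated with z
eps <- rnorm(N)
y <- 1 + 2 * x + 1.5 * z + eps   # "true" model: the slope of x equals 2

# Estimated model without z: its error, 1.5 * z + eps, is correlated with x
b_short <- unname(coef(lm(y ~ x))["x"])
# Regression including z, feasible only when z is observed
b_long <- unname(coef(lm(y ~ x + z))["x"])

# Probability limit of the short-regression slope:
# 2 + 1.5 * cov(x, z) / var(x), with cov(x, z) = 0.8 and var(x) = 0.8^2 + 1
plim_short <- 2 + 1.5 * 0.8 / (0.8^2 + 1)

round(c(short = b_short, long = b_long, plim_short = plim_short), 3)
```

Increasing N does not bring the short-regression slope closer to 2; it only brings it closer to its (wrong) probability limit.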

Concerning simultaneity, there is an additional problem as the model is not defined by one equation but by a system of equations. In this case, two strategies can be followed:

estimating only the equation of interest (limited information estimator),
estimating simultaneously all the equations (full information estimator).

The latter approach leads to a more efficient estimator, as the correlation of the errors of all the equations is taken into account. But if an equation is wrongly specified, it can contaminate the estimation of the parameters of the other equations of the model.

6.2 The Instrumental Variables Estimator

6.2.1 Generalities about the Instrumental Variables Estimator

Let us consider the following model: $y = Z\gamma + \epsilon$ with $\operatorname{V}(\epsilon) = \sigma^2 I$. If at least one of the covariates is correlated with the errors, the OLS estimator is not consistent. In order to obtain consistency, we use the instrumental variables estimator. The instrumental variables are denoted by $L$.¹ Denoting by $K$ the number of covariates and by $M$ the number of instruments (not including the column of ones), the instrumental variables must verify: $\operatorname{E}(L^\top \epsilon) = 0$. Stated differently, they must not be correlated with the errors.² In the simplest case where the number of instruments equals the number of covariates ($M = K$), the instrumental variables estimator is simply obtained by solving the system of equations $L^\top \hat{\epsilon} = 0$, which is just identified. Developing this expression, we obtain $L^\top (y - Z\hat{\gamma}) = 0$, which can also be written:

$$\hat{\gamma} = (L^\top Z)^{-1} L^\top y \tag{6.1}$$

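If there are more instruments than covariates ($M > K$), $L^\top (y - Z\hat{\gamma}) = 0$ is an over-determined system of linear equations, which, except for very special cases, doesn't have a solution. In this case, two equivalent approaches can be used to obtain the optimal estimator. The first one consists in pre-multiplying the model by the transpose of the instrument matrix, $L^\top$:

$$L^\top y = L^\top Z \gamma + L^\top \epsilon \tag{6.2}$$

It is a model that contains $M + 1$ rows and $K + 1$ parameters to estimate ($\gamma$). If one considers it as a standard regression model, the variance of the errors being $\sigma^2 L^\top L$, the best linear estimator is the GLS estimator, and we then obtain the following instrumental variables estimator:

$$\hat{\gamma} = \left(Z^\top L (L^\top L)^{-1} L^\top Z\right)^{-1} Z^\top L (L^\top L)^{-1} L^\top y = (Z^\top P_L Z)^{-1} Z^\top P_L y \tag{6.3}$$

with $P_L = L (L^\top L)^{-1} L^\top$.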

The second approach is the generalized method of moments. We consider here a vector of moments, $L^\top \epsilon$, for which the variance is $\sigma^2 L^\top L$. Using the generalized method of moments, we seek to minimize the quadratic form of the vector of moments, using the inverse of the variance matrix of these moments:

$$\frac{1}{\sigma^2} \epsilon^\top L (L^\top L)^{-1} L^\top \epsilon, \quad \mbox{with } \epsilon = y - Z\gamma$$

The first-order conditions for a minimum are $Z^\top L (L^\top L)^{-1} L^\top (y - Z\hat{\gamma}) = 0$, and solving this system of linear equations, we obtain the same estimator as before.

The instrumental variables estimator is also called the two-stage least squares estimator (2SLS), as it can be obtained by applying the method of ordinary least squares twice. When we consider the regression of $y$ on $L$, we obtain the estimator $\hat{\delta} = (L^\top L)^{-1} L^\top y$ and the fitted values $\hat{y} = L \hat{\delta} = P_L y$, with $P_L = L (L^\top L)^{-1} L^\top$. The matrix $P_L$ is therefore the projection matrix on the subspace defined by the columns of $L$. This matrix is symmetric and idempotent, which means that $P_L^\top = P_L$ and $P_L P_L = P_L$. The instrumental variables estimator 6.3 can also be written, denoting by $\hat{Z} = P_L Z$ the fitted values of the covariates regressed on the instrumental variables:

$$\hat{\gamma} = (\hat{Z}^\top \hat{Z})^{-1} \hat{Z}^\top y \tag{6.4}$$

and can therefore be obtained by applying OLS twice:

the first time by regressing every covariate on the instruments,
the second time by regressing the response on the fitted values of the first‐stage estimation.

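The variance of the instrumental variables estimator is:

$$\operatorname{V}(\hat{\gamma}) = \sigma^2 (Z^\top P_L Z)^{-1} = \sigma^2 (\hat{Z}^\top \hat{Z})^{-1}$$

where $\hat{Z} = P_L Z$ denotes the first-stage fitted values. The larger the variance of $\hat{Z}$, i.e., the more strongly the covariates $Z$ and the instruments $L$ are correlated, the more efficient the estimator.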
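The equivalence between the projection formula and the two successive OLS regressions can be checked numerically. The following base R sketch relies on simulated data; the data-generating process and all object names (l1, l2, u, Zhat, and so on) are assumptions made for the illustration, not part of the source.

```r
# Simulated data (assumed design): one endogenous covariate, two instruments
set.seed(42)
N  <- 2000
l1 <- rnorm(N); l2 <- rnorm(N)            # external instruments
u  <- rnorm(N)                            # common shock creating endogeneity
x  <- 0.7 * l1 + 0.5 * l2 + u + rnorm(N)  # covariate, correlated with the error
y  <- 1 + 2 * x + 2 * u + rnorm(N)        # true slope: 2; error: 2 * u + noise

Z <- cbind(1, x)        # covariates with the column of ones
L <- cbind(1, l1, l2)   # instruments with the column of ones

# Projection form: Zhat = P_L Z, computed without forming the N x N matrix P_L
Zhat <- L %*% solve(crossprod(L), crossprod(L, Z))
g_proj <- drop(solve(crossprod(Zhat, Z), crossprod(Zhat, y)))

# Two-stage form: regress x on the instruments, then y on the fitted values
x_hat  <- fitted(lm(x ~ l1 + l2))
g_2sls <- coef(lm(y ~ x_hat))

# OLS for comparison (inconsistent here)
g_ols <- coef(lm(y ~ x))

rbind(projection = unname(g_proj), two_stage = unname(g_2sls),
      ols = unname(g_ols))
```

The two instrumental variables rows coincide up to rounding error. Note that the standard errors printed by the second-stage lm fit would not be valid, because its residuals are computed with x_hat instead of x.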

6.2.2 The within Instrumental Variables Estimator

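The specificity of panel data methods is that the error term is modeled as having two components, an individual effect and an idiosyncratic term. Therefore, the correlation between covariates and instrumental variables, on the one hand, and the errors of the model, on the other hand, must be analyzed separately for each component of the error. In this section, we consider the estimation of the model transformed in deviations from individual means. This transformation wipes out the individual effect; therefore, there is no need to worry about the correlation between the covariates and the individual effects. The within 2SLS estimator is obtained by pre-multiplying the model first by the within transformation matrix $W$: $W y = W X \beta + W \nu$ (here $X$ and $L$ exclude the column of ones, which the within transformation wipes out along with the individual effects, and $\nu$ is the idiosyncratic error) and then by $L^\top W$,

$$L^\top W y = L^\top W X \beta + L^\top W \nu \tag{6.5}$$

and applying GLS to this transformed data, the variance matrix of the errors of this model being $\sigma_\nu^2 L^\top W L$:

$$\hat{\beta} = \left(X^\top W L (L^\top W L)^{-1} L^\top W X\right)^{-1} X^\top W L (L^\top W L)^{-1} L^\top W y$$

or, denoting by $P_W = W L (L^\top W L)^{-1} L^\top W$ the projection matrix defined by the within transformation of the instruments:

$$\hat{\beta} = (X^\top P_W X)^{-1} X^\top P_W y \tag{6.6}$$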

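A similar reasoning can be followed for the between model. We consider the between transformation of the model, $B y = B Z \gamma + B \epsilon$, with the same transformation applied to the instruments ($B L$). The instrumental variables estimator is obtained by pre-multiplying the model by $L^\top B$:

$$L^\top B y = L^\top B Z \gamma + L^\top B \epsilon \tag{6.7}$$

and applying to this transformed model the GLS estimator:

$$\hat{\gamma} = (Z^\top P_B Z)^{-1} Z^\top P_B y \tag{6.8}$$

with $P_B = B L (L^\top B L)^{-1} L^\top B$.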

The within 2SLS estimator is consistent, even if the individual effects are correlated with the covariates. On the contrary, the between 2SLS estimator is consistent only if there is no such correlation. If this hypothesis is verified, neither of them is efficient, as each of them takes into account only one component of the variability.
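To make the within instrumental variables estimator concrete, the sketch below computes it by hand on a simulated balanced panel, using base R only. The data-generating process and every object name (demean, lw, xw_hat, and so on) are assumptions made for this illustration. The covariate is correlated with both components of the error, whereas the instrument is correlated with the individual effect only.

```r
# Simulated balanced panel (assumed design, not from the source)
set.seed(123)
N <- 500; Tn <- 5
id  <- rep(seq_len(N), each = Tn)
eta <- rep(rnorm(N), each = Tn)          # individual effects
l   <- eta + rnorm(N * Tn)               # instrument: correlated with eta only
u   <- rnorm(N * Tn)                     # shock shared by x and the error
x   <- l + eta + u + rnorm(N * Tn)       # covariate: correlated with eta and nu
y   <- 2 * x + eta + u + rnorm(N * Tn)   # true slope: 2; nu = u + noise

# Within transformation: deviations from individual means
demean <- function(v) v - ave(v, id)
yw <- demean(y); xw <- demean(x); lw <- demean(l)

b_ols    <- unname(coef(lm(y ~ x))["x"])   # ignores both sources of correlation
b_within <- sum(xw * yw) / sum(xw^2)       # removes eta, not the link with nu

# Within 2SLS: project the demeaned covariate on the demeaned instrument,
# then use the fitted values as in the single-covariate version of (x' P x)^-1 x' P y
xw_hat  <- lw * sum(lw * xw) / sum(lw^2)
b_w2sls <- sum(xw_hat * yw) / sum(xw_hat * xw)

round(c(ols = b_ols, within = b_within, w2sls = b_w2sls), 3)
```

In this design, only the last estimate should be close to the true value of 2: the within transformation deals with the individual effect, and the instrument deals with the idiosyncratic part.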

Example 6‐1 within 2SLS estimator – `SeatBelt` data set

Cohen and Einav (2003) study the influence of using seat belts on the number of deaths on American roads; they consider occupants of the vehicles involved in accidents (about 35,000 killed a year) and non‐occupants (e.g., pedestrians; about 5,000 killed a year). They use panel data for the 50 American states for the 1983‐1997 period. This dataset, called SeatBelt, is available in the pder package. The main covariate is the rate of seat belt usage. Two main questions are analyzed:

the first one concerns the behavior compensation theory developed by Peltzman (1975). According to this theory, using the seat belt makes the driver more confident and leads him to adopt a less prudent driving behavior. Combined with the expected negative effect from seat belts on occupants' deaths, the global effect on mortality may then be insignificant, or even positive if the mortality of non‐occupants increases with the use of seat belts,
the second deals with the problem of endogeneity: if driving conditions get worse, for example for meteorological reasons, other things being equal, road mortality will increase, but seat belt use will also increase, as drivers are aware that the probability of having an accident is higher. In this case, there is therefore a positive correlation between the error term of the mortality equation and the seat belt use variable. Not taking this endogeneity into account biases the estimated seat belt-use coefficient upward, so that the protective effect of seat belts is underestimated.

Cohen and Einav (2003) use three estimators. First, the model is estimated using OLS and therefore the endogeneity is not taken into account. The second is the within estimator; in this case, the problem of the correlation between the individual effect and the covariate is taken into account as the within transformation wipes out the individual effects. On the contrary, the problem of correlation between the idiosyncratic part of the error and the covariate remains. This last problem is solved using the within 2SLS estimator. The instruments used are variables that indicate the laws concerning the use of seat belts. These variables (ds, dp and dsp) are correlated with the use of seat belts but are assumed to be uncorrelated with the errors. Other variables are also used as controls (and are described in the help page of the dataset).

The instrumental variables estimator is computed using the plm function. The instruments are specified with a two‐part formula, using the Formula package (Zeileis and Croissant, 2010). The first part indicates the covariates of the model, while the second part indicates the instruments. Often, a large subset of covariates are used as instruments. In order to avoid repeating almost the same list of variables twice, it is possible to use a more efficient syntax using the . operator, constructing the second part of the formula by updating the first part. For example, if the covariates are x1, x2 and x3, only x2 is endogenous, and there is only one external instrument z, the model to be estimated can be described by either of the two equivalent following formulas:

y ~ x1 + x2 + x3 | x1 + x3 + z
y ~ x1 + x2 + x3 | . - x2 + z
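The meaning of the . operator can be illustrated with the update method for ordinary formulas in base R. This is only an analogy: we assume that the second part of a two-part formula is updated from the first part in the same way, and we do not reproduce the internal code of plm or Formula here. The names f1, inst_explicit and inst_updated are ours.

```r
# First part of the formula: the covariates
f1 <- y ~ x1 + x2 + x3

# Second part written out in full: exogenous covariates plus the instrument z
inst_explicit <- ~ x1 + x3 + z

# Second part obtained by updating the first part:
# "." stands for x1 + x2 + x3, x2 is removed and z is added
inst_updated <- update(f1, ~ . - x2 + z)

# Both descriptions lead to the same set of instruments
labels_explicit <- attr(terms(inst_explicit), "term.labels")
labels_updated  <- attr(terms(inst_updated), "term.labels")
setequal(labels_explicit, labels_updated)
```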

The three models estimated by Cohen and Einav (2003), which are reproduced below, include time fixed effects. The response (occfat) is the number of vehicle occupants killed on the road.

library("plm")
data("SeatBelt", package = "pder")
# response: log of occupant fatalities per vehicle mile traveled
SeatBelt$occfat <- with(SeatBelt, log(farsocc / (vmtrural + vmturban)))
# "OLS" model with time effects only
ols <- plm(occfat ~ log(usage) + log(percapin) + log(unemp) + log(meanage) +
           log(precentb) + log(precenth) + log(densrur) +
           log(densurb) + log(viopcap) + log(proppcap) +
           log(vmtrural) + log(vmturban) + log(fueltax) +
           lim65 + lim70p + mlda21 + bac08, SeatBelt,
           effect = "time")
# individual and time fixed effects
fe <- update(ols, effect = "twoways")
# within 2SLS: log(usage) is instrumented by the seat belt law dummies
ivfe <- update(fe, . ~ . | . - log(usage) + ds + dp + dsp)

rbind(ols = coef(summary(ols))[1,],
      fe = coef(summary(fe))[1, ],
      w2sls = coef(summary(ivfe))[1, ])
      Estimate Std. Error t-value  Pr(>|t|)
ols     0.1140    0.02547   4.478 9.252e-06
fe     -0.0535    0.02252  -2.376 1.790e-02
w2sls  -0.1334    0.04482  -2.975 3.079e-03

The results confirm that the endogeneity problem is severe. For the first fitted model, the seat belt-use coefficient is significantly positive. It becomes significantly negative for the fixed effects model, which means that usage is strongly correlated with the individual effects. Finally, this coefficient more than doubles in absolute value (from -0.054 to -0.133) when instrumental variables are used, which indicates that the idiosyncratic error is also correlated with usage.

In order to test the behavior compensation theory, the authors estimate the same models, this time using the number of non‐occupants killed (noccfat) as response.

SeatBelt$noccfat <- with(SeatBelt, log(farsnocc / (vmtrural + vmturban)))
nivfe <- update(ivfe, noccfat ~ . | .)
coef(summary(nivfe))[1, ]
  Estimate Std. Error    t-value   Pr(>|t|)
  -0.04237    0.10312   -0.41091    0.68133

The results indicate that seat belt use has no significant influence on out-of-vehicle mortality (the coefficient is small and its p-value is 0.68), which contradicts Peltzman's (1975) theory of behavior compensation.

6.3 Error Components Instrumental Variables Estimator

In the previous section, the potential correlation between some covariates and the individual effects has been treated drastically by using the within transformation, which wipes out the individual effects. In this section, we present the error component instrumental variables estimator. The two components of the error being present in this model, it is in this case essential to tackle the issue of a potential correlation of some covariates with the two components of the error.

6.3.1 The General Model

Suppose in a first step that the idiosyncratic component of the error is not correlated with the covariates. In this case, if all the covariates are uncorrelated with the individual effects, the unbiased efficient estimator is the GLS estimator. This estimator makes it possible, on the one hand, to take into account part of the inter-individual variation in the sample and, on the other hand, to estimate the parameters associated with covariates that don't exhibit temporal variation.

If, on the contrary, all the covariates are correlated with the individual effects, Mundlak (1978) (see subsection 4.2) has shown that the efficient estimator, which is the GLS estimator, is the same as the within estimator if the correlation between the individual effects and the covariates (more precisely the individual means of the covariates) is taken into account.

When only some covariates are correlated with the individual effects, neither of the two previous estimators is appropriate:

the GLS estimator is not consistent anymore because of the correlation of some covariates with the individual effects,
the within estimator is still consistent but not efficient any more, as it doesn't take into account the fact that some covariates are uncorrelated with the individual effects but wipes out all the inter‐individual variation in the sample, especially the covariates that don't exhibit any temporal variation.

The best solution in this case consists in using an estimator that, on the one hand, uses instrumental variables and, on the other hand, exploits the two sources of variability of the panel in an optimal way. The essential question is then to find good instruments, which is often a difficult task. The richness of panel data makes it possible to overcome this problem. Actually, every covariate can generate two instrumental variables, using the between and the within transformations. If a rank condition that will be detailed later on is satisfied, the model can then be estimated without any external instrument. This approach has been used by Hausman and Taylor (1981), Amemiya and MaCurdy (1986), and Breusch et al. (1989).

If we now suspect that some covariates are also correlated with the idiosyncratic part of the error, then none of the estimators we have listed above is consistent. We then use an instrumental variables estimator (within or GLS) using external instruments. This strategy has been developed by Baltagi (1981) with his “error component two-stage least squares” estimator and by Balestra and Varadharajan-Krishnakumar (1987) with their “generalized two-stage least squares” estimator, which differ by the way the instruments are introduced in the model.

These two branches of the literature have been developed separately, and this dichotomy also exists in most software packages, which usually provide two different functions to estimate these models. We'll follow the approach of Cornwell et al. (1992), who provide a unified view of panel models with instrumental variables. These authors consider three kinds of variables:

the endogenous variables, which are correlated with the two components of the error,
the simply exogenous variables, which are correlated with the individual effects but not with the idiosyncratic part of the error,
the doubly exogenous variables, which are uncorrelated with both components of the error.

Variables from the first category don't provide any usable instrument. For the second one, the within transformation is a valid instrument, as it is by construction orthogonal to the individual effects and by hypothesis uncorrelated with the idiosyncratic part. Finally, each covariate of the third category provides two instruments by using the within and the between transformation.

Consider now the specific case of time-invariant covariates. For these variables, $W x = 0$ and $B x = x$. Therefore, such a variable provides either one instrument, if it is uncorrelated with the individual effects (the covariate itself), or no instrument.
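The behavior of the two transformations for a time-invariant covariate is easy to verify. The sketch below uses a toy balanced panel with three individuals and four periods; the numbers and the helper names between_tr and within_tr are made up for the illustration.

```r
# Toy panel: 3 individuals observed over 4 periods (made-up numbers)
id <- rep(1:3, each = 4)
x_const <- rep(c(10, 20, 30), each = 4)      # time-invariant covariate
x_vary  <- c(1, 2, 3, 4, 2, 4, 6, 8, 0, 1, 0, 1)  # time-varying covariate

between_tr <- function(v, id) ave(v, id)      # individual means, repeated T times
within_tr  <- function(v, id) v - ave(v, id)  # deviations from individual means

# Time-invariant covariate: the within transformation returns zeros and
# the between transformation returns the variable itself
W_const <- within_tr(x_const, id)
B_const <- between_tr(x_const, id)

# Time-varying covariate: two distinct (and orthogonal) components
W_vary <- within_tr(x_vary, id)
B_vary <- between_tr(x_vary, id)

cbind(id, x_const, W_const, B_const, x_vary, W_vary, B_vary)
```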

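We start with the model to be estimated written in matrix form:

$$y = Z\gamma + \epsilon$$

With the usual hypotheses concerning the error component model, the variance matrix of the error is $\Omega = \sigma_\nu^2 W + \sigma_\iota^2 B$, with $\sigma_\iota^2 = \sigma_\nu^2 + T \sigma_\eta^2$. We first pre-multiply the model by $\Omega^{-1/2} = \frac{1}{\sigma_\nu} W + \frac{1}{\sigma_\iota} B$ and then obtain a transformed model for which the errors are iid.

We then apply to this model the instrumental variables method, using a set of instruments, which, denoting by $X_1$ the doubly exogenous variables, by $X_2$ the simply exogenous variables, and by $A$ the whole set of instruments, can be written:

$$A = (W X_1, W X_2, B G)$$

where $G$ is a set of variables that will be defined later. For now, just consider that these variables must provide valid instruments when the between transformation is applied.

The instrumental variables estimator is, denoting by $P_A = A (A^\top A)^{-1} A^\top$ the projection matrix defined by the instruments and by $y^* = \Omega^{-1/2} y$ and $Z^* = \Omega^{-1/2} Z$ the transformed response and covariates:

$$\hat{\gamma} = (Z^{*\top} P_A Z^*)^{-1} Z^{*\top} P_A y^*$$

The two matrices $W$ and $B$ being orthogonal, the projection matrix may also be written as the sum of two projection matrices defined by the instruments transformed by the within and the between matrices:

$$P_A = P_W + P_B$$

where $P_W$ projects on the columns of $(W X_1, W X_2)$ and $P_B$ on those of $B G$. The estimator is then:

$$\hat{\gamma} = \left(\frac{1}{\sigma_\nu^2} Z^\top P_W Z + \frac{1}{\sigma_\iota^2} Z^\top P_B Z\right)^{-1} \left(\frac{1}{\sigma_\nu^2} Z^\top P_W y + \frac{1}{\sigma_\iota^2} Z^\top P_B y\right)$$

or also, denoting $\phi^2 = \sigma_\nu^2 / \sigma_\iota^2$:

$$\hat{\gamma} = \left(Z^\top P_W Z + \phi^2 Z^\top P_B Z\right)^{-1} \left(Z^\top P_W y + \phi^2 Z^\top P_B y\right) \tag{6.9}$$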
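The decomposition of the projection matrix into a within part and a between part, on which this expression rests, can be checked numerically on a small balanced panel. In the sketch below, the matrices X and G are filled with random numbers and merely stand for the within-transformed and between-transformed sets of instruments; all names and dimensions are assumptions made for the illustration.

```r
# Small balanced panel: the NT x NT matrices are formed explicitly (assumed sizes)
set.seed(7)
N <- 6; Tn <- 4
B <- kronecker(diag(N), matrix(1 / Tn, Tn, Tn))  # between transformation matrix
W <- diag(N * Tn) - B                            # within transformation matrix

X <- matrix(rnorm(N * Tn * 2), ncol = 2)  # stands for the variables used as W X
G <- matrix(rnorm(N * Tn * 2), ncol = 2)  # stands for the variables used as B G

# Projection matrix on the columns of M
proj <- function(M) M %*% solve(crossprod(M), t(M))

A   <- cbind(W %*% X, B %*% G)  # full set of instruments
P_A <- proj(A)
P_W <- proj(W %*% X)            # projection defined by the within instruments
P_B <- proj(B %*% G)            # projection defined by the between instruments

# W and B being orthogonal, P_A = P_W + P_B and P_W P_B = 0
all.equal(P_A, P_W + P_B)
max(abs(P_W %*% P_B))
```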

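One can check that, as in the simple error component model, this estimator is a weighted average of the within and the between instrumental variables estimators: $\hat{\gamma} = D_W \hat{\gamma}_W + D_B \hat{\gamma}_B$, with:

$$D_W = \left(Z^\top P_W Z + \phi^2 Z^\top P_B Z\right)^{-1} Z^\top P_W Z, \qquad D_B = \left(Z^\top P_W Z + \phi^2 Z^\top P_B Z\right)^{-1} \phi^2 Z^\top P_B Z = I - D_W$$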

Several models proposed in the literature are special cases of this general model.

6.3.2 Special Cases of the General Model

6.3.2.1 The within Model

Firstly, if there are no external instruments and if all the covariates are simply exogenous, we have $G = \emptyset$ and $A = W X$, and the within estimator results.

Then, if all the covariates are either simply exogenous or endogenous and if the external instruments are simply exogenous, we also have $G = \emptyset$, and $X_2$ is constituted only by simply exogenous covariates and external instruments. The condition for identification is then that the number of external instruments must be at least equal to the number of endogenous covariates. We then have the within instrumental variables estimator:

$$\hat{\beta} = (X^\top P_W X)^{-1} X^\top P_W y$$

with $P_W$ the projection matrix on the columns of $W X_2$.

6.3.2.2 Error Components Two Stage Least Squares

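Baltagi (1981)'s estimator is the special case where $A = (W L, B L)$, which means that all the instruments $L$ (and potentially some of the covariates) are assumed to be doubly exogenous and are therefore used twice. We start from equations 6.5 and 6.7, which lead respectively to the within and between estimators. Stacking these two equations, we obtain:

$$\begin{pmatrix} L^\top W y \\ L^\top B y \end{pmatrix} = \begin{pmatrix} L^\top W Z \\ L^\top B Z \end{pmatrix} \gamma + \begin{pmatrix} L^\top W \epsilon \\ L^\top B \epsilon \end{pmatrix}$$

which is justified by the fact that the vector of parameters to be estimated is the same in the two equations. In order to apply GLS, we compute the variance of the errors of the stacked model:

$$\operatorname{V}\begin{pmatrix} L^\top W \epsilon \\ L^\top B \epsilon \end{pmatrix} = \begin{pmatrix} \sigma_\nu^2 L^\top W L & 0 \\ 0 & \sigma_\iota^2 L^\top B L \end{pmatrix}$$

We then apply the formula of the GLS estimator:

$$\hat{\gamma} = \left(\frac{1}{\sigma_\nu^2} Z^\top W L (L^\top W L)^{-1} L^\top W Z + \frac{1}{\sigma_\iota^2} Z^\top B L (L^\top B L)^{-1} L^\top B Z\right)^{-1} \left(\frac{1}{\sigma_\nu^2} Z^\top W L (L^\top W L)^{-1} L^\top W y + \frac{1}{\sigma_\iota^2} Z^\top B L (L^\top B L)^{-1} L^\top B y\right)$$

and we finally obtain, with $P_W$ and $P_B$ the projection matrices defined by $W L$ and $B L$:

$$\hat{\gamma} = \left(\frac{1}{\sigma_\nu^2} Z^\top P_W Z + \frac{1}{\sigma_\iota^2} Z^\top P_B Z\right)^{-1} \left(\frac{1}{\sigma_\nu^2} Z^\top P_W y + \frac{1}{\sigma_\iota^2} Z^\top P_B y\right) \tag{6.10}$$

which is the special case of the general model defined by equation 6.9 for which $X_1 = G = L$ and $X_2 = \emptyset$.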

6.3.2.3 The Hausman and Taylor Model

In the Hausman and Taylor (1981) model, there are no endogenous variables, only simply or doubly exogenous variables. We then have $X = (X_1, X_2)$, and $G = X_1$. Moreover, the authors stress the presence of variables with ($X^v$) or without ($X^c$) time variation. The set of instruments they use is:

$$A = (W X_1^v, W X_2^v, B X_1^v, X_1^c)$$

Only covariates that exhibit time variation may be used with their within transformation ($W X_1^v$ and $W X_2^v$), and doubly exogenous time-invariant variables are used without transformation as instruments ($X_1^c$). Without external instruments, denoting by $K_1^v$, $K_2^v$, $K_1^c$ and $K_2^c$ the number of covariates of the 4 categories, the number of instruments is $2 K_1^v + K_2^v + K_1^c$, whereas the number of covariates is $K_1^v + K_2^v + K_1^c + K_2^c$. The model is then identified if $K_1^v \geq K_2^c$, i.e., if the number of doubly exogenous time-varying variables (which provide two instruments) is at least as large as the number of time-invariant simply exogenous variables, which provide no instrument.
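The counting argument behind this order condition can be written as a small helper. The function name ht_order_condition and its argument names are hypothetical; the function only counts instruments and covariates for given numbers of variables in each of the four categories, assuming that there are no external instruments, and it says nothing about the rank condition.

```r
# Hypothetical helper: order condition of the Hausman-Taylor model
# k1v: doubly exogenous, time-varying     k2v: simply exogenous, time-varying
# k1c: doubly exogenous, time-invariant   k2c: simply exogenous, time-invariant
ht_order_condition <- function(k1v, k2v, k1c, k2c) {
  # doubly exogenous time-varying variables are used twice (within and between)
  n_instruments <- 2 * k1v + k2v + k1c
  n_covariates  <- k1v + k2v + k1c + k2c
  list(instruments = n_instruments,
       covariates  = n_covariates,
       identified  = n_instruments >= n_covariates)  # equivalent to k1v >= k2c
}

# Two made-up configurations
ok  <- ht_order_condition(k1v = 2, k2v = 3, k1c = 1, k2c = 1)
bad <- ht_order_condition(k1v = 1, k2v = 3, k1c = 1, k2c = 2)
c(ok = ok$identified, bad = bad$identified)
```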

6.3.2.4 The Amemiya‐Macurdy Estimator

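Hausman and Taylor (1981)'s estimator is consistent if the individual means of the doubly exogenous variables are uncorrelated with the individual effects. Amemiya and MaCurdy (1986) use the stronger hypothesis that the doubly exogenous variables are uncorrelated with the individual effects for each period. We then have: $\operatorname{E}(x_{nt}\, \eta_n) = 0$ for every period $t$ and for every doubly exogenous covariate $x$. The corresponding instrument matrix is constructed the following way. Let $X_{1n}$ be the matrix of doubly exogenous instruments of dimension $T \times K_1$ for individual $n$. $\operatorname{vec}(X_{1n})$ is a vector of length $T K_1$ obtained by stacking the columns of $X_{1n}$. The instrument matrix for individual $n$ is then $j_T \operatorname{vec}(X_{1n})^\top$, with $j_T$ a vector of ones of length $T$, and for the whole sample, we obtain a matrix of dimension $NT \times T K_1$:

$$X_1^* = \begin{pmatrix} j_T \operatorname{vec}(X_{11})^\top \\ j_T \operatorname{vec}(X_{12})^\top \\ \vdots \\ j_T \operatorname{vec}(X_{1N})^\top \end{pmatrix} \tag{6.11}$$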
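The construction of this instrument matrix can be sketched in base R. The panel below is made of random numbers and the names (X1, am_block, X1_star) are ours; the data are assumed to be sorted by individual and then by period. For each individual, every one of the T rows contains all the values taken by the doubly exogenous variables over the T periods.

```r
# Toy balanced panel sorted by individual, then by period (made-up data)
set.seed(1)
N <- 3; Tn <- 4; K1 <- 2
id <- rep(seq_len(N), each = Tn)
X1 <- matrix(rnorm(N * Tn * K1), ncol = K1)  # doubly exogenous variables

# Block for one individual: a column of ones (T x 1) times vec(X1n)' (1 x T*K1)
am_block <- function(X1n) {
  matrix(1, nrow(X1n), 1) %*% t(as.vector(X1n))
}

# Apply the construction to every individual and stack the blocks
rows_by_id <- split(seq_len(nrow(X1)), id)
blocks <- lapply(rows_by_id, function(i) am_block(X1[i, , drop = FALSE]))
X1_star <- do.call(rbind, unname(blocks))

dim(X1_star)  # N*T rows and T*K1 columns
```

Each variable therefore contributes T instruments instead of the single individual mean used by Hausman and Taylor (1981).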

6.3.2.5 The Breusch, Mizon and Schmidt's Estimator

Breusch et al. (1989) expand the instruments used by Amemiya and MaCurdy (1986) by assuming that the within transformations of simply exogenous covariates are valid instruments at every period. Stated differently: $\operatorname{E}((W x)_{nt}\, \eta_n) = 0$ for every period $t$ and for every simply exogenous covariate $x$. We then obtain the further matrix of instruments $(W X_2)^*$ by applying to $W X_2$ the same transformation as the one used in equation 6.11. The other contribution of Breusch et al. (1989) is to show how the different estimators can be presented in a consistent and nested way. They use the fact that the projection subspace defined by $(W X_1, B X_1)$ is the same as the one defined by $(W X_1, X_1)$:

Hausman and Taylor (1981): $A = (W X_1, W X_2, X_1)$,
Amemiya and MaCurdy (1986): $A = (W X_1, W X_2, X_1^*)$,
Breusch et al. (1989): $A = (W X_1, W X_2, X_1^*, (W X_2)^*)$.

As each estimator adds instruments to the previous one, if these instruments are valid, it is necessarily more efficient. Moreover, the validity of extra instruments may be tested by comparing the two models with a Hausman test.

6.3.2.6 Balestra and Varadharajan‐Krishnakumar Estimator

This last estimator, proposed by Balestra and Varadharajan-Krishnakumar (1987), is not, contrary to the others, a special case of the general model previously presented. For this model, called the G2SLS estimator (for “generalized two-stage least squares”), the transformation applied to the instruments is the same as the one applied to the covariates and to the response. Therefore, the matrix of instruments is:

$$A = \Omega^{-1/2} L$$

Baltagi and Li (1992) have shown that the instruments used by Baltagi (1981), $(W L, B L)$, perform the same projection as $(\Omega^{-1/2} L, W L)$ and $(\Omega^{-1/2} L, B L)$. The instruments used by Balestra and Varadharajan-Krishnakumar (1987) are therefore a subset of those used by Baltagi (1981), the supplementary instruments used by Baltagi (1981) being either $W L$ or $B L$. Therefore, the estimator of Baltagi (1981) is necessarily not less efficient than the one of Balestra and Varadharajan-Krishnakumar (1987). Baltagi and Li (1992) show, using White (1986), that the supplementary instruments used by Baltagi (1981) are redundant, which means that they don't add any gain in terms of asymptotic efficiency. Consequently, both estimators have the same asymptotic variance.

However, the estimator of Balestra and Varadharajan‐Krishnakumar (1987) has an important drawback. A part of the between component of every instrumental variable is included in the instruments, and consequently, the estimator of Balestra and Varadharajan‐Krishnakumar (1987) is unable to take into account simply exogenous instruments.

With plm, the way instruments are introduced is indicated by the inst.method argument: 'baltagi' indicates that instruments are introduced with the within and the between transformations, 'am' uses the set of instruments used by Amemiya and MaCurdy (1986), 'bms' the one used by Breusch et al. (1989), and 'bvk' indicates that the instrumental variables are transformed the same way as the covariates and the response, as proposed by Balestra and Varadharajan-Krishnakumar (1987).

Example 6‐2 EC2SLS estimator – `ForeignTrade` data set

Kinal and Lahiri (1993) studied the determinants of international trade for developing countries and especially the measure of the price and income elasticities. This question is very important because it crucially determines the growth and debt of these countries. The panel dataset used concerns 31 developing countries, for the period 1964‐1986. It is available as ForeignTrade in the pder package.

More precisely, Kinal and Lahiri (1993) estimate three equations: the first one defines the demand for imports, the second one the demand for exports, and the last one the exports supply. The authors suppose that:

the demand for imports (imports) increases with the domestic income (gnp), decreases with the price of imports in local currency divided by domestic prices (pmcpi), and rises with the one-period lag of the ratio of reserves to imports (resimp),
exports demand (exports) rises with the world income (gnpw) and decreases with the relative price of exports with respect to their foreign substitutes (pxpw),
exports supply (exports) increases with the world price in domestic currency divided by the domestic consumer price index (pwcpi), with the potential domestic product (pgnp, used as a proxy for the capital stock) and also depends positively on a variable that represents the influence of the imports in the supply of exports (importspmpx, measured by imports in local currency divided by export price).³

All the variables are per capita and in logs, in order to avoid heteroscedasticity problems.

In order to take the dynamics of adjustment into account, a one‐period lag of the response is introduced as a covariate in every equation.

gnp, exports, imports, and their lags (and therefore resimp and importspmpx) are assumed to be endogenous, as are the export price (which implies that pxpw is endogenous) and the domestic consumer price index (which implies that pmcpi and pwcpi are also endogenous). Among the covariates, only gnpw and pgnp are assumed to be exogenous and can therefore be used as instruments. Numerous external instruments are also introduced: a linear trend (trend), the population (pop), the exchange rate (exrate), the consumption (consump), the disposable income (income), the reserves (reserves), money supply (money), the consumer price index (cpi), import prices (pm), export prices (px), and world prices (pw), most of the time with a one-period lag.

Kinal and Lahiri (1993) extend the article of Khan and Knight (1988), who estimated a system of equations explaining the determinants of international trade for developing countries using the within transformation. They looked for a more efficient estimator, and for this purpose they employed the EC2SLS estimator. However, the latter is consistent only if the instruments are uncorrelated with the individual effects. Their strategy is to use the same specification for the within and the EC2SLS estimators and to test the hypothesis of exogeneity of the instruments through a Hausman test.

We present below the results obtained for the imports demand equation. The within and the EC2SLS models are estimated. Kinal and Lahiri (1993) use a nonstandard method to estimate the variance of the error components. It is similar to Nerlove (1971), but with a degrees of freedom correction. It is reproduced here by using the random.dfcor argument.

data("ForeignTrade", package = "pder")
# within 2SLS: covariates before the "|", instruments after it
w1 <- plm(imports ~ pmcpi + gnp + lag(imports) + lag(resimp) |
          lag(consump) + lag(cpi) + lag(income) + lag(gnp) + pm +
          lag(invest) + lag(money) + gnpw + pw + lag(reserves) +
          lag(exports) + trend + pgnp + lag(px),
          ForeignTrade, model = "within")
# EC2SLS: same specification, random effects with Baltagi's instruments
r1 <- update(w1, model = "random", random.method = "nerlove",
             random.dfcor = c(1, 1), inst.method = "baltagi")

The hypothesis of no correlation between the instruments and the individual effects implies that the within and the GLS models are consistent, the latter being more efficient. On the contrary, if this hypothesis is rejected, only the within model is consistent. In order to test this hypothesis the authors used the Hausman (1978) test:

 phtest(r1, w1)

Hausman Test

data:  imports ~ pmcpi + gnp + lag(imports) + lag(resimp) | lag(consump) +  ...
chisq = 11, df = 4, p-value = 0.03
alternative hypothesis: one model is inconsistent
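The statistic reported by phtest is a quadratic form in the difference between the two vectors of coefficients. The base R sketch below computes it by hand; it is not the code of phtest, the function name hausman_stat is hypothetical, and the coefficients and variance matrices in the example are invented numbers.

```r
# Hausman statistic: (b_c - b_e)' [V_c - V_e]^{-1} (b_c - b_e)
# b_cons, V_cons: estimator consistent under both hypotheses (e.g., within)
# b_eff,  V_eff : estimator efficient under the null (e.g., GLS)
hausman_stat <- function(b_cons, b_eff, V_cons, V_eff) {
  d  <- b_cons - b_eff
  dV <- V_cons - V_eff
  stat <- drop(t(d) %*% solve(dV, d))
  df <- length(d)
  list(statistic = stat, df = df,
       p.value = pchisq(stat, df, lower.tail = FALSE))
}

# Invented numbers, for illustration only
res <- hausman_stat(b_cons = c(1.0, 0.5), b_eff = c(0.9, 0.45),
                    V_cons = diag(c(0.010, 0.004)),
                    V_eff  = diag(c(0.006, 0.003)))
res

# p-value implied by the rounded statistic printed above (chisq = 11, df = 4)
p_reported <- pchisq(11, df = 4, lower.tail = FALSE)
p_reported
```

The last value, about 0.027, is consistent with the p-value of 0.03 printed by phtest, given that the statistic itself is rounded in the output.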

The hypothesis of no correlation between the instruments and the individual effects is rejected at the 5% threshold.⁴ One solution would be to maintain the within estimator, but Kinal and Lahiri (1993), following Cornwell et al. (1992), considered two kinds of instruments:

those that are not correlated with the individual effects and that therefore can be used twice using the within and the between transformations,
those that are correlated with the individual effects and that can therefore only be used in their within transformation.

Such a model is defined using a three-part formula:

the first part, as before, indicates the covariates,
the second part indicates the doubly exogenous instruments,
the third part indicates the simply exogenous instruments.

Kinal and Lahiri (1993) finally got the following specification:

r1b <- plm(imports ~ pmcpi + gnp + lag(imports) + lag(resimp) |
           lag(consump) + lag(cpi) + lag(income) + lag(px) +
           lag(reserves) + lag(exports) | lag(gnp) + pm +
           lag(invest) + lag(money) + gnpw + pw + trend + pgnp,
           ForeignTrade, model = "random", inst.method = "baltagi",
           random.method = "nerlove", random.dfcor = c(1, 1))

phtest(w1, r1b)

Hausman Test

data:  imports ~ pmcpi + gnp + lag(imports) + lag(resimp) | lag(consump) +  ...
chisq = 7.1, df = 4, p-value = 0.1
alternative hypothesis: one model is inconsistent

Based on the Hausman (1978) test, the hypothesis of consistency of the GLS estimator is no longer rejected. Results are presented below; the within and GLS estimators give very similar results.

 rbind(within = coef(w1), ec2sls = coef(r1b)[-1])
          pmcpi     gnp lag(imports) lag(resimp)
within -0.05873 0.02890       0.9512     0.05215
ec2sls -0.05420 0.01361       0.9482     0.04195

The short‐term elasticity of imports demand is directly given by the price coefficient. The long‐term elasticity is obtained by dividing this coefficient by one minus the coefficient of the lagged response. We then have:

 elast <- sapply(list(w1, r1, r1b),
                function(x) c(coef(x)["pmcpi"],
                              coef(x)["pmcpi"] / (1 - coef(x)["lag(imports)"])))
dimnames(elast) <- list(c("ST", "LT"), c("w1", "r1", "r1b"))
elast
         w1      r1     r1b
ST -0.05873 -0.0552 -0.0542
LT -1.20393 -1.1953 -1.0465

The use of this GLS estimator, which efficiently exploits part of the inter-individual variation, has reduced the standard errors of all the coefficients, most markedly for gnp (from 0.041 to 0.007).

 rbind(within = coef(summary(w1))[, 2],
      ec2sls = coef(summary(r1b))[-1, 2])
         pmcpi      gnp lag(imports) lag(resimp)
within 0.02915 0.041235      0.03067    0.008257
ec2sls 0.02180 0.006999      0.01289    0.006709

Example 6‐3 Hausman‐Taylor estimator – `TradeEU` data set

The analysis of international trade is often based on the gravity model, inspired by the law of universal gravitation in physics, which states that a particle attracts every other particle in the universe with a force that is directly proportional to the product of their masses and inversely proportional to the square of the distance between their centers. By analogy, in international trade the volume of exchange between two countries (imports and exports) is linked to the “masses” of both countries (which can be measured by the population or by their national product) and to the distance between them. Many econometric analyses of the gravity model have drawn on cross sections of countries. The problem of these studies is that they are unable to take into account unobservable heterogeneity at the country level, which leads to biased estimators. In this respect, the use of panel data seems very useful, but the fact that some covariates are correlated with individual effects often leads to employing the within estimator. The problem in this case is that the time-invariant covariates disappear; yet some of these can be of major interest, especially the distance between two countries. The estimator of Hausman and Taylor (1981) is very useful in this respect, as it makes it possible both to tackle the problem of correlation between some covariates and the individual effects and to estimate the coefficients associated with time-invariant covariates.

Serlenga and Shin (2007) estimate a gravity model for 14 countries of the European Union⁵ observed over 42 years (1960‐2001). In this panel, the individual unit of observation is not a country but a pair of countries for which the volume of trade is given by the sum of bilateral exports and imports. There are, therefore, $14 \times 13 / 2 = 91$ “individuals”.
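The size of the panel follows from counting unordered pairs of countries. The lines below are a small base R check; the object names (`n_countries`, `n_years`, and so on) are illustrative and are not part of the original analysis.

```r
# Number of unordered pairs among 14 countries, observed over 1960-2001
n_countries <- 14
n_years <- length(1960:2001)        # 42 annual observations
n_pairs <- choose(n_countries, 2)   # 14 * 13 / 2 pairs of countries
n_obs <- n_pairs * n_years          # observations in the balanced panel
c(pairs = n_pairs, years = n_years, observations = n_obs)
```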

The response trade is the logarithm of the sum of bilateral imports and exports. The covariates are: gdp, the sum of the logarithms of the two national products; dist, the distance between the capitals of the two countries; sim, a measure of the similarity between the pair of countries; rlf, the relative factor endowment; and rer, the logarithm of the real exchange rate. To these quantitative variables, several qualitative variables are added: mutual membership of the European Community, cee, and of the Euro Zone, emu; common border, bor; and common language, lan.

The dataset, called TradeEU, is available in the pder package.

 data("TradeEU", package = "pder")

Following the authors, we first estimate the OLS and the within model:

 ols <- plm(trade ~ gdp + dist + rer + rlf + sim + cee + emu + bor + lan, TradeEU,
           model = "pooling", index = c("pair", "year"))
fe <- update(ols, model = "within")
fe

Model Formula: trade ~ gdp + dist + rer + rlf + sim + cee + emu + bor + lan

Coefficients:
   gdp    rer    rlf    sim    cee    emu
1.8125 0.0610 0.0325 1.1723 0.3093 0.0852

As expected, the coefficients associated with dist, bor, and lan are not estimated in the within model, as these covariates disappear with the within transformation. By contrast, the random effects estimator produces estimates for their coefficients.

 re <- update(fe, model = "random")
re

Model Formula: trade ~ gdp + dist + rer + rlf + sim + cee + emu + bor + lan

Coefficients:
(Intercept)         gdp        dist         rer         rlf
   -13.9303      1.7949     -0.5909      0.0690      0.0334
        sim         cee         emu         bor         lan
     1.1427      0.3182      0.0927      0.4414      0.4172

The results of the random effects model indicate a distance elasticity of bilateral trade of about $-0.59$, and that having a common border or a common language has a similar effect (a coefficient of about 0.4 on the logarithm of trade, that is, an increase in trade of roughly 50%).
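Because the response is in logarithms, the coefficient $b$ of a dummy variable corresponds to an exact relative change of $\exp(b) - 1$, which exceeds $b$ when $b$ is not small. The following lines are a minimal sketch that applies this conversion to the two random effects coefficients printed above; the object names `b` and `pct` are illustrative.

```r
# Random effects coefficients of the two dummies, copied from the output above
b <- c(bor = 0.4414, lan = 0.4172)
# Exact percentage change in trade when a dummy switches from 0 to 1
pct <- 100 * (exp(b) - 1)
round(pct, 1)
```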

 phtest(re, fe)

Hausman Test

data:  trade ~ gdp + dist + rer + rlf + sim + cee + emu + bor + lan
chisq = 13, df = 6, p-value = 0.04
alternative hypothesis: one model is inconsistent

With the Hausman test, we reject the hypothesis of no correlation between the covariates and the individual effects at the 5% threshold (the p-value is 0.04).

Serlenga and Shin (2007) consider that, among the time‐invariant variables, only lan is correlated with the individual effects. Two Hausman and Taylor (1981) models are then estimated. In the first one, the only doubly exogenous time‐varying variable is the real exchange rate rer. In this case, the instrumental variables estimator is just identified, as there is only one instrument (the between transformation of rer) and only one endogenous time‐invariant variable, lan. In the second one, domestic product gdp and relative factor endowment rlf are also used as instruments.
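The identification argument amounts to comparing the number of time‐varying covariates assumed exogenous with the number of time‐invariant covariates assumed endogenous. The base R lines below are a small sketch of this count for the two specifications; the object names (`exo_varying`, `endo_invariant`, `overid`) are hypothetical labels chosen for this illustration.

```r
# Time-varying covariates assumed uncorrelated with the individual effects
exo_varying <- list(ht1 = "rer", ht2 = c("rer", "gdp", "rlf"))
# Time-invariant covariates assumed correlated with the individual effects
endo_invariant <- "lan"
# Degree of overidentification: available instruments minus endogenous variables
overid <- lengths(exo_varying) - length(endo_invariant)
ifelse(overid == 0, "just identified", "overidentified")
```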

 ht1 <- plm(trade ~ gdp + dist + rer + rlf + sim + cee + emu + bor + lan |
           rer + dist + bor | gdp + rlf + sim + cee + emu + lan,
           data = TradeEU, model = "random", index = c("pair", "year"),
           inst.method = "baltagi", random.method = "ht")
ht2 <- update(ht1, trade ~ gdp + dist + rer + rlf + sim + cee + emu + bor + lan |
              rer + gdp + rlf + dist + bor | sim + cee + emu + lan)

Note that random.method is set to 'ht' so that the within residuals used to compute the variance of the components of the error are purged of the influence of the time‐invariant covariates.⁶ The consistency of either specification is not rejected by the Hausman test.

 phtest(ht1, fe)

Hausman Test

data:  trade ~ gdp + dist + rer + rlf + sim + cee + emu + bor + lan |  ...
chisq = 5e-25, df = 6, p-value = 1
alternative hypothesis: one model is inconsistent

 phtest(ht2, fe)

Hausman Test

data:  trade ~ gdp + dist + rer + rlf + sim + cee + emu + bor + lan |  ...
chisq = 2.2, df = 6, p-value = 0.9
alternative hypothesis: one model is inconsistent

The last estimated model is suggested by Baltagi (2012). It is similar to the second specification but uses the instruments suggested by Amemiya and MaCurdy (1986) instead. The results are presented in table 6.1 by using the texreg package (see Leifeld, 2013).

 ht2am <- update(ht2, inst.method = "am")

 library("texreg")
texreg(list(ols, fe, re, ht1, ht2, ht2am),
       custom.model.names = c("OLS", "FE", "RE", "HT1", "HT2", "AM2"),
       caption = "Estimations of the gravity model.", label = "table:gravity",
       custom.gof.names  = c("R$^2$", "Adj. R$^2$", "Num. obs.", "s\\_idios",
                             "s\\_id"),
       scriptsize = FALSE)

Table 6.1 Estimations of the gravity model.

The results of table 6.1 show first that the coefficients of the time‐varying covariates are identical for the within and the just‐identified Hausman and Taylor (1981) estimators. This is not the case with the ht2 model, which is overidentified, as noted by Baltagi (2012). Serlenga and Shin (2007) insist on the fact that the Hausman and Taylor (1981) estimations lead to a great reduction of the influence of the distance and an important increase of the influence of common language and common border. This last conclusion is qualified by Baltagi (2012), who uses the more efficient Amemiya and MaCurdy (1986) estimator. The latter introduces further orthogonality conditions by imposing that doubly exogenous variables be uncorrelated with individual effects in every period, while the Hausman and Taylor (1981) estimator simply requires no correlation between individual effects and the individual means of these variables. If these conditions are valid (which can be tested through the Hausman procedure), this estimator is at least as efficient as that of Hausman and Taylor (1981).
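The difference between the two instrument sets can be illustrated on a toy balanced panel. The sketch below uses base R only, with simulated data and hypothetical names (`n_ind`, `n_per`, `x`): for one doubly exogenous variable, the Hausman and Taylor (1981) approach contributes a single instrument built from individual means, while the Amemiya and MaCurdy (1986) approach contributes one instrument per period.

```r
set.seed(1)
n_ind <- 4                                    # individuals
n_per <- 3                                    # periods
id <- rep(seq_len(n_ind), each = n_per)       # data sorted by individual, then period
x <- rnorm(n_ind * n_per)                     # a doubly exogenous covariate

# Hausman-Taylor: the individual mean, repeated over the periods (one column)
ht_inst <- ave(x, id)

# Amemiya-MaCurdy: the value of each period, repeated over the periods
# (one column per period)
x_wide <- matrix(x, nrow = n_ind, byrow = TRUE)   # row i holds x_i1, ..., x_iT
am_inst <- x_wide[id, , drop = FALSE]

dim(am_inst)
```

The individual mean is the average of the period‐specific columns, so the columns of `am_inst` span `ht_inst`; this is why the Amemiya and MaCurdy conditions are stronger than those of Hausman and Taylor.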

 phtest(ht2am, fe)

Hausman Test

data:  trade ~ gdp + dist + rer + rlf + sim + cee + emu + bor + lan |  ...
chisq = 10, df = 6, p-value = 0.1
alternative hypothesis: one model is inconsistent

The validity of the supplementary instruments used for the Amemiya and MaCurdy (1986) estimator is not rejected by the Hausman test. The standard error of the coefficient of the endogenous variable (lan) is much lower than with the Hausman and Taylor (1981) estimator (0.24 vs 0.68). The coefficients of the three time‐invariant covariates are closer to the OLS coefficients than to the Hausman and Taylor (1981) coefficients.

6.4 Estimation of a System of Equations

Instead of estimating only one equation, we can consider a whole system of simultaneous equations, in order to take into account the correlation between the errors of different equations. The estimator obtained is a mix of the 2SLS estimator described in the previous chapter and the SUR estimator (see 3.2.4).

6.4.1 The Three Stage Least Squares Estimator

When there is no correlation between the covariates and the error, the relevant model for the system of equations is the SUR model, which is a GLS estimator and is described in section 3.2. Denoting by the matrix of covariance of the errors of the equations, the variance of the errors of the system is , and the SUR estimator is:

This expression involves square matrices of dimensions equal to the sample size. It is therefore not operational for large samples, and it is numerically inefficient anyway. It is therefore preferred, as often happens for GLS estimators, to apply OLS on transformed data. Denoting by the elements of the matrix , each variable of the model is transformed by pre‐multiplying it by: . We then have:

The three‐stage least squares estimator is obtained by using the moment conditions: , for which the variance is: . Consistently with the method of moments approach, the estimator is obtained by minimizing a quadratic form of the vector of moments, using the inverse of the variance matrix of these moments:

First order conditions for a minimum are:

Solving this linear system of equations, we obtain the 3SLS estimator:

(6.12)

The 3SLS estimator may be obtained by employing the instrumental variables estimator, pre‐multiplying the covariates and the response by and the instruments by . The instruments are then and define the following projection matrix:

But:

We then have

Using this projection matrix in the formula of the instrumental variables estimator 6.3 we finally get:

(6.13)

which is the formula 6.12 of the 3SLS estimator. Of course, as in the GLS estimator, is in practice unknown and shall be estimated based on the results from a consistent preliminary estimation.
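For reference, the steps of this derivation can be written compactly as follows. The notation is an assumption made for this sketch and may differ from the one used elsewhere in the chapter: $y$, $Z$, $\gamma$ and $\epsilon$ denote the response, covariates, coefficients and errors of the system stacked by equation; $L$ is a matrix of instruments common to all equations, with $P_L$ the associated projection matrix; $\Sigma$ is the covariance matrix of the errors of the equations and $\Sigma^{1/2}$ its symmetric square root; starred variables are the transformed ones.

```latex
\begin{align*}
\Omega &= \mathrm{V}(\epsilon) = \Sigma \otimes I,
  \qquad P_L = L (L^\top L)^{-1} L^\top \\
\hat{\gamma}_{\mathrm{SUR}} &= \left[Z^\top (\Sigma^{-1} \otimes I) Z\right]^{-1}
  Z^\top (\Sigma^{-1} \otimes I)\, y \\
% moment conditions and their variance
\mathrm{E}\left[(I \otimes L)^\top \epsilon\right] &= 0, \qquad
\mathrm{V}\left[(I \otimes L)^\top \epsilon\right]
  = (I \otimes L^\top)(\Sigma \otimes I)(I \otimes L) = \Sigma \otimes L^\top L \\
% quadratic form minimized by the estimator, with \epsilon = y - Z\gamma
q(\gamma) &= \epsilon^\top (I \otimes L)(\Sigma \otimes L^\top L)^{-1}
  (I \otimes L^\top)\,\epsilon
  = \epsilon^\top (\Sigma^{-1} \otimes P_L)\,\epsilon \\
% first order conditions and their solution
0 &= Z^\top (\Sigma^{-1} \otimes P_L)(y - Z\hat{\gamma}) \\
\hat{\gamma}_{\mathrm{3SLS}} &= \left[Z^\top (\Sigma^{-1} \otimes P_L) Z\right]^{-1}
  Z^\top (\Sigma^{-1} \otimes P_L)\, y \\
% instrumental variables interpretation on transformed data
y^* &= (\Sigma^{-1/2} \otimes I)\, y, \quad
Z^* = (\Sigma^{-1/2} \otimes I)\, Z, \quad
L^* = (\Sigma^{1/2} \otimes I)(I \otimes L) = \Sigma^{1/2} \otimes L \\
P_{L^*} &= (\Sigma^{1/2} \otimes L)(\Sigma \otimes L^\top L)^{-1}
  (\Sigma^{1/2} \otimes L^\top) = I \otimes P_L \\
Z^{*\top} P_{L^*} Z^* &= Z^\top (\Sigma^{-1/2} \otimes I)(I \otimes P_L)
  (\Sigma^{-1/2} \otimes I) Z = Z^\top (\Sigma^{-1} \otimes P_L) Z
\end{align*}
```

In this sketch the projection $P_{L^*}$ reduces to $I \otimes P_L$ whichever invertible factor of $\Sigma$ is used to transform the instruments, which is why the instrumental variables estimator on the transformed data coincides with the 3SLS expression.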

The practical computation of the 3SLS estimator consists then of the following steps:

each equation is first estimated independently using the instrumental variables estimator, which leads to a matrix of residuals which is a consistent estimate of the errors of the equations,
the covariance matrix of the errors of the system is then estimated:
the Cholesky decomposition of this matrix is computed: ,
the variables are transformed using this matrix: , and .
and finally the instrumental variables estimator is applied to the transformed system.
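The steps above can be sketched in base R on simulated data. Everything in this block is an illustrative assumption rather than the implementation used by plm: the two‐equation data‐generating process, the object names (`L`, `Z1`, `Z2`, `iv`, `Sigma`, `C`) and the choice of instruments common to both equations. With common instruments the projection matrix of the transformed system is `kronecker(diag(2), PL)` whatever transformation is applied to the instruments, so the sketch uses it directly.

```r
set.seed(42)
n <- 200
# Common instruments (with intercept) and errors correlated across two equations
L <- cbind(1, matrix(rnorm(n * 3), n, 3))
E <- matrix(rnorm(n * 2), n, 2) %*% chol(matrix(c(1, 0.6, 0.6, 1), 2))
# One endogenous covariate per equation: it depends on the equation's error
x1 <- drop(L %*% c(0, 1, 1, 0)) + 0.5 * E[, 1] + rnorm(n)
x2 <- drop(L %*% c(0, 0, 1, 1)) + 0.5 * E[, 2] + rnorm(n)
y1 <- 1 + 2 * x1 + E[, 1]
y2 <- -1 + 0.5 * x2 + E[, 2]
Z1 <- cbind(1, x1)
Z2 <- cbind(1, x2)

PL <- L %*% solve(crossprod(L), t(L))      # projection on the instruments
iv <- function(Z, y, P) solve(t(Z) %*% P %*% Z, t(Z) %*% P %*% y)

# Step 1: 2SLS equation by equation, residuals stored column-wise
res <- cbind(y1 - Z1 %*% iv(Z1, y1, PL), y2 - Z2 %*% iv(Z2, y2, PL))
# Step 2: estimated covariance matrix of the errors of the equations
Sigma <- crossprod(res) / n
# Step 3: Cholesky factor C such that t(C) %*% C equals solve(Sigma)
C <- chol(solve(Sigma))
# Step 4: stack the system and transform response and covariates
Z <- rbind(cbind(Z1, 0 * Z2), cbind(0 * Z1, Z2))   # block-diagonal covariates
y <- c(y1, y2)
Tr <- kronecker(C, diag(n))
Zs <- Tr %*% Z
ys <- Tr %*% y
# Step 5: instrumental variables estimator on the transformed system
g3sls <- iv(Zs, ys, kronecker(diag(2), PL))

# Direct evaluation of the closed-form 3SLS expression, for comparison
A <- kronecker(solve(Sigma), PL)
gdirect <- solve(t(Z) %*% A %*% Z, t(Z) %*% A %*% y)
cbind(stepwise = drop(g3sls), direct = drop(gdirect))
```

Both columns should agree up to numerical precision, which illustrates that the transformation route and the closed‐form expression describe the same estimator.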

The computation of the within or between 3SLS estimators is straightforward, as it consists in applying the 3SLS to within or between transformed data.

6.4.2 The Error Components Three Stage Least Squares Estimator

Balestra and Varadharajan‐Krishnakumar (1987) and Baltagi (1981) have proposed 3SLS estimators that use the inter‐ and intra‐individual variations of the data in an optimal way.

From now on, three indexes must be considered: the individual and time indexes, as usual, and also an equation index.

Denoting by , the error vector for individual and equation , the error vector for the system of equations is:

The covariance matrix of the errors is then:

The presence of individual effects makes this model specific compared to the standard 3SLS estimator. Compared to the standard error component model, scalars and are replaced by two covariance matrices and .
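In one possible notation (assumed here, and not necessarily that of the chapter), let $\eta_l$ and $\nu_l$ be the individual effects and the idiosyncratic errors of equation $l = 1, \ldots, M$, with covariance matrices $\Sigma_\eta$ and $\Sigma_\nu$ across equations. The panel is taken to be balanced, with $N$ individuals and $T$ periods, and observations are sorted by individual and then by period within each equation. With $j_T$ a vector of ones, $J_T = j_T j_T^\top$, $B = I_N \otimes J_T / T$ the between matrix and $W = I_{NT} - B$ the within matrix, the covariance matrix of the stacked errors can be sketched as:

```latex
\begin{align*}
\epsilon_l &= (I_N \otimes j_T)\,\eta_l + \nu_l, \qquad
\epsilon^\top = (\epsilon_1^\top, \ldots, \epsilon_M^\top) \\
\Omega &= \Sigma_\eta \otimes (I_N \otimes J_T) + \Sigma_\nu \otimes I_{NT} \\
       &= \Sigma_\nu \otimes W + \Sigma_\iota \otimes B,
       \qquad \Sigma_\iota = \Sigma_\nu + T\,\Sigma_\eta \\
\Omega^{r} &= \Sigma_\nu^{r} \otimes W + \Sigma_\iota^{r} \otimes B
\end{align*}
```

The last line holds for any power $r$ because $W$ and $B$ are symmetric, idempotent, mutually orthogonal and sum to the identity matrix; it is what makes the transformation of the data tractable.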

The 3SLS estimator can then be computed the following way:

firstly, the different equations are estimated using 2SLS so that a consistent estimator of the matrix of the errors of the different equations may be computed;
then, and are estimated by and ,
covariates and responses are transformed by pre‐multiplying them by: ,
instrumental variables are transformed by pre‐multiplying them by: ,
the 2SLS estimator is then applied to the transformed data.
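The two transformations can be checked numerically on a small example. The sketch below is written in base R under assumptions that are not taken from the text: a balanced panel sorted by individual and then by period within each equation, two equations, arbitrary covariance matrices `Sigma_nu` (idiosyncratic errors) and `Sigma_eta` (individual effects), and `Sigma_iota = Sigma_nu + T * Sigma_eta`. The helper `mpow` is a hypothetical function computing a power of a symmetric positive definite matrix.

```r
n_ind <- 3   # individuals
n_per <- 4   # periods
n_eq <- 2    # equations

# Between and within transformation matrices for one equation
B <- kronecker(diag(n_ind), matrix(1 / n_per, n_per, n_per))
W <- diag(n_ind * n_per) - B

# Assumed covariance matrices of the two error components across equations
Sigma_nu <- matrix(c(1.0, 0.2, 0.2, 0.8), 2)
Sigma_eta <- matrix(c(0.5, 0.1, 0.1, 0.3), 2)
Sigma_iota <- Sigma_nu + n_per * Sigma_eta

# Covariance of the stacked errors, built from its definition ...
Omega_def <- kronecker(Sigma_eta, kronecker(diag(n_ind), matrix(1, n_per, n_per))) +
  kronecker(Sigma_nu, diag(n_ind * n_per))
# ... and from its within/between decomposition
Omega <- kronecker(Sigma_nu, W) + kronecker(Sigma_iota, B)

# Power of a symmetric positive definite matrix through its eigen decomposition
mpow <- function(S, r) {
  e <- eigen(S, symmetric = TRUE)
  e$vectors %*% diag(e$values^r) %*% t(e$vectors)
}

# Transformation applied to responses and covariates (power -1/2)
T_var <- kronecker(mpow(Sigma_nu, -0.5), W) + kronecker(mpow(Sigma_iota, -0.5), B)
# Transformation applied to instruments (power 1/2)
T_inst <- kronecker(mpow(Sigma_nu, 0.5), W) + kronecker(mpow(Sigma_iota, 0.5), B)

# The transformed errors have an identity covariance matrix
max(abs(T_var %*% Omega %*% T_var - diag(n_eq * n_ind * n_per)))
```

Under these assumptions the transformed errors are spherical, and `T_inst` is the inverse of `T_var`, so the two transformations are consistent with each other.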

As for the 2SLS estimator, the difference between the estimators of Baltagi (1981) and Balestra and Varadharajan‐Krishnakumar (1987) is that the former uses the within and the between transformations of the instruments, while the latter uses a quasi‐difference transformation.

Example 6‐4 error components 3SLS – `ForeignTrade` data set

Kinal and Lahiri (1993) estimate the system composed of the demand for imports and the demand for exports by 3SLS. To compute this estimator with plm, one has to use as first argument a list containing the description of the equations in the system.

 eqimp <- imports ~ pmcpi + gnp + lag(imports) +
                lag(resimp) | lag(consump) + lag(cpi) + lag(income) +
                lag(px) + lag(reserves) + lag(exports) | lag(gnp) + pm +
                lag(invest) + lag(money) + gnpw + pw  + trend + pgnp
eqexp <- exports ~ pxpw + gnpw + lag(exports) |
                lag(gnp) + pw + lag(consump) + pm + lag(px) + lag(cpi) |
                lag(money) + gnpw +  pgnp + pop + lag(invest) +
                lag(income) + lag(reserves) + exrate
r12 <- plm(list(import.demand = eqimp,
                export.demand = eqexp),
           data = ForeignTrade, index = 31, model = "random",
           inst.method = "baltagi", random.method = "nerlove",
           random.dfcor = c(1, 1))
summary(r12)
Oneway (individual) effect Random Effect Model
   (Nerlove's transformation)
Call:
plm.list(formula = list(import.demand = eqimp, export.demand = eqexp),
    data = ForeignTrade, model = "random", random.method = "nerlove",
    inst.method = "baltagi", index = 31, ... = pairlist(random.dfcor = c(1,
        1)))

Balanced Panel: n = 31, T = 24, N = 744

Effects:

  Estimated standard deviations of the error
      import.demand export.demand
id           0.0619        0.0782
idios        0.1439        0.1200

  Estimated correlation matrix of the individual effects
              import.demand export.demand
import.demand         1.000             .
export.demand         0.138             1

  Estimated correlation matrix of the idiosyncratic effects
              import.demand export.demand
import.demand        1.0000             .
export.demand        0.0975             1

 - import.demand
             Estimate Std. Error t-value Pr(>|t|)
(Intercept)   0.39874    0.11899    3.35  0.00083 ***
pmcpi        -0.05407    0.02170   -2.49  0.01282 *
gnp           0.01103    0.00531    2.08  0.03785 *
lag(imports)  0.95046    0.01187   80.05  < 2e-16 ***
lag(resimp)   0.03948    0.00634    6.22  6.3e-10 ***
‐‐‐
Signif. codes:
0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

 - export.demand
             Estimate Std. Error t-value Pr(>|t|)
(Intercept)    0.1437     0.1395    1.03   0.3032
pxpw          -0.0615     0.0195   -3.16   0.0016 **
gnpw           0.1144     0.0534    2.14   0.0322 *
lag(exports)   0.9465     0.0133   71.11   <2e-16 ***
‐‐‐
Signif. codes:
0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

The coefficients for the imports demand equation are very close to those we obtained using the 2SLS estimator. The estimated correlation between the errors of the two equations is about 14% for the individual effects and about 10% for the idiosyncratic component. Taking this correlation into account slightly reduces the standard errors of the coefficients, as illustrated below.

 # rows 2 to 5 hold the slopes of the import demand equation
rbind(ec2sls = coef(summary(r1b))[-1, 2],
      ec3sls = coef(summary(r12), "import.demand")[2:5, 2])
        pmcpi      gnp lag(imports) lag(resimp)
ec2sls 0.0218 0.006999      0.01289    0.006709
ec3sls 0.0217 0.005308      0.01187    0.006342

6.5 More Empirical Examples

Acconcia et al. (2014) seek to estimate the multiplier effect of public spending. This is a difficult task, as public spending can hardly be considered exogenous. They use a panel of 95 Italian administrative regions (provinces) for the years 1990‐1999 and take advantage of the implementation of anti‐mafia laws, which resulted in the eviction of some elected officials who were replaced by external commissioners. This replacement, which led to a drastic reduction in local public spending, represents an exogenous source of variation in public spending that can be usefully employed as an instrument. Using a fixed effects 2SLS estimator, they estimate the long‐term public spending multiplier to be 1.95, a much larger value than the one obtained using the within estimator. The Mafia dataset is available in the pder package.

Egger and Pfaffermayr (2004) studied the determinants of bilateral trade of two countries, Germany and the United States, with their partners, bilateral trade being measured by imports and exports on the one hand, and by foreign direct investment on the other. The authors suspect that the individual effect, which indicates a propensity to trade with a given country for geographical and cultural reasons, is correlated with the distance. In this case, this variable, which is the only time‐invariant one, is certainly correlated with the individual effect. The authors use the estimator of Hausman and Taylor (1981) for each equation and also for the system of two equations. The data are provided as TradeFDI in the pder package.

Hutchison and Noy (2005) study the effects of twin crises, characterized by the simultaneous occurrence of a bank and a currency crisis, on the wealth of countries. The panel consists of 24 developing countries for the 1975‐1997 period. The response is the growth rate of the GDP and the two main covariates are the lag of the growth rate and a dummy variable indicating the occurrence of a twin crisis. Employing the lag of the growth rate as a covariate induces an endogeneity problem, which the authors tackle using an error component 2SLS estimator. The results indicate that the cost of a currency crisis is about 5‐8% in terms of growth every year for about 2‐4 years, while for the bank crisis this is about 8‐10%. The article doesn't provide any evidence of a specific effect of twin crises. The data are provided as TwinCrises in the pder package.

Cornwell and Trumbull (1994) and Baltagi (2006) estimate a crime economics model for the counties of North Carolina. The response is the criminality rate and, among the covariates, they introduce the probability of being arrested and the number of policemen per inhabitant. These two covariates induce an endogeneity problem: one actually wants to estimate the causal effect of police on crime, but a reverse causality effect is also likely, because more crime will induce the presence of more policemen. Two instrumental variables are used: the offense mix, which is defined as the ratio of crimes involving face‐to‐face contact to those that do not, and the per capita tax revenue. The first instrument is positively correlated with the probability of being arrested (because the offender may be identified by the victim). The second variable is positively correlated with the number of policemen, more tax income indicating a strong preference for public services and particularly for security. The 2SLS error component model indicates a much stronger effect of the probability of being arrested than the other estimators do, especially the within estimator. The data are provided as Crime in the plm package.

Baltagi and Khanti‐Akom (1990) and Cornwell and Rupert (1988) estimate a wage function using a panel of American individuals, with particular interest in the return to education. A well‐known problem of such studies is that unobserved characteristics of individuals, called abilities, are part of the individual effects and may be correlated with education. Using the within model, the education covariate disappears: the use of the estimator of Hausman and Taylor (1981) is therefore very relevant in this context. Two time‐invariant covariates (being black and being a female) are assumed exogenous, while the level of education is endogenous. Some other time‐varying covariates are assumed exogenous and therefore provide two instruments so that the model is identified. The coefficient of education from the Hausman and Taylor (1981) estimator is larger than the one obtained using GLS (0.14 vs 0.10). The data are provided as Wages in the plm package.

Notes


              OLS    FE     RE     HT1    HT2    AM2
(Intercept)
gdp
dist
rer
rlf
sim
cee
emu
bor                                              0.44
lan                                              0.43
R²            0.90   0.90   0.90   0.90   0.90   0.90
Adj. R²       0.90   0.90   0.90   0.90   0.90   0.90
Num. obs.     3822   3822   3822   3822   3822   3822
s_idios                     0.29   0.29   0.29   0.29
s_id                        0.52   0.65   0.67   0.67