The correction to our paper has now been published at Journal of Business and Economic Statistics. It is available at https://doi.org/10.1080/07350015.2023.2174123.
In my 2018 paper with Beth Tipton, published in the Journal of Business and Economic Statistics, we considered how to do cluster-robust variance estimation in fixed effects models estimated by weighted (or unweighted) least squares. As explained in my previous post, we were recently alerted that Theorem 2 in the paper is incorrect as stated. It turns out, the conditions in the original version of the theorem are too general. A more limited version of the Theorem does actually hold, but only for models estimated using ordinary (unweighted) least squares, under a working model that assumes independent, homoskedastic errors. In this post, I’ll give the revised theorem, following the notation and setup of the previous post (so better read that first, or what follows won’t make much sense!).
Theorem 2, revised
Consider the model where is an vector of responses for cluster , is an matrix of focal predictors, is an matrix of additional covariates that vary across multiple clusters, and is an matrix encoding cluster-specific fixed effects, all for . Let be the set of predictors that vary across clusters and be the full set of predictors. Let be an absorbed version of the focal predictors and the covariates. The cluster-robust variance estimator for the coefficients of is where are the CR2 adjustment matrices.
If we assume a working model in which for and estimate the model by ordinary least squares, then the CR2 adjustment matrices have a fairly simple form: where is the symmetric square root of the Moore-Penrose inverse of . However, this form is computationally expensive because it involves the full set of predictors, , including the cluster-specific fixed effects . If the model is estimated after absorbing the cluster-specific fixed effects, then it would be convenient to use the adjustment matrices based on the absorbed predictors only, The original version of Theorem 2 asserted that , which is not actually the case. However, for ordinary least squares with the independent, homoskedastic working model, we can show that . Thus, it doesn’t matter whether we use or to calculate the cluster-robust variance estimator. We’ll get the same result either way, but is bit easier to compute.
Here’s a formal statement of Theorem 2:
Let and assume that have full rank . If and for , then , where and are as defined in Equation 3 and Equation 4, respectively.
Proof
We can prove this revised Theorem 2 by showing how can be constructed in terms of and . First, because for any , it follows that is idempotent, i.e.,
Next, denote the thin QR decomposition of as , where is semi-orthogonal and has the same rank as . Next, let and observe that this can be written as It can then be seen that It follows that because . Further, as well.
Now, let and observe that this can be written as because .
We then construct the full adjustment matrix as Showing that will suffice to verify that is the symmetric square root of the Moore-Penrose inverse of . Because is idempotent, , and , we have
From the representation of in Equation 5, it is clear that .
Back to top