Testing for the Appropriate Level of Clustering in Linear Regression Models

James G. MacKinnon, Morten Ørregaard Nielsen, Matthew D. Webb

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

4 Citations (Scopus)

Abstract

The overwhelming majority of empirical research that uses cluster-robust inference assumes that the clustering structure is known, even though there are often several possible ways in which a dataset could be clustered. We propose two tests for the correct level of clustering in regression models. One test focuses on inference about a single coefficient, and the other on inference about two or more coefficients. We provide both asymptotic and wild bootstrap implementations. The proposed tests work for a null hypothesis of either no clustering or “fine” clustering against alternatives of “coarser” clustering. We also propose a sequential testing procedure to determine the appropriate level of clustering. Simulations suggest that the bootstrap tests perform very well under the null hypothesis and can have excellent power. An empirical example suggests that using the tests leads to sensible inferences.

Original languageEnglish
JournalJournal of Econometrics
Volume235
Issue2
Pages (from-to)2027-2056
Number of pages30
ISSN0304-4076
DOIs
Publication statusPublished - Aug 2023

Keywords

  • CRVE
  • Cluster-robust variance estimator
  • Clustered data
  • Grouped data
  • Robust inference
  • Wild bootstrap
  • Wild cluster bootstrap

Fingerprint

Dive into the research topics of 'Testing for the Appropriate Level of Clustering in Linear Regression Models'. Together they form a unique fingerprint.

Cite this