Method · intermediate · 9 min read

Panel · Established

Random Effects

A more efficient alternative to fixed effects when the unobserved effect is uncorrelated with regressors.

When to Use: When you believe the unobserved unit effect is uncorrelated with all regressors, and you want to estimate effects of time-invariant variables that FE cannot identify.
Assumption: The unobserved individual effect is uncorrelated with all regressors in all time periods. Verify with the Hausman test: if it rejects, FE is preferred.
Mistake: Using RE when the Hausman test rejects — indicating the RE assumption is violated and FE should be preferred. Also, not considering the Mundlak/correlated RE approach as a diagnostic.
Reading Time: ~9 min read · 11 sections · 9 interactive exercises

One-Line Implementation

R: plm::plm(y ~ x1 + x2, data = df, index = c('id', 't'), model = 'random') |> lmtest::coeftest(vcov. = plm::vcovHC)

Stata: xtreg y x1 x2, re vce(robust)

Python: RandomEffects(df['y'], df[['x1', 'x2']]).fit(cov_type='robust')  # linearmodels; df indexed by (id, t), no constant added automatically

Motivating Example: Cross-Country Growth Regressions

You want to understand what predicts economic growth across countries. Your panel has 100 countries observed over 30 years, and you are interested in the effects of institutions, trade openness, and human capital on GDP growth.

The challenge is that some of your most important variables — geography, colonial history, legal origin — are time-invariant. A fixed effects model would absorb all of these into the country fixed effects, making them impossible to estimate. But these time-invariant variables are precisely the ones you care about.

If you are willing to assume that the unobserved country-level factors (captured by the country effect $\alpha_i$) are uncorrelated with your regressors, the random effects model lets you estimate the effects of both time-varying and time-invariant variables. This assumption is strong, but in some settings it is defensible, and the payoff is substantial (Islam, 1995).

A. Overview

The Model

The panel data model is the same as in fixed effects:

$$Y_{it} = X_{it}'\beta + Z_i'\gamma + \alpha_i + \varepsilon_{it}$$

where $Z_i$ are time-invariant covariates (like geography) and $\alpha_i$ is the unobserved unit effect. The key difference is in the assumption about $\alpha_i$.

Fixed effects allows $\alpha_i$ to be arbitrarily correlated with the regressors and eliminates it via the within transformation.
Random effects treats $\alpha_i$ as a random variable drawn from a distribution, and assumes it is uncorrelated with $X_{it}$ and $Z_i$.

Why Does the RE Assumption Matter?

If $\alpha_i$ is uncorrelated with the regressors, RE produces more efficient estimates (smaller standard errors) than FE. Intuitively, RE uses both within-unit and between-unit variation, while FE discards the between-unit information entirely.

RE also allows you to estimate the coefficients on time-invariant variables ($\gamma$), which FE cannot.

The RE Estimator as a Weighted Average

The RE estimator is a matrix-weighted average of the within (FE) estimator and the between estimator. In the balanced-panel, single-regressor case, this weighting simplifies to a scalar formula:

$$\hat{\beta}_{RE} = \omega \hat{\beta}_{FE} + (1 - \omega) \hat{\beta}_{BE}$$

where $\omega$ depends on the ratio of the idiosyncratic variance $\sigma^2_\varepsilon$ to the unit-effect variance $\sigma^2_\alpha$ and the number of time periods $T$. When $\sigma^2_\alpha$ is large (lots of persistent between-unit heterogeneity), RE puts more weight on the within estimator and approaches FE. In the general multivariate or unbalanced-panel case, $\omega$ becomes a matrix weight and the expression above no longer applies as a simple scalar formula.
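As a numerical check of the scalar formula above, the sketch below simulates a balanced panel with one regressor and compares the weighted average with RE computed directly by quasi-demeaning (the transformation described in the Mathematical Derivation section). The simulated data, the variable names, the choice of Python with NumPy, and the treatment of the variance components as known are all assumptions made for illustration. The explicit weight $\omega = W_{xx} / (W_{xx} + \phi^2 B_{xx})$, with $\phi^2 = \sigma^2_\varepsilon / (\sigma^2_\varepsilon + T\sigma^2_\alpha)$, is the standard textbook form and is not spelled out in the text above.

```python
import numpy as np

rng = np.random.default_rng(0)
N, T = 200, 5                    # units and periods (balanced panel)
sigma_a, sigma_e = 1.0, 1.0      # variance components, assumed known here

# Simulate data that satisfy the RE assumption (alpha independent of x).
x = rng.normal(size=(N, T)) + rng.normal(size=(N, 1))
alpha = sigma_a * rng.normal(size=(N, 1))
y = 0.5 * x + alpha + sigma_e * rng.normal(size=(N, T))

x_bar = x.mean(axis=1, keepdims=True)
y_bar = y.mean(axis=1, keepdims=True)

# Within (FE) sums of squares and cross-products.
W_xx = ((x - x_bar) ** 2).sum()
W_xy = ((x - x_bar) * (y - y_bar)).sum()
# Between sums: unit means around the grand mean, each counted T times.
B_xx = T * ((x_bar - x.mean()) ** 2).sum()
B_xy = T * ((x_bar - x.mean()) * (y_bar - y.mean())).sum()

beta_fe = W_xy / W_xx
beta_be = B_xy / B_xx

# Scalar weight on the within estimator.
phi2 = sigma_e**2 / (sigma_e**2 + T * sigma_a**2)
omega = W_xx / (W_xx + phi2 * B_xx)
beta_weighted = omega * beta_fe + (1 - omega) * beta_be

# RE computed directly: OLS on quasi-demeaned data (constant included).
theta = 1 - np.sqrt(phi2)
y_t = (y - theta * y_bar).ravel()
X_t = np.column_stack([np.full(N * T, 1 - theta),
                       (x - theta * x_bar).ravel()])
beta_re = np.linalg.lstsq(X_t, y_t, rcond=None)[0][1]

print(beta_fe, beta_be, beta_re, beta_weighted)
```

The last two printed values should agree up to floating-point error. Raising `sigma_a` shrinks `phi2`, which pushes `omega` toward 1 and `beta_re` toward `beta_fe`.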

B. Identification

The Core Assumption

$$E[\alpha_i \mid X_{i1}, X_{i2}, \ldots, X_{iT}, Z_i] = E[\alpha_i] = 0$$

The unobserved unit effect is mean-independent of all regressors in all time periods, which implies that permanent unobserved unit heterogeneity is uncorrelated with every included regressor.

When Is the RE Assumption Plausible?

RE is most defensible when:

Treatment is randomly assigned: In an RCT with panel data, the unit effect $\alpha_i$ is, by design, uncorrelated with treatment. RE is efficient here.
The between and within estimates agree: If the Hausman test does not reject, and the coefficients are substantively similar, RE may be appropriate.
You are studying time-invariant variables: If the research question demands estimating time-invariant effects, FE cannot identify them, so RE or a related approach that keeps between-unit variation is required. The RE assumption still has to be defended.
Prediction is the goal: If you want to predict outcomes (not estimate causal effects), RE can be more useful than FE.

When Is It NOT Plausible?

RE is rarely credible when:

Selection into treatment is likely: If units select into treatment based on unobserved characteristics, the unit effect is correlated with treatment.
Cross-country regressions with endogenous institutions: Country-level unobservables (culture, geography) likely affect both institutions and growth.
The Hausman test strongly rejects: A large test statistic means FE and RE estimates differ substantially, and FE is the safer choice.

C. Visual Intuition

Recall the FE picture: each unit has its own intercept, and FE fits separate within-unit regression lines. RE does something subtler. It partially pools the unit-specific intercepts toward the overall mean. Units with more observations (or more precise data) keep more of their own intercept; units with less data are pulled more toward the grand mean.

Think of RE as a compromise: it does not fully trust the between-unit comparisons (like pooled OLS), but it does not fully discard them either (like FE). It uses the data to find a well-suited blend.

This partial pooling is exactly what hierarchical/multilevel models do. In the random-intercepts case, RE and hierarchical linear models (HLM) are mathematically equivalent, just with different jargon. Econometricians say "random effects." Psychometricians and education researchers say "multilevel models." The core estimation is the same, though multilevel modeling frameworks can accommodate richer structures (random slopes, cross-classified designs) that go beyond the standard econometric RE specification.
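The partial pooling described in this section can be made concrete with the shrinkage weight from the random-intercepts model. The sketch below is a minimal illustration, assuming known variance components; the function name `shrinkage_weight` and the example values are hypothetical. The weight $T_i\sigma^2_\alpha / (T_i\sigma^2_\alpha + \sigma^2_\varepsilon)$ is the standard best-linear-unbiased-predictor factor applied to a unit's own mean residual when predicting its intercept.

```python
def shrinkage_weight(T_i, sigma2_alpha, sigma2_eps):
    """Share of a unit's own mean residual kept in its predicted intercept.

    The remaining share (1 - weight) is pulled toward the grand mean.
    """
    return T_i * sigma2_alpha / (T_i * sigma2_alpha + sigma2_eps)


# Units observed for more periods keep more of their own intercept.
for T_i in (2, 5, 30):
    w = shrinkage_weight(T_i, sigma2_alpha=1.0, sigma2_eps=4.0)
    print(T_i, round(w, 3))
# prints:
# 2 0.333
# 5 0.556
# 30 0.882
```

A weight near 1 corresponds to the FE-style picture of separate intercepts, and a weight near 0 corresponds to complete pooling.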

D. Mathematical Derivation

Don't worry about the notation yet — here's what this means in words: RE uses a partial demeaning (quasi-demeaning) that optimally blends within-unit and between-unit variation, depending on how much of the total variance is due to the unit effect.

The composite error is $u_{it} = \alpha_i + \varepsilon_{it}$. Under RE assumptions:

$$\text{Var}(u_{it}) = \sigma^2_\alpha + \sigma^2_\varepsilon$$

$$\text{Cov}(u_{it}, u_{is}) = \sigma^2_\alpha \quad \text{for } t \neq s$$

The correlation structure within units motivates generalized least squares (GLS). The RE estimator applies a quasi-demeaning transformation to the outcome and, in the same way, to every regressor:

$$\tilde{Y}_{it} = Y_{it} - \hat{\theta}\bar{Y}_i$$

where

$$\hat{\theta} = 1 - \sqrt{\frac{\sigma^2_\varepsilon}{\sigma^2_\varepsilon + T\sigma^2_\alpha}}$$

with the variance components replaced by estimates in practice. This scalar expression holds for balanced panels with any number of regressors. In an unbalanced panel, $T$ is replaced by the unit-specific $T_i$, so $\hat{\theta}$ varies across units. Both are special cases of applying the matrix transformation $\Omega^{-1/2}$ to the stacked data, where $\Omega$ is the block-diagonal error covariance matrix.

When $\sigma^2_\alpha$ is large (lots of permanent unit differences), $\theta \to 1$, and RE approaches FE (full demeaning). When $\sigma^2_\alpha = 0$ (no unit effects), $\theta = 0$, and RE reduces to pooled OLS.
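The two limiting cases can be checked by evaluating the formula for $\theta$ directly. This is a small sketch with hypothetical variance values; the function name `theta` is an illustrative choice, not part of any library.

```python
import math


def theta(sigma2_eps, sigma2_alpha, T):
    """Quasi-demeaning weight for a balanced panel with T periods."""
    return 1.0 - math.sqrt(sigma2_eps / (sigma2_eps + T * sigma2_alpha))


print(theta(1.0, 0.0, 10))    # no unit effects: 0.0, i.e. pooled OLS
print(theta(1.0, 1.0, 10))    # intermediate case: partial demeaning
print(theta(1.0, 100.0, 10))  # dominant unit effects: close to 1, i.e. near FE
```

Because $T$ multiplies $\sigma^2_\alpha$ in the denominator, a longer panel also moves $\theta$ toward 1 for any fixed positive $\sigma^2_\alpha$.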

The RE estimator is then OLS on the quasi-demeaned data:

$$\hat{\beta}_{RE} = \left(\sum_i \sum_t \tilde{X}_{it}\tilde{X}_{it}'\right)^{-1} \left(\sum_i \sum_t \tilde{X}_{it}\tilde{Y}_{it}\right)$$

Hausman test statistic:

$$H = (\hat{\beta}_{FE} - \hat{\beta}_{RE})'\left[\widehat{\text{Var}}(\hat{\beta}_{FE}) - \widehat{\text{Var}}(\hat{\beta}_{RE})\right]^{-1}(\hat{\beta}_{FE} - \hat{\beta}_{RE})$$

Under $H_0$ (the RE assumption holds, so both estimators are consistent and RE is efficient), $H$ is asymptotically $\chi^2_k$, where $k$ is the number of time-varying regressors. In practice, the matrix $[\widehat{\text{Var}}(\hat{\beta}_{FE}) - \widehat{\text{Var}}(\hat{\beta}_{RE})]$ may not be positive definite when different variance estimates are used. Using the same estimate of the idiosyncratic error variance $\sigma^2_\varepsilon$ for both estimators avoids this issue.
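To see the arithmetic, the sketch below evaluates the Hausman statistic for a single coefficient, where the matrix expression collapses to a ratio. The helper `hausman_scalar` and the input values are hypothetical, and a real application would use the full coefficient vectors and covariance matrices. The p-value uses the fact that the upper tail of a $\chi^2_1$ variable at $h$ equals `erfc(sqrt(h / 2))`.

```python
import math


def hausman_scalar(b_fe, se_fe, b_re, se_re):
    """Hausman statistic and chi-square(1) p-value for one coefficient."""
    var_diff = se_fe**2 - se_re**2
    if var_diff <= 0:
        # Mirrors the non-positive-definite problem in the matrix case.
        raise ValueError("Var(FE) - Var(RE) must be positive")
    h = (b_fe - b_re) ** 2 / var_diff
    p_value = math.erfc(math.sqrt(h / 2.0))
    return h, p_value


# Illustrative estimates: FE = 0.03 (SE 0.01), RE = 0.09 (SE 0.006).
h, p = hausman_scalar(b_fe=0.03, se_fe=0.01, b_re=0.09, se_re=0.006)
print(round(h, 2), p)  # h is 0.0036 / 0.000064 = 56.25
```

A gap of 0.06 between the estimates is large relative to the variance difference, so the statistic is far into the rejection region even though both standard errors look small.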

E. Implementation

# Requires: plm, fixest
library(plm)

# --- Step 1: Declare Panel Structure ---
# pdata.frame() tells plm the unit and time identifiers.
# This enables within/between/random effects transformations.
pdata <- pdata.frame(df, index = c("country_id", "year"))

# --- Step 2: Random Effects Estimation ---
# RE assumes the unit effect alpha_i is uncorrelated with all regressors.
# RE is more efficient than FE because it uses both within- and
# between-unit variation. It also allows estimating time-invariant
# variables (like "institutions") that FE would absorb.
re_fit <- plm(growth ~ trade_openness + human_capital + institutions,
              data = pdata, model = "random")
# Output: GLS estimates with both time-varying and time-invariant effects
summary(re_fit)

# --- Step 3: Fixed Effects for Comparison ---
# FE uses only within-unit variation and allows arbitrary correlation
# between unit effects and regressors. Time-invariant variables are dropped.
fe_fit <- plm(growth ~ trade_openness + human_capital,
              data = pdata, model = "within")

# --- Step 4: Hausman Test ---
# Tests H0: RE is consistent (unit effects uncorrelated with regressors).
# Rejection (low p-value) favors FE. Failure to reject does not prove RE.
# Compare FE and RE coefficients substantively, not just statistically.
phtest(fe_fit, re_fit)

# --- Step 5: Mundlak (Correlated RE) Approach ---
# Add unit means of the time-varying regressors to the model.
# If the unit-mean coefficients are jointly significant, the RE assumption
# is violated. The coefficients on the time-varying regressors then rest on
# within-unit variation, while time-invariant variables stay in the model.
# Here the augmented model is fit by pooled OLS with SEs clustered by country.
library(fixest)
df$mean_trade <- ave(df$trade_openness, df$country_id)
df$mean_hcap <- ave(df$human_capital, df$country_id)
mundlak <- feols(growth ~ trade_openness + human_capital + institutions +
                   mean_trade + mean_hcap,
                 data = df, vcov = ~country_id)
# Jointly significant mean_trade and mean_hcap => RE assumption is violated
summary(mundlak)

F. Diagnostics

The Hausman Test in Practice

Run the Hausman test and report the statistic and p-value. But interpret it carefully:

p < 0.05: Reject RE. Use FE. The unobserved effect likely correlates with regressors.
p > 0.05: Cannot reject RE. But failure to reject does not prove RE is correct — you may simply lack power.

The Mundlak Test (A Better Alternative)

The Mundlak (1978) approach adds group means of time-varying regressors to the RE model. If the coefficients on the group means are jointly significant, the RE assumption is violated. The Mundlak test is asymptotically equivalent to the Hausman test under homoskedasticity, but is more intuitive and easier to implement with robust or clustered SEs (under which the classical Hausman test is invalid).
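The mechanics of the Mundlak device can be illustrated on simulated data. In the sketch below, the regressor is correlated with the unit effect by construction, so the RE assumption fails. For brevity the augmented model is fit by pooled OLS instead of RE-GLS, which is an assumption of this sketch; the data-generating values and variable names are hypothetical. On a balanced panel, the coefficient on the time-varying regressor then coincides with the FE estimate, and the coefficient on the unit mean picks up the correlation with the unit effect.

```python
import numpy as np

rng = np.random.default_rng(1)
N, T = 300, 6
alpha = rng.normal(size=(N, 1))

# x is correlated with alpha, violating the RE assumption on purpose.
x = 0.8 * alpha + rng.normal(size=(N, T))
y = 1.0 * x + alpha + rng.normal(size=(N, T))

# Unit means of x, repeated for every period of the unit.
x_bar = np.repeat(x.mean(axis=1, keepdims=True), T, axis=1)

# Augmented regression: y on a constant, x, and the unit mean of x.
X = np.column_stack([np.ones(N * T), x.ravel(), x_bar.ravel()])
coef = np.linalg.lstsq(X, y.ravel(), rcond=None)[0]
beta_mundlak, delta_mean = coef[1], coef[2]

# FE (within) estimate for comparison.
xd = x - x.mean(axis=1, keepdims=True)
yd = y - y.mean(axis=1, keepdims=True)
beta_fe = (xd * yd).sum() / (xd**2).sum()

print(beta_mundlak, beta_fe)  # equal up to floating-point error
print(delta_mean)             # clearly nonzero: evidence against RE
```

Testing whether `delta_mean` is zero, with cluster-robust standard errors in a real application, is the regression-based counterpart of the Hausman comparison.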

Breusch-Pagan LM Test

Tests the null hypothesis that the variance of the unit effect ($\sigma^2_\alpha$) is zero. If the test does not reject, there is no evidence of unit effects and pooled OLS may be adequate. If it rejects ($\sigma^2_\alpha > 0$), either FE or RE is needed.
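For reference, the statistic behind this test can be written out. The form below is the standard Breusch and Pagan (1980) LM statistic for a balanced panel, supplied here as background and not taken from the text above. The symbol $\hat{u}_{it}$ denotes pooled OLS residuals, an assumed notation chosen to match the composite error $u_{it}$ used in the derivation.

$$LM = \frac{NT}{2(T-1)}\left[\frac{\sum_{i=1}^{N}\left(\sum_{t=1}^{T}\hat{u}_{it}\right)^2}{\sum_{i=1}^{N}\sum_{t=1}^{T}\hat{u}_{it}^2} - 1\right]^2$$

Under the null $\sigma^2_\alpha = 0$, $LM$ is asymptotically $\chi^2_1$. Intuitively, if residuals within a unit share a common component, the squared unit sums in the numerator are large relative to the total sum of squared residuals, and the bracketed term moves away from zero.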

Interpreting Your Results

RE coefficients on time-varying variables reflect both within-unit and between-unit variation. They are efficient but potentially biased if the RE assumption fails.
RE coefficients on time-invariant variables (like geography or gender) are identified entirely from between-unit variation. They are only valid if the RE assumption holds.
If FE and RE estimates are substantively similar (not just statistically indistinguishable), this agreement is reassuring. Report both.
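The Mundlak approach combines strengths of both: FE-consistent estimates of time-varying effects plus estimates of time-invariant effects, all in one model. The time-invariant coefficients are still identified from between-unit variation, so they remain valid only if those variables are uncorrelated with the part of the unit effect that the group means do not absorb.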

G. What Can Go Wrong

Problem	What It Does	How to Fix It
Violated RE assumption	Coefficients are biased and inconsistent	Use FE or Mundlak correlated random effects (CRE)
Hausman test lacks power	Fails to reject RE even when it should	Use Mundlak test; rely on economic reasoning
Interpreting RE as causal without justification	Reviewers will flag this immediately	Explicitly defend the uncorrelation assumption or use FE
Confusion with multilevel/HLM models	Same estimator, different jargon	Recognize they are equivalent; use whichever framing your audience expects
Using RE because FE "eats up too much variation"	Not a valid justification for RE	Low within-variation means low power, but FE is still consistent. RE gains efficiency at the cost of potential bias

Violated RE Assumption (Correlated Unit Effects)

Scenario in which the assumption holds: the unobserved country effect (e.g., institutional quality) is uncorrelated with the regressors (trade openness, human capital).

Result: the RE estimate of the trade openness effect is 0.032 (true effect: 0.030), and the Hausman test gives p = 0.61. RE is efficient and consistent.

Relying on a Low-Power Hausman Test

Scenario in which the test is informative: the Hausman test has adequate power because of sufficient within-unit variation and sample size.

Result: Hausman statistic = 18.4 (p = 0.001), a clear rejection of RE. The researcher uses FE and avoids the bias.

Using RE to Estimate Time-Invariant Effects Without Justification

Scenario in which the choice is justified: the RE assumption is carefully defended. In an RCT with panel data, randomization ensures the unit effect is uncorrelated with treatment.

Result: the RE estimate of the time-invariant treatment effect is 0.15 (SE = 0.04). This is defensible because randomization (in expectation) makes the unit effect uncorrelated with treatment, which is the substantive content of the RE assumption.

Concept Check

You run a Hausman test comparing FE and RE estimates. The test statistic is 3.2 with 4 degrees of freedom (p = 0.52). What is the correct interpretation?

The RE assumption is definitely satisfied. Use RE.
There is no evidence against RE, but you generally want to also consider economic reasoning about whether the unit effect is plausibly uncorrelated with regressors.
FE and RE give identical results.
The test is invalid because the p-value is too high.

H. Practice

Concept Check

A researcher wants to estimate the effect of a country's legal origin (common law vs. civil law) on economic growth using panel data. She uses random effects because 'legal origin is time-invariant and cannot be estimated with fixed effects.' Is this a valid justification for RE?

Yes: FE cannot estimate time-invariant effects, so RE is the only option.
No: wanting to estimate a time-invariant effect does not validate the RE assumption. She should use the Mundlak/CRE approach instead.
Yes: as long as the Hausman test does not reject RE.
No: time-invariant effects can never be credibly estimated in observational panel data.

Concept Check

You estimate both FE and RE models. The FE coefficient on trade openness is 0.03 (SE = 0.01) and the RE coefficient is 0.09 (SE = 0.006). What does the large difference between these estimates suggest?

The RE model is more efficient, so the RE estimate is more reliable.
The unit effect is likely correlated with the regressors, biasing the RE estimate upward. Use FE.
The FE standard error is larger, so the FE estimate is less trustworthy.
Both estimates are valid; they just answer different questions.

Concept Check

A Hausman test comparing FE and RE gives a test statistic of 2.1 with 4 degrees of freedom (p = 0.72). A colleague concludes: 'The RE assumption is satisfied, so RE is definitely the right model.' What is wrong with this reasoning?

Nothing: p = 0.72 strongly supports the RE assumption.
Failure to reject does not prove the null. The test may lack power, and substantive reasoning about whether the unit effect is exogenous is equally important.
The Hausman test is invalid with robust standard errors.
Always use FE regardless of the Hausman test result.

Concept Check

In an RCT with panel data, a researcher uses random effects rather than fixed effects. Is this defensible?

No: always use fixed effects with panel data.
Yes: randomization ensures the unit effect is uncorrelated with treatment, satisfying the RE assumption. RE is more efficient in this setting.
Yes: but only if the sample size is large enough.
No: you typically cannot use RE unless you first run a Hausman test.

Guided Exercise

A study of cross-country growth uses RE because the key variable of interest (colonial legal origin) is time-invariant. The Hausman test gives p = 0.23. A Mundlak test (adding group means of trade and human capital to the RE model) shows both group means are significant at the 5% level. What should the researcher do?

The Hausman and Mundlak tests disagree. Which is more reliable, and what estimator should be used?

Error Detective

Read the analysis below carefully and identify the errors.

A researcher studies the effect of democracy (a slowly varying variable) on economic growth using panel data for 80 countries over 20 years. They estimate a random effects model because "fixed effects removes all the cross-country variation in democracy, leaving insufficient within-country variation." They report:

xtreg growth democracy trade human_capital, re

Coefficient on democracy: 0.45 (SE = 0.12, p < 0.001). They write: "The Hausman test gives p = 0.08, which does not reject RE at the 5% level. Therefore, RE is the appropriate estimator and democracy has a positive causal effect on growth."

Select all errors you can find:

Using 'insufficient within-variation' as justification for RE (Justification for RE)

Treating a borderline Hausman test as definitive support for RE (Hausman test interpretation)

Claiming causal effect without defending the exogeneity of the unit effect (Causal claim)

Error Detective

Read the analysis below carefully and identify the errors.

A health economist studies the effect of hospital staffing ratios on patient outcomes across 500 hospitals over 10 years. They estimate both FE and RE models. The FE estimate of nurse-to-patient ratio on mortality is -0.03 (SE = 0.02, p = 0.13). The RE estimate is -0.08 (SE = 0.01, p < 0.001). They report:

"The RE model is preferred because (1) the Hausman test does not reject (p = 0.19), and (2) the RE estimate is more precisely estimated. We conclude that increasing nurse staffing significantly reduces mortality."

Select all errors you can find:

Choosing RE because it gives a more significant result (Estimator selection rationale)

Ignoring that the FE and RE estimates differ substantially (Comparison of FE and RE estimates)

Referee Exercise

Read the paper summary below and write a brief referee critique (2-3 sentences) of the identification strategy.

Paper Summary

A study examines the effect of teacher certification type (traditional vs. alternative) on student test scores. Using panel data from 2,000 schools over 8 years, the authors estimate a random effects model because certification type varies mostly between schools, not within schools over time. They argue RE is necessary to identify the effect of this near-time-invariant variable. The Hausman test gives p = 0.31. They find that traditional certification raises math scores by 0.15 standard deviations (p < 0.01).

Key Table

Variable	Coefficient	SE	p-value
Traditional cert. share	0.150	0.042	0.000
Student-teacher ratio	-0.030	0.011	0.006
% Free lunch	-0.220	0.018	0.000
School RE	Yes
Year FE	Yes
Hausman test p-value	0.31
N (school-years)	16,000

Authors' Identification Claim

The Hausman test does not reject RE, supporting our use of the random effects estimator. Because certification type is nearly time-invariant, FE would absorb most of the identifying variation and produce imprecise estimates.

I. Swap-In: When to Use Something Else

Fixed effects: When the RE assumption (unit effects uncorrelated with regressors) is implausible — FE allows arbitrary correlation between unit effects and covariates at the cost of discarding between-unit variation.
Correlated Random Effects (Mundlak): When you want to test the RE assumption while still estimating time-invariant effects — add group means of time-varying regressors to the RE model.
First differencing: Equivalent to FE with two periods. With more periods, FE is more efficient when the idiosyncratic errors are serially uncorrelated, while first differencing is more efficient when they follow a random walk.
Arellano-Bond GMM: For dynamic panels with a lagged dependent variable, where both FE and RE are biased.
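The first-differencing entry in the list above states that FD and FE coincide with two periods. A quick simulated check is sketched below, assuming a single regressor and no period effects; the data-generating values are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(2)
N = 500
alpha = rng.normal(size=N)

# Two periods per unit; x is correlated with the unit effect.
x = rng.normal(size=(N, 2)) + alpha[:, None]
y = 2.0 * x + alpha[:, None] + rng.normal(size=(N, 2))

# First-difference estimator (no intercept): regress dy on dx.
dx = x[:, 1] - x[:, 0]
dy = y[:, 1] - y[:, 0]
beta_fd = (dx * dy).sum() / (dx**2).sum()

# Within (FE) estimator: regress demeaned y on demeaned x.
xd = x - x.mean(axis=1, keepdims=True)
yd = y - y.mean(axis=1, keepdims=True)
beta_fe = (xd * yd).sum() / (xd**2).sum()

print(beta_fd, beta_fe)  # identical up to floating-point error
```

With two periods, each demeaned value is plus or minus half of the first difference, so the two ratios reduce to the same expression. With three or more periods the estimators generally differ.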

J. Reviewer Checklist

Critical Reading Checklist

Is the RE assumption (unit effects uncorrelated with the regressors) explicitly discussed and justified?
Is the Hausman test reported? Is it interpreted correctly (not as proof that RE is right)?
Is the Mundlak/CRE specification explored as a robustness check?
Are FE results shown alongside RE for comparison?
If time-invariant variables are of interest, is this explicitly stated as the reason for using RE?
Are robust standard errors used?
Is the Breusch-Pagan test reported (to verify that unit effects exist at all)?
Is the distinction between RE and multilevel models acknowledged if the audience spans disciplines?

Paper Library

Foundational (7)

Bell, A., & Jones, K. (2015). Explaining Fixed Effects: Random Effects Modeling of Time-Series Cross-Sectional and Panel Data.

Political Science Research and Methods. DOI: 10.1017/psrm.2014.7

Bell and Jones argue that the 'within-between' random-effects model (closely related to the Mundlak approach) can outperform pure fixed effects in certain settings because it allows explicit decomposition of within- and between-unit effects while accounting for unobserved heterogeneity. This approach retains the unbiasedness of the within estimator for time-varying regressors while also estimating between-unit effects that fixed effects discard. The paper provides practical guidance for researchers who need to estimate both types of effects or who have time-invariant regressors that fixed effects cannot identify.

Hausman, J. A. (1978). Specification Tests in Econometrics.

Econometrica. DOI: 10.2307/1913827

Hausman develops a general test for comparing two estimators—one consistent under a broader set of assumptions (fixed effects) and one efficient under stronger assumptions (random effects). The 'Hausman test' for choosing between fixed and random effects is one of the most frequently used specification tests in applied economics.

Hausman, J. A., & Taylor, W. E. (1981). Panel Data and Unobservable Individual Effects.

Econometrica. DOI: 10.2307/1911406

Hausman and Taylor develop an instrumental variables estimator for panel data that allows consistent estimation of coefficients on time-invariant variables even when individual effects are correlated with some regressors. The Hausman-Taylor estimator occupies a middle ground between fixed effects (which cannot estimate time-invariant coefficients) and random effects (which requires strict exogeneity).

Hofmann, D. A. (1997). An Overview of the Logic and Rationale of Hierarchical Linear Models.

Journal of Management. DOI: 10.1177/014920639702300602

Hofmann introduces hierarchical linear models to the management research community, explaining when and why multilevel random-effects models are appropriate for organizational data with nested structures. This tutorial is highly influential in promoting multilevel methods in management journals.

Laird, N. M., & Ware, J. H. (1982). Random-Effects Models for Longitudinal Data.

Biometrics. DOI: 10.2307/2529876

Laird and Ware develop the general framework for random-effects models in longitudinal data, integrating fixed population parameters with random individual-level effects. This paper is foundational for the mixed-effects modeling approach widely used in biostatistics and social sciences.

Mundlak, Y. (1978). On the Pooling of Time Series and Cross Section Data.

Econometrica. DOI: 10.2307/1913646

Mundlak shows that the fixed effects estimator can be understood as an OLS regression that includes the group means of all time-varying regressors. This 'correlated random effects' interpretation bridges the fixed effects and random effects models and clarifies exactly what assumption is being relaxed.

Wooldridge, J. M. (2019). Correlated Random Effects Models with Unbalanced Panels.

Journal of Econometrics. DOI: 10.1016/j.jeconom.2018.12.010

Wooldridge extends the correlated random effects (CRE) framework to handle unbalanced panels, which are the norm in applied research. This paper shows how to combine the flexibility of fixed effects with the ability to estimate effects of time-invariant variables, making the CRE approach practical for real-world datasets.

Application (2)

Aguinis, H., Gottfredson, R. K., & Culpepper, S. A. (2013). Best-Practice Recommendations for Estimating Cross-Level Interaction Effects Using Multilevel Modeling.

Journal of Management. DOI: 10.1177/0149206313478188

Aguinis, Gottfredson, and Culpepper provide detailed guidance for management researchers on estimating cross-level interaction effects in multilevel models. They address common problems including insufficient statistical power, centering decisions, and effect size reporting that frequently lead to unreliable results in organizational research. The paper offers concrete recommendations for sample size, model specification, and interpretation that improve the credibility of multilevel interaction analyses.

Islam, N. (1995). Growth Empirics: A Panel Data Approach.

Quarterly Journal of Economics. DOI: 10.2307/2946651

Islam applies panel data methods—including random effects and fixed effects—to the cross-country growth regression framework, showing that accounting for unobserved country heterogeneity substantially changes estimates of convergence rates. This paper demonstrates the importance of choosing between fixed and random effects in macroeconomic growth empirics.

Survey (8)

Allison, P. D. (2009). Fixed Effects Regression Models.

SAGE Publications. DOI: 10.4135/9781412993869

Allison's concise and accessible monograph compares fixed effects and random effects models for panel data, providing practical guidance on model selection, estimation, and interpretation. It is particularly useful for social scientists seeking an intuitive understanding of when each approach is appropriate.

Angrist, J. D., & Pischke, J.-S. (2009). Mostly Harmless Econometrics: An Empiricist's Companion.

Princeton University Press. DOI: 10.1515/9781400829828

Angrist and Pischke write one of the most influential modern textbooks on applied econometrics, organizing the field around a design-based approach to causal inference. The book provides essential treatments of instrumental variables, difference-in-differences, and regression discontinuity, each grounded in the potential outcomes framework. It remains the standard reference for graduate students learning to evaluate and implement identification strategies.

Baltagi, B. H. (2021). Econometric Analysis of Panel Data.

Springer. DOI: 10.1007/978-3-030-53953-5

Baltagi provides the standard graduate-level textbook on panel data econometrics, covering one-way and two-way error component models, fixed and random effects, dynamic panels, unbalanced panels, spatial panels, and limited dependent variable panel models. The book emphasizes the theoretical foundations, asymptotic properties, and Monte Carlo evidence on different panel estimators, and is the primary reference for understanding the assumptions, properties, and trade-offs of panel data methods.

Clark, T. S., & Linzer, D. A. (2015). Should I Use Fixed or Random Effects?.

Political Science Research and Methods. DOI: 10.1017/psrm.2014.32

Clark and Linzer provide practical guidance on choosing between fixed and random effects, arguing the decision depends on the research question, sample size, and the degree of correlation between unit effects and covariates. They demonstrate via simulation that random effects can outperform fixed effects when the number of units is small or when between-unit variation is of substantive interest. The paper challenges the common practice of defaulting to fixed effects solely because the Hausman test rejects.

Peterson, M. F., Arregle, J.-L., & Martin, X. (2012). Multilevel Models in International Business Research.

Journal of International Business Studies. DOI: 10.1057/jibs.2011.59

Peterson, Arregle, and Martin review the use of multilevel random-effects models in international business research, where firms are nested within countries. They discuss best practices for modeling cross-level effects and the importance of accounting for the hierarchical structure of international data.

Rabe-Hesketh, S., & Skrondal, A. (2012). Multilevel and Longitudinal Modeling Using Stata.

Stata Press

Rabe-Hesketh and Skrondal provide a comprehensive practical guide to multilevel (hierarchical) models in Stata, which generalize the random effects framework to more complex nested data structures. It is an essential reference for applied researchers implementing multilevel models.

Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical Linear Models: Applications and Data Analysis Methods.

SAGE Publications

Raudenbush and Bryk popularized hierarchical linear models (HLM), which are random-effects models for nested data structures such as students within schools, in this influential textbook. It became the standard reference for multilevel modeling in education, psychology, and organizational research.

Wooldridge, J. M. (2010). Econometric Analysis of Cross Section and Panel Data.

MIT Press

Wooldridge's graduate textbook covers basic linear unobserved effects panel data models in Chapter 10, including random effects, fixed effects, first differencing, and the Hausman comparison of RE and FE.
