Quantile Treatment Effects (QTE)
Estimates how treatment shifts the entire outcome distribution, revealing heterogeneous effects across quantiles that average effects conceal.
Quick Reference
- When to Use
- When you suspect the treatment effect varies across the outcome distribution and average effects mask important heterogeneity.
- Key Assumption
- For conditional QTE: correct specification of the conditional quantile function. For unconditional QTE: the recentered influence function correctly linearizes the quantile functional.
- Common Mistake
- Interpreting conditional quantile regression coefficients as effects on unconditional quantiles. Use RIF regression for unconditional quantile effects.
- Estimated Time
- 3 hours
One-Line Implementation
Stata:  sqreg y treatment x1 x2, q(.1 .25 .5 .75 .9) reps(500)
R:      rq(y ~ treatment + x1 + x2, tau = c(0.1, 0.25, 0.5, 0.75, 0.9), data = df)
Python: QuantReg(df['y'], df[['const','treatment','x1','x2']]).fit(q=0.5)
Motivating Example
A state government introduces a subsidized job training program for unemployed workers. A randomized evaluation finds that the program increases average earnings by $1,200 per year -- a statistically significant and seemingly encouraging result. Policymakers celebrate.
But a closer look at the data reveals something the average conceals. Workers at the bottom of the earnings distribution -- those who struggled most before the program -- saw their earnings increase by $3,500. Workers at the top of the distribution, however, saw their earnings decrease by $800, possibly because they were diverted from better opportunities into the standardized program. The average of $1,200 is a weighted blend of a large positive effect at the bottom and a modest negative effect at the top. No single individual experienced the "average" effect.
This scenario is exactly the pattern studied by Bitler et al. (2006), who analyzed the Connecticut Jobs First welfare reform experiment and found that mean impacts masked dramatic distributional heterogeneity. The program raised earnings for women at the lower quantiles and lowered them at the upper quantiles. A policymaker who saw only the mean effect would have drawn the wrong conclusions about who benefits and who is harmed.
Quantile treatment effects (QTE) solve this problem by estimating how the treatment shifts each quantile of the outcome distribution separately. Instead of asking "what is the average effect?", QTE asks "what is the effect at the 10th percentile? The 25th? The median? The 90th?" The result is a quantile process -- a curve showing how the treatment effect varies across the entire distribution. When this curve is flat, the average treatment effect tells the whole story. When it is not flat, the average is misleading, and QTE reveals the heterogeneity that average effects conceal.
A. Overview
What Quantile Treatment Effects Do
Standard regression methods -- OLS, difference-in-differences, IV -- estimate the effect of a treatment on the mean of the outcome distribution. This quantity is the average treatment effect (ATE). But the mean can hide as much as it reveals. A treatment that helps the poor and hurts the rich, a drug that cures some patients and harms others, a policy that compresses or widens the wage distribution -- all of these produce the same ATE if the gains and losses happen to average out.
Quantile treatment effects estimate the impact of a treatment at every point of the outcome distribution. The key object is the quantile process:

QTE(τ) = Q_{Y(1)}(τ) − Q_{Y(0)}(τ),  τ ∈ (0, 1),

where Q_{Y(d)}(τ) is the τ-th quantile of the potential outcome distribution under treatment status d. At τ = 0.5, this is the effect on the median. At τ = 0.1, it is the effect on the 10th percentile -- the bottom of the distribution. At τ = 0.9, it is the effect on the 90th percentile -- the top.
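With experimental data, the quantile process can be estimated directly as the difference of empirical quantiles between arms. A minimal NumPy sketch on simulated data (the DGP and all numbers are illustrative, not from the text):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5000

# Simulated randomized experiment: the treatment effect shrinks with a
# unit's rank in the outcome distribution (rank-invariant shift).
y0 = rng.normal(50, 10, size=n)              # control-arm outcomes
ranks = (y0.argsort().argsort() + 1) / n     # within-sample ranks in (0, 1]
y1 = y0 + 3.0 - 2.5 * ranks                  # effect falls from ~3.0 to ~0.5

taus = [0.10, 0.25, 0.50, 0.75, 0.90]
qte = {t: np.quantile(y1, t) - np.quantile(y0, t) for t in taus}
for t in taus:
    print(f"QTE({t:.2f}) = {qte[t]:+.2f}")
```

The downward-sloping QTE curve (larger effects at low quantiles) is exactly the compression pattern described above; a constant ATE would report only the average of these values.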
Conditional vs. Unconditional QTE
There is a critical distinction between two types of quantile effects that researchers frequently confuse:
Conditional quantile effects (Koenker and Bassett, 1978) estimate Q_{Y|X}(τ) -- the τ-th quantile of Y conditional on covariates X. Standard quantile regression targets this object. The treatment coefficient in a conditional quantile regression measures how treatment shifts the conditional quantile function at a given point in the covariate space.
Unconditional quantile effects (Firpo, Fortin & Lemieux, 2009) estimate the effect on the τ-th quantile of the marginal (population) distribution of Y, integrating over the covariate distribution. These are the quantities that matter for policy -- "did the program raise the 10th percentile of wages in the population?" -- but standard quantile regression does not estimate them. RIF regression is needed instead.
When to Use QTE
- You suspect the treatment effect varies across the outcome distribution -- e.g., a training program helps low-earners more than high-earners
- You care about distributional outcomes like inequality, poverty rates, or tail behavior
- You want to know whether a treatment compresses or widens the outcome distribution
- The average effect is small or zero, but you suspect offsetting effects at different parts of the distribution
When NOT to Use QTE
- You are interested in who is affected (by observed characteristics) -- use subgroup analysis or CATE estimation instead
- Your outcome is binary or count-valued -- quantile regression is designed for continuous outcomes
- You have very small sample sizes -- quantile regression at extreme quantiles requires substantial data in the tails
- The treatment is endogenous and you lack instruments -- standard QR does not solve endogeneity (use the IV-QR approach of Chernozhukov & Hansen, 2005)
When to Use
- The treatment effect likely varies across the distribution. Programs targeting disadvantaged populations, policies affecting wages or income, interventions with heterogeneous compliance -- these naturally produce distributional effects.
- You care about inequality or distributional outcomes. Does a minimum wage increase compress the wage distribution? Does a tax policy reduce the gap between the 90th and 10th percentiles? QTE directly answers these questions.
- The average effect is zero or small but may mask offsetting effects. A drug with zero average effect might cure some patients and harm others. QTE reveals this pattern.
- You want a complete picture of the treatment. Reporting the full quantile process provides much richer information than a single ATE, especially for policy evaluation (Bitler et al., 2006).
Do NOT Use QTE When:
- Your outcome is discrete or binary. Quantile regression is designed for continuous outcomes. For binary outcomes, use subgroup analysis. For count outcomes, quantile methods are limited.
- You want heterogeneity by observed characteristics. QTE reveals heterogeneity across the outcome distribution, not across subgroups. For subgroup effects, use interaction terms or causal forests.
- You have very small samples. Extreme quantiles (0.05, 0.95) require substantial observations in the tails. With n = 100, the 5th percentile is based on only 5 observations.
- Endogeneity is a concern and you lack instruments. Standard quantile regression does not solve endogeneity. Use the IV-QR approach of Chernozhukov & Hansen (2005) if you have valid instruments.
Connection to Other Methods
- OLS Regression: OLS estimates the effect on the conditional mean. QTE estimates effects on conditional (or unconditional) quantiles. When the treatment effect is constant (homogeneous), OLS and QTE at every quantile give the same answer.
- Causal Forests: Causal forests estimate heterogeneous treatment effects by observed covariates (CATE). QTE estimates heterogeneity across the outcome distribution. These are complementary -- CATE tells you who is helped; QTE tells you where in the distribution the shift occurs.
- Experimental Design: Randomization solves the selection problem for QTE just as it does for ATE. With experimental data, quantile regression identifies causal QTE without additional assumptions beyond randomization.
- IV / 2SLS: Standard QR does not handle endogeneity. Chernozhukov & Hansen (2005) develop IV methods for quantile regression. The instrumental variable quantile regression (IVQR) model identifies quantile treatment effects when treatment is endogenous.
B. Identification
For quantile treatment effects to have a causal interpretation, specific assumptions must hold depending on the research design and the type of QTE being estimated.
Assumption 1: Unconfoundedness (for observational data)
Plain language: Conditional on observed covariates X, treatment assignment D is independent of potential outcomes at each quantile. This condition is the same as in OLS, but applied quantile by quantile.
Formally: Q_{Y(d)|X,D}(τ) = Q_{Y(d)|X}(τ) for d ∈ {0, 1} and all τ ∈ (0, 1).
Under experimental data (randomization), this assumption holds by design. Under observational data, it requires the same selection-on-observables argument as for the ATE.
Assumption 2: Correct Specification of the Conditional Quantile Function
Plain language: The linear quantile regression model correctly specifies how covariates shift the conditional quantile function. Unlike OLS, where linearity is guaranteed to provide the best linear approximation to the CEF, quantile regression is only consistent for the conditional quantile function if the model is correctly specified.
Formally: Q_{Y|X}(τ) = X′β(τ) for each τ ∈ (0, 1).
Angrist et al. (2006) show that under misspecification, quantile regression coefficients can still be interpreted as weighted averages of conditional quantile effects, but the weights depend on the specification and may not correspond to the desired estimand.
Assumption 3: Rank Invariance (for individual-level interpretation)
Plain language: Treatment does not change an individual's rank in the outcome distribution. The person at the 30th percentile without treatment would also be at the 30th percentile with treatment (just at a different level). This allows interpreting QTE as individual-level treatment effects.
Formally: F_{Y(1)}(Y_i(1)) = F_{Y(0)}(Y_i(0)) for all individuals i.
Rank invariance is a very strong assumption. It is plausible when treatment is a small perturbation (e.g., a modest wage subsidy) but implausible when treatment fundamentally reshuffles outcomes (e.g., a job training program that transforms some workers' career trajectories while leaving others unchanged). Without rank invariance, QTE identifies distributional effects but not individual-level effects.
C. Visual Intuition
When the treatment effects at different quantiles differ, the QTE curve departs from the constant ATE line. When the treatment compresses the distribution (helping the bottom more than the top), the QTE curve slopes downward and the ATE masks important heterogeneity.
Quantile Treatment Effects vs. Constant ATE
DGP: Y(0) ~ N(50, 100). Treatment effect varies linearly from 3.0 at the bottom to 0.5 at the top. True ATE = 1.75. N = 500.
Estimation Results
| Estimator | β̂ | SE | 95% CI | Bias |
|---|---|---|---|---|
| OLS / ATE | 1.978 | 0.893 | [0.23, 3.73] | +0.228 |
| QTE at median | 3.036 | 1.209 | [0.67, 5.41] | +1.286 |
| True β | 1.750 | — | — | — |
Why the difference?
The treatment effect varies substantially across quantiles: 3.0 at the bottom vs. 0.5 at the top. The QTE curve slopes downward, revealing heterogeneity that the single ATE (β̂ = 1.98) conceals. The treatment compresses the distribution by helping those at the bottom more than those at the top. Policymakers who rely on the ATE alone would miss this redistributive pattern.
D. Mathematical Derivation
The Quantile Regression Estimator
Don't worry about the notation yet — here's what this means in words: Quantile regression estimates conditional quantile functions by minimizing an asymmetric loss function. The estimator is consistent for the linear conditional quantile model under correct specification.
Setup. We want to estimate for a given .
Step 1: The check function. Define the check function (also called the pinball loss):

ρ_τ(u) = u (τ − 1{u < 0})

This function penalizes positive residuals by τ and negative residuals by (1 − τ). At τ = 0.5, it reduces to a multiple of the absolute value (median regression). At τ = 0.9, it penalizes under-predictions (positive residuals) nine times more heavily than over-predictions.
Step 2: The optimization problem. The quantile regression estimator solves:

β̂(τ) = argmin_β Σ_i ρ_τ(y_i − x_i′β)

This minimization is a linear programming problem and can be solved efficiently using simplex or interior point methods (Koenker & Bassett, 1978).
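A quick way to see why minimizing the check function yields quantiles: over a constant (intercept-only model), the minimizer of the summed pinball loss is the sample τ-th quantile, which is always attained at one of the data points. A minimal NumPy sketch on simulated data:

```python
import numpy as np

def check_loss(u, tau):
    # rho_tau(u) = u * (tau - 1{u < 0}): asymmetric absolute loss
    return u * (tau - (u < 0))

rng = np.random.default_rng(1)
y = rng.exponential(scale=2.0, size=500)

tau = 0.75
# The constant minimizing the summed check loss is attained at a data
# point, so a search over observed values suffices for illustration.
losses = [check_loss(y - c, tau).sum() for c in y]
c_hat = y[int(np.argmin(losses))]
print(c_hat, np.quantile(y, tau))  # the two should nearly coincide
```

For the full regression problem with covariates, dedicated solvers (linear programming, as in `quantreg` or `statsmodels`) replace this brute-force search.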
Step 3: Asymptotic distribution. Under regularity conditions:

√n (β̂(τ) − β(τ)) →_d N(0, τ(1 − τ) H(τ)⁻¹ J H(τ)⁻¹),  H(τ) = E[f_{Y|X}(X′β(τ)) X X′],  J = E[X X′],

where f_{Y|X} is the conditional density of Y given X evaluated at the τ-th quantile. The "sparsity" function 1/f_{Y|X} replaces the error variance that appears in OLS asymptotics. Estimation of f_{Y|X} at the quantile is a nuisance -- bootstrap standard errors are generally preferred in practice.
Step 4: Comparison with OLS. OLS minimizes Σ_i (y_i − x_i′β)², giving the conditional mean E[Y | X]. Quantile regression minimizes Σ_i ρ_τ(y_i − x_i′β), giving the conditional τ-th quantile. The key difference: OLS produces a single coefficient vector β̂; quantile regression produces a function β̂(τ) that varies with τ.
RIF Regression for Unconditional Quantile Effects
Don't worry about the notation yet — here's what this means in words: RIF regression linearizes the quantile functional so that OLS on the transformed outcome recovers the unconditional quantile effect.
The problem. Standard quantile regression estimates the conditional quantile effect: how X shifts Q_{Y|X}(τ). But policymakers typically want the unconditional quantile effect: how X shifts Q_Y(τ), the τ-th quantile of the population distribution. These are not the same -- the law of iterated expectations does not hold for quantiles: Q_Y(τ) ≠ E_X[Q_{Y|X}(τ)] in general.
Step 1: The influence function. The influence function of the τ-th quantile functional q_τ is:

IF(y; q_τ) = (τ − 1{y ≤ q_τ}) / f_Y(q_τ)

where f_Y(q_τ) is the density of Y evaluated at q_τ.
Step 2: Recentering. The RIF adds back the quantile itself (Firpo et al., 2009):

RIF(y; q_τ) = q_τ + (τ − 1{y ≤ q_τ}) / f_Y(q_τ)

The key property: E[RIF(Y; q_τ)] = q_τ. This property means that the RIF is a transformation of Y whose expectation equals the distributional statistic of interest.
Step 3: OLS on the RIF. Regressing RIF(Y; q_τ) on X by OLS gives:

γ(τ) = E[X X′]⁻¹ E[X · RIF(Y; q_τ)]

The coefficients γ(τ) have the interpretation of unconditional quantile partial effects -- the marginal effect of a small change in X on the τ-th quantile of the unconditional distribution of Y.
Step 4: Implementation. In practice:
- Estimate q̂_τ as the sample τ-th quantile
- Estimate f̂_Y(q̂_τ) using kernel density estimation
- Compute RIF_i = q̂_τ + (τ − 1{y_i ≤ q̂_τ}) / f̂_Y(q̂_τ)
- Run OLS of RIF_i on X_i to get γ̂(τ)
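The steps above can be sketched end to end in NumPy. Everything here is illustrative: a simulated DGP with a hypothetical binary treatment `d`, a Gaussian kernel density estimate with Silverman's rule-of-thumb bandwidth, and OLS via least squares:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 4000
x = rng.normal(size=n)
d = rng.integers(0, 2, size=n)                     # hypothetical treatment
y = 1.0 + 0.5 * x + 1.5 * d + rng.normal(size=n)   # illustrative DGP

tau = 0.5
q_tau = np.quantile(y, tau)                        # sample tau-th quantile

# Gaussian KDE of f_Y at q_tau, Silverman rule-of-thumb bandwidth
h = 1.06 * y.std() * n ** (-1 / 5)
f_hat = np.mean(np.exp(-0.5 * ((y - q_tau) / h) ** 2)) / (h * np.sqrt(2 * np.pi))

# Recentered influence function, then OLS of the RIF on the covariates
rif = q_tau + (tau - (y <= q_tau)) / f_hat
X = np.column_stack([np.ones(n), d, x])
gamma = np.linalg.lstsq(X, rif, rcond=None)[0]
print("UQE of treatment at the median:", round(gamma[1], 2))
```

A useful sanity check on any RIF implementation is that the sample mean of the RIF equals the estimated quantile, mirroring the population property E[RIF(Y; q_τ)] = q_τ.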
E. Implementation
Step-by-Step Workflow
1. Estimate OLS first. Always report the ATE as a benchmark. The ATE is the effect on the conditional mean and serves as the reference for the quantile analysis.
2. Run conditional quantile regression at the standard quantiles τ ∈ {0.10, 0.25, 0.50, 0.75, 0.90}. Use bootstrap standard errors (not asymptotic) -- they are more reliable in practice.
3. Plot the quantile process. Estimate β̂(τ) on a fine grid (every 5th percentile from 0.05 to 0.95) and plot the treatment coefficient with 95% confidence bands. Overlay the OLS estimate as a horizontal dashed line. If the QTE curve lies entirely within the OLS confidence interval, there is no evidence of distributional heterogeneity.
4. Test for heterogeneity. Use the Wald test from simultaneous quantile regression (sqreg in Stata) to test H0: β(0.10) = β(0.25) = ... = β(0.90).
5. Estimate unconditional quantile effects if the policy question concerns population quantiles. Use RIF regression (Firpo-Fortin-Lemieux). Compare conditional and unconditional estimates -- large differences indicate that covariates importantly reshape the mapping from conditional to unconditional quantiles.
6. Check for quantile crossing. Verify that fitted quantile lines do not cross: Q̂(τ1 | x_i) ≤ Q̂(τ2 | x_i) whenever τ1 < τ2, for all observations. Crossing indicates model misspecification.
7. Report the interquantile range effect. The difference QTE(0.90) − QTE(0.10) tells you whether the treatment compresses (negative) or widens (positive) the distribution.
Standard Errors and Inference
- Bootstrap SEs are preferred for conditional quantile regression. The asymptotic variance involves the conditional density at the quantile, which is difficult to estimate reliably.
- For simultaneous quantile regression (sqreg in Stata), the bootstrap is performed jointly across quantiles, enabling valid cross-quantile tests.
- For RIF regression, standard OLS standard errors are valid because the RIF-transformed outcome is treated as the dependent variable in an OLS regression. Heteroscedasticity-robust SEs are recommended.
- For extreme quantiles (τ < 0.05 or τ > 0.95), inference is unreliable in moderate samples. Report these with appropriate caveats or omit them.
library(quantreg)
# Quantile regression at multiple quantiles
taus <- c(0.10, 0.25, 0.50, 0.75, 0.90)
qr_fits <- lapply(taus, function(tau) {
rq(y ~ treatment + x1 + x2, tau = tau, data = df)
})
# Summary with bootstrap SEs
lapply(qr_fits, function(fit) summary(fit, se = "boot", R = 500))
# Quantile process plot
qr_full <- rq(y ~ treatment + x1 + x2, tau = seq(0.05, 0.95, 0.05), data = df)
plot(summary(qr_full, se = "boot"))

F. Diagnostics
F.1 Quantile Crossing
If the estimated conditional quantile at τ1 exceeds the estimated conditional quantile at τ2 > τ1 for some observations, the model is misspecified. This crossing means the linear model for Q_{Y|X}(τ) implies a negative conditional density for some covariate values, which is impossible.
What to do: Check the fraction of observations with crossings. A small fraction (< 5%) is common and not alarming. Extensive crossing suggests that a linear specification is inadequate -- consider adding interactions, polynomial terms, or using a more flexible model.
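The crossing check is a few lines once fitted coefficients are in hand. The sketch below uses hypothetical coefficients deliberately chosen so that the 0.50 and 0.90 lines cross inside the data range, to show what the diagnostic flags:

```python
import numpy as np

# Hypothetical fitted coefficients (intercept, slope) at three quantiles,
# chosen so the 0.50 and 0.90 lines cross within the data range.
betas = {0.10: np.array([1.0, 0.5]),
         0.50: np.array([2.0, 1.0]),
         0.90: np.array([4.0, 0.2])}

rng = np.random.default_rng(3)
X = np.column_stack([np.ones(200), rng.uniform(0, 5, 200)])

# Fitted conditional quantiles for every observation, then the share of
# observations where a lower quantile line lies above a higher one
fitted = {t: X @ b for t, b in betas.items()}
crossings = np.mean((fitted[0.10] > fitted[0.50]) |
                    (fitted[0.50] > fitted[0.90]))
print(f"Share of observations with crossed quantile lines: {crossings:.1%}")
```

Here the crossing share is large because the slopes diverge sharply; with well-behaved fits the share should be near zero.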
F.2 Heterogeneity Test
Test whether the treatment effect is constant across quantiles. If the Wald test from sqreg fails to reject H0: β(τ) equal across quantiles, there is no statistical evidence that QTE adds information beyond the ATE.
F.3 Goodness of Fit
There is no R² analog for quantile regression with a universally agreed interpretation. The pseudo-R¹(τ) proposed by Koenker and Machado (1999) -- R¹(τ) = 1 − V̂(τ)/Ṽ(τ), where V̂(τ) is the minimized check-function objective and Ṽ(τ) is the check-function objective from the intercept-only model -- can be reported but should not be over-interpreted.
F.4 Sensitivity to Bandwidth (RIF Regression)
RIF regression requires a kernel density estimate f̂_Y(q̂_τ). The choice of bandwidth affects the RIF values and hence the estimated unconditional quantile effects. Report sensitivity to different bandwidth choices (e.g., Silverman's rule of thumb, an oversmoothed bandwidth, and half the default bandwidth).
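A minimal bandwidth-sensitivity sketch: evaluate the kernel density at the estimated quantile under half, default, and double the Silverman bandwidth, and see how the RIF scale factor 1/f̂ moves (simulated data, NumPy assumed):

```python
import numpy as np

rng = np.random.default_rng(4)
y = rng.lognormal(mean=0.0, sigma=0.6, size=3000)   # skewed outcome
tau = 0.10
q = np.quantile(y, tau)

def kde_at(point, data, h):
    # Gaussian kernel density estimate at a single evaluation point
    return np.mean(np.exp(-0.5 * ((data - point) / h) ** 2)) / (h * np.sqrt(2 * np.pi))

h0 = 1.06 * y.std() * len(y) ** (-1 / 5)            # Silverman's rule of thumb
f_hats = {label: kde_at(q, y, h)
          for label, h in [("half", 0.5 * h0), ("silverman", h0), ("double", 2 * h0)]}
for label, f in f_hats.items():
    print(f"{label:9s}: f_hat(q) = {f:.3f}, RIF scale 1/f_hat = {1/f:.2f}")
```

Because 1/f̂ multiplies every RIF value, proportional changes in f̂ translate directly into proportional changes in the estimated unconditional quantile effects; reporting this range makes the sensitivity transparent.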
Reading the Quantile Process
The quantile process plot is the primary tool for communicating QTE results. It shows the treatment coefficient on the vertical axis and the quantile on the horizontal axis, with a 95% confidence band.
Key patterns to look for:
- Flat line: The treatment effect is constant across the distribution. The ATE tells the whole story. QTE adds no information.
- Downward slope: The treatment helps the bottom of the distribution more than the top. The treatment compresses the distribution (reduces inequality).
- Upward slope: The treatment helps the top more than the bottom. The treatment widens the distribution (increases inequality).
- Zero crossing: The treatment helps some quantiles and hurts others. The ATE may be near zero despite large effects at specific quantiles.
- U-shape or inverted-U: The treatment has complex distributional effects, with the middle of the distribution responding differently from the tails.
Reporting Conventions
- Always report the ATE (OLS) alongside the QTE. The ATE is the benchmark. Readers need to see whether QTE reveals additional heterogeneity.
- Report QTE at standard quantiles in a table: τ ∈ {0.10, 0.25, 0.50, 0.75, 0.90} with standard errors and confidence intervals.
- Include the quantile process plot. The quantile process plot is the most informative visual. Show the OLS estimate as a horizontal dashed line with its confidence band for comparison.
- Report the interquantile range effect: QTE(0.90) - QTE(0.10). This quantity measures the treatment's effect on the spread of the distribution.
- Report the Wald test for equality of treatment effects across quantiles. If you fail to reject, acknowledge that QTE does not provide statistically significant evidence of distributional heterogeneity.
G. What Can Go Wrong
Conditional vs. Unconditional Quantile Confusion
Researcher uses RIF regression to estimate unconditional quantile effects of a minimum wage increase on the population wage distribution
The minimum wage increase raised the 10th percentile of the unconditional wage distribution by $0.85/hour (SE = 0.22, p < 0.001) and had no significant effect on the 90th percentile ($0.03, p = 0.88). The wage distribution compressed.
Crossing Quantile Lines Signal Misspecification
Linear quantile regression applied to log wages, where the conditional distribution is approximately symmetric and the linear model fits well across quantiles
Fitted quantile lines are well-separated: Q(0.10) < Q(0.50) < Q(0.90) for all observations, with no crossings detected. The linear conditional quantile model is adequate.
Extreme Quantile Instability
QTE estimated at tau in {0.10, 0.25, 0.50, 0.75, 0.90} with n = 5,000 observations. Each quantile has hundreds of observations in its neighborhood, and bootstrap confidence intervals are reasonably tight.
QTE at tau = 0.10: $2,100 (SE = 480). QTE at tau = 0.90: -$350 (SE = 520). Confidence intervals are informative and allow meaningful comparisons across quantiles.
Rank Invariance Assumed Without Justification
Researcher reports QTE as distributional shifts without claiming they represent individual-level effects. States: 'The 10th percentile of earnings increased by $3,500, but we cannot determine whether this reflects the effect on specific individuals or a reshuffling of ranks.'
The interpretation is honest about what QTE can and cannot identify. The distributional finding is policy-relevant (the bottom of the distribution improved) regardless of whether rank invariance holds.
H. Practice
H.1 Concept Checks
A researcher estimates a quantile regression of wages on a job training program indicator and finds: QTE(0.10) = \$3,500 (p < 0.001), QTE(0.50) = \$1,200 (p = 0.02), QTE(0.90) = -\$800 (p = 0.15). The OLS estimate is \$1,100 (p = 0.03). What does this pattern tell you?
A labor economist reports: 'We estimated the effect of unionization on wages using quantile regression at tau = 0.10, 0.25, 0.50, 0.75, and 0.90. The coefficients show that unions raise wages more at the bottom of the wage distribution, consistent with wage compression.' A critic responds: 'Your conditional quantile regression does not tell you about the unconditional wage distribution.' Is the critic correct?
You run simultaneous quantile regression (sqreg) and test the null hypothesis that the treatment coefficient is equal across the five quantiles {0.10, 0.25, 0.50, 0.75, 0.90}. The Wald test gives chi-squared = 5.2 with 4 degrees of freedom (p = 0.27). What should you conclude?
H.2 Guided Exercise
Interpreting QTE from a Job Training Program Evaluation
You evaluate a randomized job training program using data from 2,000 participants (1,000 treated, 1,000 control). The outcome is annual earnings ($). You estimate both OLS and conditional quantile regression with bootstrap SEs (500 replications). Your output:

| Method | tau | Coeff | Boot SE | 95% CI | p-value |
|---|---|---|---|---|---|
| OLS (mean) | -- | \$1,150 | \$420 | [\$327, \$1,973] | 0.006 |
| QR | 0.10 | \$3,200 | \$710 | [\$1,808, \$4,592] | < 0.001 |
| QR | 0.25 | \$2,050 | \$530 | [\$1,011, \$3,089] | < 0.001 |
| QR | 0.50 | \$1,100 | \$480 | [\$159, \$2,041] | 0.022 |
| QR | 0.75 | \$350 | \$620 | [-\$865, \$1,565] | 0.573 |
| QR | 0.90 | -\$650 | \$880 | [-\$2,375, \$1,075] | 0.460 |

Wald test for equality of treatment effects across quantiles: chi2(4) = 14.8, p = 0.005. Quantile crossing: 0 of 2,000 observations show Q(0.10) > Q(0.50); 0 show Q(0.50) > Q(0.90).

You also estimate RIF regression for unconditional quantile effects:

| tau | UQE | SE | p-value |
|---|---|---|---|
| 0.10 | \$2,800 | \$650 | < 0.001 |
| 0.50 | \$1,050 | \$440 | 0.017 |
| 0.90 | -\$400 | \$820 | 0.625 |
H.3 Error Detective
Read the analysis below carefully and identify the errors.
Select all errors you can find:
H.4 You Are the Referee
Read the paper summary below and write a brief referee critique (2-3 sentences) of the identification strategy.
Paper Summary
The authors study whether a minimum wage increase affected the earnings distribution differently across quantiles. Using state-level panel data from 2010-2020 (25 states, quarterly observations), they estimate conditional quantile regressions of log earnings on a minimum wage indicator, controlling for state unemployment rate, industry composition, and state and quarter fixed effects. They report that the minimum wage increase raised earnings significantly at the 10th percentile (\$0.45, p = 0.002) and 25th percentile (\$0.28, p = 0.01), had no effect at the median (\$0.05, p = 0.62), and reduced earnings at the 90th percentile (-\$0.18, p = 0.04). They conclude that the policy compressed the unconditional earnings distribution and recommend the policy as an inequality-reducing tool.
Key Table
| Variable | Coefficient | SE | p-value |
|---|---|---|---|
| Min wage x tau=0.10 | 0.450 | 0.145 | 0.002 |
| Min wage x tau=0.25 | 0.280 | 0.110 | 0.011 |
| Min wage x tau=0.50 | 0.050 | 0.098 | 0.620 |
| Min wage x tau=0.75 | -0.080 | 0.125 | 0.522 |
| Min wage x tau=0.90 | -0.180 | 0.088 | 0.041 |
| State FE | Yes | | |
| Quarter FE | Yes | | |
| N (state-quarters) | 2,500 | | |
Authors' Identification Claim
The authors claim that controlling for state and quarter fixed effects and observable confounders isolates the causal effect of the minimum wage on the earnings distribution at each quantile.
I. Swap-In: When to Use Something Else
- OLS Regression: Estimates the conditional mean effect. QTE at τ = 0.5 estimates the conditional median effect. If the outcome distribution is symmetric, these are similar.
- Causal Forests: Estimates heterogeneous treatment effects across the covariate space. Complementary to QTE: causal forests find subgroups with different effects; QTE finds distributional shifts.
- Matching: Can be combined with QTE by matching on propensity scores and then estimating quantile effects within the matched sample.
- Experimental Design: Randomization is the cleanest setting for QTE. With experimental data, the QTE at each quantile has a causal interpretation without functional form assumptions (Bitler et al., 2006).
J. Reviewer Checklist
Critical Reading Checklist
Paper Library
Foundational (4)
Koenker, R. & Bassett, G. (1978). Regression Quantiles.
The foundational paper introducing quantile regression. Proposes estimating conditional quantile functions by minimizing an asymmetric absolute loss (check function), generalizing least absolute deviations to arbitrary quantiles. Establishes asymptotic theory and demonstrates robustness to outliers and heteroscedasticity relative to OLS.
Firpo, S., Fortin, N. M., & Lemieux, T. (2009). Unconditional Quantile Regressions.
Introduces the recentered influence function (RIF) regression for estimating unconditional quantile effects. Shows that standard quantile regression estimates conditional quantile effects that do not aggregate to unconditional effects. RIF regression transforms the outcome variable so that OLS on the transformed outcome recovers the effect of covariates on unconditional quantiles. The key innovation enabling policy-relevant distributional analysis.
Chernozhukov, V. & Hansen, C. (2005). An IV Model of Quantile Treatment Effects.
Develops an instrumental variable framework for quantile regression to address endogeneity. Proposes the inverse quantile regression (IQR) method that exploits moment conditions implied by the structural quantile model. Provides conditions under which quantile treatment effects are identified with endogenous treatments, extending quantile regression to credible causal inference settings.
Machado, J. A. F. & Santos Silva, J. M. C. (2019). Quantiles via Moments.
Proposes a method for estimating conditional quantile functions in panel data with fixed effects. The method-of-moments quantile regression (MMQR) approach avoids the incidental parameters problem that plagues standard quantile regression with fixed effects. Enables quantile regression with individual fixed effects in short panels, filling a major gap in the panel data toolkit.
Application (2)
Angrist, J. D., Chernozhukov, V., & Fernandez-Val, I. (2006). Quantile Regression under Misspecification, with an Application to the U.S. Wage Structure.
Examines the interpretation of quantile regression coefficients under misspecification and applies the framework to study the U.S. wage structure. Shows that quantile regression coefficients have a causal interpretation as weighted averages of quantile treatment effects under appropriate conditions. Demonstrates how returns to education vary across the wage distribution, with larger returns at higher quantiles.
Bitler, M. P., Gelbach, J. B., & Hoynes, H. W. (2006). What Mean Impacts Miss: Distributional Effects of Welfare Reform Experiments.
A landmark application of quantile treatment effects to experimental data from the Connecticut Jobs First welfare reform program. Shows that the average treatment effect of the program masks dramatic heterogeneity: the program increased earnings for women at the bottom of the distribution but decreased earnings at the top. Demonstrates why distributional analysis is essential for evaluating social programs.