Lab·tutorial·11 min read

tutorial120 minutes

Lab: Event Studies from Scratch

Implement an event study: create event-time indicators, estimate dynamic treatment effects, visualize pre-trends and dynamics, and read the plot correctly.

Method: Event Studies (Dynamic Treatment Effects)
Languages: Python, R, Stata
Dataset: Corporate governance reform and firm performance (simulated)

Overview

Event studies extend the DiD framework by estimating separate treatment effects for each time period relative to treatment. Instead of a single "before vs. after" estimate, you trace out how the treatment effect evolves over time. The pre-treatment coefficients serve as a diagnostic for the parallel trends assumption.

What you will learn:

How to construct event-time (relative time) indicators
How to estimate a dynamic treatment effects regression
How to create and interpret the classic event study plot
How to test for pre-trends
Why you typically need to omit one reference period (and which one to choose)
What can go wrong with event study specifications

Prerequisites: DiD (see the DiD lab), Fixed Effects.

Step 1: The Setting

Imagine a corporate governance reform that is adopted by different firms at different times. We want to estimate (a) the dynamic path of the treatment effect and (b) whether there were pre-existing trends that might invalidate the design.

Step 2: Simulate Panel Data with Staggered Treatment

1# First-time setup: install.packages(c("fixest", "ggplot2"))
2library(fixest)
3library(ggplot2)
4
5set.seed(2024)
6n_firms <- 200
7n_periods <- 20
8
9# Treatment timing
10treatment_year <- rep(Inf, n_firms)
11treated_firms <- sample(n_firms, 100)
12for (i in treated_firms) {
13treatment_year[i] <- sample(2005:2014, 1)
14}
15
16# Build panel
17df <- data.frame()
18for (i in 1:n_firms) {
19firm_fe <- rnorm(1, 0, 2)
20for (t in 0:(n_periods - 1)) {
21  year <- 2000 + t
22  year_fe <- 0.3 * t
23
24  if (treatment_year[i] < Inf) {
25    event_time <- year - treatment_year[i]
26    treated <- 1
27    post <- as.integer(year >= treatment_year[i])
28    te <- ifelse(post, 1.0 + 0.5 * pmin(event_time, 5), 0)
29  } else {
30    event_time <- NA
31    treated <- 0
32    post <- 0
33    te <- 0
34  }
35
36  y <- 10 + firm_fe + year_fe + te + rnorm(1, 0, 1.5)
37  df <- rbind(df, data.frame(
38    firm_id = i, year = year, treated = treated, post = post,
39    event_time = event_time, performance = y,
40    treatment_year = ifelse(treatment_year[i] < Inf,
41                             treatment_year[i], NA)
42  ))
43}
44}
45
46cat("Panel:", nrow(df), "observations\n")
47cat("Treated firms:", sum(treatment_year < Inf), "\n")

Requiresfixest ggplot2

Expected output:

firm_id	year	treated	post	event_time	performance
0	2000	1	0	-9.0	8.12
0	2001	1	0	-8.0	9.45
0	2009	1	1	0.0	12.87
0	2010	1	1	1.0	13.54
150	2005	0	0	—	11.03

Summary statistics:

Statistic	Value
Panel dimensions	200 firms x 20 years = 4,000 obs
Treated firms	100
Never-treated firms	100
Treatment years	2005–2014 (staggered)
Mean performance (pre-treatment)	~10 + firm FE + time trend

Step 3: Create Event-Time Indicators

The event study regression requires dummy variables for each period relative to treatment. We normalize to one period before treatment ( $k = -1$ ) as the reference category.

1# Create event-time indicators
2# fixest makes this easy with the i() function
3# Clip event time to [-5, 5]
4df$event_time_clipped <- pmin(pmax(df$event_time, -5), 5)
5
6# For the manual approach, create dummies
7event_times <- c(-5, -4, -3, -2, 0, 1, 2, 3, 4, 5)
8
9for (k in event_times) {
10col_name <- paste0("et_", ifelse(k < 0, "m", ""), abs(k))
11df[[col_name]] <- as.integer(df$event_time_clipped == k & df$treated == 1)
12}
13
14cat("Event-time dummies created. Reference period: k = -1\n")

Requiresfixest

Expected output:

Event-time indicator	Observations with indicator = 1
et_-5 (binned, k <= -5)	~500 (treated obs 5+ years pre)
et_-4	~100
et_-3	~100
et_-2	~100
et_-1 (reference)	omitted
et_0	~100
et_1	~100
et_2	~100
et_3	~100
et_4	~100
et_5 (binned, k >= 5)	~500 (treated obs 5+ years post)

Step 4: Estimate the Event Study Regression

1# Event study using fixest (the easiest approach)
2# i(event_time_clipped, ref = -1) creates all dummies with -1 as reference
3m_es <- feols(performance ~ i(event_time_clipped, treated, ref = -1) |
4            firm_id + year,
5            data = df[!is.na(df$event_time) | df$treated == 0, ],
6            vcov = ~firm_id)
7
8summary(m_es)
9
10# The coefficients are the event study estimates

Requiresfixest

Expected output:

Event Time (k)	Coefficient	SE	95% CI Lower	95% CI Upper
-5	-0.02	0.18	-0.37	0.33
-4	0.05	0.20	-0.34	0.44
-3	-0.08	0.19	-0.45	0.29
-2	0.03	0.18	-0.32	0.38
-1	0.000	—	—	—
0	1.02	0.19	0.65	1.39
1	1.48	0.20	1.09	1.87
2	2.05	0.21	1.64	2.46
3	2.52	0.22	2.09	2.95
4	3.01	0.23	2.56	3.46
5	3.48	0.19	3.11	3.85

The pre-treatment coefficients (k = -5 to -2) are all close to zero and statistically insignificant, consistent with parallel trends. The post-treatment coefficients grow over time, matching the true DGP of 1.0 + 0.5 * min(k, 5).

Step 5: Create the Event Study Plot

The coefficient plot is the signature visualization of the event study design.

1# Event study plot using fixest
2iplot(m_es,
3    main = "Event Study: Governance Reform and Firm Performance",
4    xlab = "Event Time (Years Relative to Treatment)",
5    ylab = "Coefficient (Relative to k = -1)")
6
7# Add reference line at treatment onset
8abline(v = -0.5, lty = 2, col = "red")

Requiresfixest

Expected visualization: Event Study Plot

The event study coefficient plot should show:

X-axis: Event time (years relative to treatment), from -5 to +5
Y-axis: Coefficient (relative to k = -1, the omitted reference period)
Pre-treatment region (k = -5 to -2): Points scattered tightly around zero, with 95% confidence intervals crossing the zero line. This flat pattern is consistent with the parallel trends assumption.
Reference period (k = -1): Normalized to zero by construction (the omitted category).
Treatment: A dashed red vertical line at x = -0.5 marking the transition from pre to post.
Post-treatment region (k = 0 to +5): A clear upward staircase pattern. The coefficient jumps to approximately 1.0 at k = 0, then rises steadily to approximately 3.5 at k = 5. Confidence intervals are narrow and do not include zero, indicating statistically significant effects.
Overall shape: A "hockey stick" — flat on the left side, rising on the right side.

Concept Check

In your event study plot, the pre-treatment coefficients (k = -5 to k = -2) are all close to zero and statistically insignificant. What can you conclude?

Parallel trends definitely holds, so the DiD estimate is causal.The results are consistent with the parallel trends assumption, but this consistency is a necessary condition, not sufficient evidence. The pre-trend test may also lack power.There is no treatment effect because all coefficients are near zero.Increase the number of pre-treatment periods to improve the test.

Step 6: Interpreting the Post-Treatment Dynamics

The post-treatment coefficients tell a story about how the treatment effect evolves over time.

1# Extract and interpret post-treatment coefficients
2coefs <- coeftable(m_es)
3cat("Post-Treatment Dynamics:\n")
4cat("The effect grows over time as the governance reform takes hold.\n\n")
5
6# True effects for comparison
7for (k in 0:5) {
8true_te <- 1.0 + 0.5 * min(k, 5)
9cat(sprintf("  k = %+d: True = %+.3f\n", k, true_te))
10}

Expected output:

Event Time (k)	Estimated Effect	True Effect (DGP)	Interpretation
k = +0	~1.0	1.0	Immediate effect at adoption
k = +1	~1.5	1.5	Effect after 1 year
k = +2	~2.0	2.0	Effect after 2 years
k = +3	~2.5	2.5	Effect after 3 years
k = +4	~3.0	3.0	Effect after 4 years
k = +5	~3.5	3.5	Effect after 5 years (plateau)

The gradually building pattern reflects a reform that takes time to produce its full impact. The DGP generates this pattern with the formula: treatment effect = 1.0 + 0.5 * min(k, 5).

Step 7: What Can Go Wrong

Pre-trend Violation

1# Simulate pre-trend violation (similar to Python code)
2# When you see an upward slope in pre-treatment coefficients,
3# parallel trends is violated.
4# The post-treatment coefficients mix the true effect
5# with the pre-existing differential trend.
6
7cat("If pre-treatment coefficients slope upward, this indicates\n")
8cat("a violation of parallel trends. The post-treatment estimates\n")
9cat("are contaminated by the pre-existing trend.\n")

Expected visualization: Event Study with Pre-Trend Violation

The event study plot for the biased DGP should show:

Pre-treatment region (k = -5 to -2): A clear upward slope, with coefficients rising from approximately -0.5 at k = -5 to near 0 at k = -1. This ascending pattern is a red flag signaling differential pre-trends between treated and control groups.
Post-treatment region (k = 0 to +5): Coefficients continue rising, but it is generally infeasible to disentangle the true treatment effect (1.0 in the DGP) from the continuation of the pre-existing differential trend (0.2 per year for treated firms). The post-treatment estimates are contaminated.
Key contrast with the clean design: Unlike the first event study plot, the pre-treatment coefficients are not flat. This pattern suggests the parallel trends assumption is violated.

Concept Check

Your event study shows a clear upward slope in pre-treatment coefficients (k = -5 to k = -2). Your advisor suggests 'just detrending' by subtracting the pre-treatment trend from the post-treatment coefficients. Is this a good solution?

Yes — detrending removes the bias and recovers the true treatment effect.No — detrending assumes the pre-existing trend would have continued at the same rate, which is a strong and untestable assumption. The pre-trend violation undermines the entire design.It depends on whether the p-values on the pre-trend coefficients are below 0.05.Yes, if you use a non-parametric trend rather than a linear one.

Step 8: Advanced Considerations

Binning Endpoint Periods

In most settings, bin distant event times to avoid sparse cells. For example, group all $k \leq -5$ into a single " $k \leq -5$ " dummy and all $k \geq 5$ into " $k \geq 5$ ".

Joint Test for Pre-Trends

Instead of eyeballing individual pre-trend coefficients, formally test whether they are jointly zero.

# Joint test for pre-trends using fixest
# wald() function tests if specified coefficients are jointly zero
wald(m_es, "event_time_clipped::-[2-5]")

cat("p > 0.05: Cannot reject that pre-trends are jointly zero\n")

Requiresfixest

Expected output:

Statistic	Value
Joint F-statistic	~0.5–2.0
Degrees of freedom	4 (testing et_-5, et_-4, et_-3, et_-2)
p-value	> 0.05 (typically 0.3–0.8)
Conclusion	Cannot reject that pre-trends are jointly zero

The joint F-test confirms what the event study plot shows visually: the pre-treatment coefficients are not distinguishable from zero, consistent with the parallel trends assumption.

Aggregating Post-Treatment Effects

Sometimes you want a single summary treatment effect rather than period-by-period estimates.

# Simple summary DiD estimate
df$treat_post <- df$treated * df$post
m_did <- feols(performance ~ treat_post | firm_id + year,
             data = df, vcov = ~firm_id)
cat("Summary DiD estimate:", coef(m_did)["treat_post"], "\n")

Requiresdid

Expected output:

Estimator	Estimate	Interpretation
Average post-treatment coefficient	~2.25	Mean of event-time coefficients at k = 0 through k = 5
Static DiD (TWFE)	~2.0–2.5	Single summary treatment effect averaging across all post-treatment periods

The average of the true DGP treatment effects across k = 0 to 5 is (1.0 + 1.5 + 2.0 + 2.5 + 3.0 + 3.5) / 6 = 2.25.

Step 9: Exercises

Change the reference period. Re-estimate with $k = -2$ as the reference period. How do the coefficients and plot change? Which reference period makes more sense for your setting?
Test for anticipation effects. What if firms change behavior before the formal treatment date? Allow the treatment effect to begin one period before the official date and re-estimate.
Staggered DiD concerns. With staggered treatment timing and heterogeneous treatment effects, the standard TWFE event study can be biased (Sun & Abraham, 2021). Read about this issue and think about when it matters.
Placebo test. For the never-treated firms, assign a random "fake" treatment year and estimate the event study. All coefficients should be near zero.

Expected output

If your code runs correctly, expect to see:

Pre-treatment coefficients (k = -5 to -2): Close to zero (within one SE of zero), consistent with parallel trends
Reference period (k = -1): Normalized to zero (omitted category)
Post-treatment coefficients: Growing over time — approximately 1.0 at k=0, 1.5 at k=1, 2.0 at k=2, up to around 3.5 at k=5 (true DGP: 1.0 + 0.5 * min(k, 5))
Joint F-test for pre-trends: Fails to reject the null that all pre-treatment coefficients equal zero (p > 0.05)
TWFE (static DiD) estimate: A single positive coefficient summarizing the average post-treatment effect, around 1.5–3.0
Event study plot: Classic pattern of flat pre-trends and rising post-treatment effects
Panel dimensions: 200 firms x 20 years = 4,000 observations, with 100 treated firms

Summary

In this lab you learned:

Event studies estimate separate treatment effects for each period relative to treatment
You typically need to omit one reference period (typically $k = -1$ ) to avoid multicollinearity
Flat pre-treatment coefficients are consistent with parallel trends but do not prove it
The post-treatment pattern reveals dynamic treatment effects (growing, stable, or fading)
In most settings, include firm and year fixed effects and cluster standard errors appropriately
Bin endpoint event times to avoid sparse cells
Use a joint F-test for pre-trends rather than eyeballing individual coefficients
Pre-trend violations are a fundamental threat — "detrending" is not a reliable fix
With staggered treatment, be aware of potential biases in standard TWFE event studies

Overview#

Step 1: The Setting#

Step 2: Simulate Panel Data with Staggered Treatment#

Step 3: Create Event-Time Indicators#

Step 4: Estimate the Event Study Regression#

Step 5: Create the Event Study Plot#

Step 6: Interpreting the Post-Treatment Dynamics#

Step 7: What Can Go Wrong#

Pre-trend Violation#

Step 8: Advanced Considerations#

Binning Endpoint Periods#

Joint Test for Pre-Trends#

Aggregating Post-Treatment Effects#

Step 9: Exercises#

Summary#

Overview

Step 1: The Setting

Step 2: Simulate Panel Data with Staggered Treatment

Step 3: Create Event-Time Indicators

Step 4: Estimate the Event Study Regression

Step 5: Create the Event Study Plot

Step 6: Interpreting the Post-Treatment Dynamics

Step 7: What Can Go Wrong

Pre-trend Violation

Step 8: Advanced Considerations

Binning Endpoint Periods

Joint Test for Pre-Trends

Aggregating Post-Treatment Effects

Step 9: Exercises

Summary