Pre-Analysis Plans & Pre-Registration
Commit to your analysis before seeing the results — the antidote to the garden of forking paths.
When to Use Pre-Registration
Pre-register any time you have researcher degrees of freedom — choices about outcomes, specifications, sample restrictions, or subgroups — that could influence results. This includes: randomized controlled trials (register before data collection), quasi-experimental studies where the data source and policy change are known (register before analysis), studies with multiple outcomes or subgroup analyses, and any project where you want to credibly distinguish confirmatory from exploratory findings. Registration is most valuable before data collection, but registering before analysis still provides meaningful constraint.
Why It Matters
Pre-registration separates confirmatory from exploratory analysis. Without it, readers cannot distinguish genuine findings from results that emerged through specification searching. A time-stamped analysis plan demonstrates that your results were not reverse-engineered from the data, which is why top journals in economics and political science now routinely require or reward pre-registration for experimental studies.
Binding Your Own Hands
Here is a confession that no one in academia enjoys making: given a dataset, a smart researcher can almost always find something that looks significant. Not through fraud. Not through conscious manipulation. Just through the ordinary, human process of exploring data, trying different specifications, and — without realizing it — gravitating toward the results that tell the most interesting story.
This temptation is not a moral failing. It is a structural problem with how hypothesis testing works. The p-value only means what it claims to mean (the probability of seeing data this extreme under the null) if the analysis was specified before looking at the data. The moment you let the data influence your analytic choices — which variables to include, how to define the sample, which outcomes to emphasize — the p-value loses its interpretation.
Pre-analysis plans are the solution. They are documents, written and time-stamped before you analyze your data, that specify exactly what you plan to do. They bind your hands against inadvertent data mining and give your results a credibility that post-hoc analyses cannot match.
If this idea sounds constraining, it should. The constraint is the point.
The Garden of Forking Paths
Andrew Gelman and Eric Loken coined a vivid metaphor for this problem: the garden of forking paths.
At every stage of analysis, you face choices:
- How do you define the treatment? (Binary? Continuous? Dosage?)
- Which control variables do you include?
- How do you handle outliers? (Winsorize? Trim? Log-transform?)
- Which sample restrictions do you impose? (Age cutoffs? Time windows?)
- Which outcomes do you emphasize?
- Which subgroups do you examine?
- How do you cluster standard errors?
Each choice is a fork. Each fork leads to a different result. If you make these choices after seeing the data — even subconsciously — you are walking a path through the garden that the data itself has selected. The resulting p-value no longer reflects the probability you think it does.
The forking paths problem is distinct from p-hacking. P-hacking implies intent. The forking paths problem arises even with the most careful intentions, because researchers naturally gravitate toward specifications that "make sense" — and "makes sense" is often a synonym for "gives interesting results."
The numbers are stark even without deliberate p-hacking. With 81 candidate specifications (for example, three choices at each of four forks), the probability that at least one yields p < 0.05 under the null is 0.98.
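This kind of inflation is easy to check yourself. The following Python sketch (illustrative; it treats the candidate specifications as independent tests of a true null, the worst case) compares the analytic benchmark against a quick simulation:

```python
import numpy as np

rng = np.random.default_rng(0)

def prob_any_significant(n_specs, alpha=0.05, n_sims=2000):
    # Under the null, each test's p-value is uniform on [0, 1].
    # Worst case: treat the candidate specifications as independent tests
    # and ask how often the *minimum* p-value clears the threshold.
    pvals = rng.uniform(size=(n_sims, n_specs))
    return float((pvals.min(axis=1) < alpha).mean())

# Three choices at each of four forks -> 3**4 = 81 candidate specifications
analytic = 1 - (1 - 0.05) ** 81   # ~0.98
simulated = prob_any_significant(81)
print(round(analytic, 3), round(simulated, 3))
```

Real specifications computed on the same data are correlated rather than independent, so the true inflation is somewhat smaller, but the qualitative lesson survives: enough forks nearly guarantee a "significant" result under the null.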
What Goes in a Pre-Analysis Plan
A good pre-analysis plan (PAP) has the following components. Not every study needs all of them, but the more you can specify in advance, the stronger your credibility.
1. Research Question and Hypotheses
State your hypotheses clearly, in directional terms when possible. "We hypothesize that the program will increase test scores" is better than "we will examine the effect of the program on test scores."
2. Data Description
- What data will you use? (Survey, administrative, experimental)
- What is the sample? (Who is included, who is excluded, and why?)
- What is the unit of observation?
- When was/will the data be collected?
3. Variable Definitions
- Treatment variable: How is treatment defined? What constitutes treatment and control?
- Primary outcome(s): The main outcome variable(s) you will analyze. Be precise about construction (e.g., "math test score, standardized to have mean zero and standard deviation one in the control group").
- Secondary outcomes: Outcomes you will examine but that are not the main focus.
- Control variables: Which covariates will you include and why?
4. Estimation Strategy
- What is the estimating equation? Write it out explicitly (e.g., Y_i = α + β·Treat_i + X_i′γ + ε_i).
- What standard errors will you use? (Robust? Clustered? At what level?)
- What is the unit of randomization vs. unit of analysis?
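As a sketch of what the pre-specified estimation might look like once the data arrive, here is an illustrative Python/statsmodels version (the variable names and data are hypothetical; the pre-registered covariates and HC1 robust errors are the kind of choices a PAP would fix in advance):

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(1)
n = 500

# Hypothetical data standing in for the pre-registered study
df = pd.DataFrame({
    "treat": rng.integers(0, 2, n),
    "age": rng.integers(22, 60, n),
    "female": rng.integers(0, 2, n),
})
df["y"] = 0.3 * df["treat"] + 0.01 * df["age"] + rng.standard_normal(n)

# ITT regression with pre-specified covariates and
# heteroskedasticity-robust (HC1) standard errors
fit = smf.ols("y ~ treat + age + female", data=df).fit(cov_type="HC1")
print(fit.params["treat"], fit.bse["treat"])
```

The point of writing this down in the PAP is that covariates, error structure, and estimand are fixed before the first regression is run, not after.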
5. Subgroup Analyses
- Which subgroups will you examine? (By gender, age, baseline characteristics?)
- Are these exploratory or confirmatory?
6. Multiple Testing
- How will you handle multiple outcomes? (Bonferroni? FDR? Romano-Wolf? Index construction?)
- Which outcomes are grouped into families?
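Of the corrections listed above, the Holm step-down adjustment is simple enough to sketch directly (illustrative Python; the family of p-values is made up):

```python
def holm_adjust(pvals):
    """Holm step-down adjusted p-values for a family of m tests.
    Controls the familywise error rate with less of a penalty than
    plain Bonferroni."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, i in enumerate(order):
        # Step-down multiplier shrinks with rank: m, m-1, ..., 1
        p_adj = min(1.0, (m - rank) * pvals[i])
        running_max = max(running_max, p_adj)  # enforce monotonicity
        adjusted[i] = running_max
    return adjusted

# Hypothetical family of three outcome p-values
print(holm_adjust([0.01, 0.04, 0.03]))  # -> approximately [0.03, 0.06, 0.06]
```

Note how the raw p = 0.03 and p = 0.04 both fail at the 5% level after adjustment, which is exactly the discipline a pre-specified family of outcomes imposes.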
7. Sample Size and Power Calculations
- What is your expected sample size?
- What is the minimum detectable effect?
- What assumptions underlie the power calculation?
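For a binary outcome, the minimum detectable effect can be sketched with a standard two-arm formula (illustrative Python; the 50% control mean and 500-per-arm sample are assumptions for the example, and the control-group variance is used as an approximation for both arms):

```python
from math import sqrt
from statistics import NormalDist

def mde_proportion(p_control, n_per_arm, alpha=0.05, power=0.80):
    """Approximate minimum detectable effect for a two-arm comparison
    of proportions, using the control-group variance for both arms."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # ~1.96 for alpha = 0.05
    z_power = NormalDist().inv_cdf(power)          # ~0.84 for 80% power
    se = sqrt(2 * p_control * (1 - p_control) / n_per_arm)
    return (z_alpha + z_power) * se

# Hypothetical inputs: 50% control mean, 500 participants per arm
print(round(mde_proportion(0.50, 500), 3))
```

Stating the formula and its inputs in the PAP lets readers audit whether the study was adequately powered for the effects it claims to test.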
8. Missing Data and Attrition
- How will you handle missing data? (Complete cases? Imputation?)
- What is your plan if attrition is differential?
- Will you compute Lee bounds?
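The logic of Lee bounds can be sketched in a few lines (illustrative Python; the data, response rates, and effect size are hypothetical, and this is a bare-bones version of the trimming idea, not a full implementation):

```python
import numpy as np

def lee_bounds(y_treat, y_control, resp_treat, resp_control):
    """Illustrative Lee (2009)-style bounds under differential attrition.
    Assumes the treatment arm has the higher response rate; trims the
    excess share q from the top (lower bound) and the bottom (upper
    bound) of the observed treated outcomes."""
    q = (resp_treat - resp_control) / resp_treat
    y = np.sort(np.asarray(y_treat))
    k = int(round(q * len(y)))
    lower = y[:len(y) - k].mean() - np.mean(y_control)  # drop top q share
    upper = y[k:].mean() - np.mean(y_control)           # drop bottom q share
    return lower, upper

rng = np.random.default_rng(2)
y_t = rng.normal(0.3, 1, 900)   # 900 observed of 1,000 assigned (90%)
y_c = rng.normal(0.0, 1, 800)   # 800 observed of 1,000 assigned (80%)
lo, hi = lee_bounds(y_t, y_c, resp_treat=0.9, resp_control=0.8)
print(round(lo, 2), round(hi, 2))
```

The width of the interval grows with the attrition differential, which is why pre-specifying a threshold (and the bounding procedure itself) belongs in the plan.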
9. Deviations Protocol
- Under what circumstances would you deviate from the plan?
- How will you flag deviations in the paper?
Where to Register
Three major registries dominate the social sciences:
AEA RCT Registry
- Best for: Randomized controlled trials in economics
- Features: Time-stamped, publicly searchable, can embargo until publication
- Cost: Free
- Accepted by: All top economics journals
EGAP Registry
- Best for: Experiments and observational studies in political science and policy
- Features: Strong community of practice, design-based studies
- Cost: Free
OSF (Open Science Framework)
- Best for: Any study in any field, including observational studies
- Features: Most flexible, integrates with GitHub, supports pre-prints and data hosting
- Cost: Free
How to Report Pre-Registration
Once you have pre-registered your study, you need to reference it properly in your paper and handle the inevitable deviations transparently.
Referencing the Pre-Registration in Your Paper
Papers based on a pre-registered study typically include the registration details. In the methods section or a footnote, provide:
- The registry name and registration number (e.g., "AEA RCT Registry #AEARCTR-0005432")
- The date of registration
- A URL or DOI linking to the pre-analysis plan
- Whether the PAP was registered before data collection, before data analysis, or after data collection but before analysis
Template language for the methods section:
This study was pre-registered on [Registry Name] on [Date] (Registration #[Number]; [URL]). The pre-analysis plan specified the primary outcomes, estimation strategy, and subgroup analyses reported below. All deviations from the pre-analysis plan are explicitly noted.
Handling Deviations from the Plan
You will almost certainly deviate from your pre-analysis plan. This deviation is normal and expected. What matters is transparency. For each deviation:
- Report the pre-registered analysis first. In most settings, show what you originally planned, even if you now believe a different specification is better.
- Present the revised analysis alongside it. Show both versions so readers can assess the sensitivity.
- Explain why you deviated. Common reasons include discovering data quality issues, a variable being unavailable, or a pre-specified model failing to converge. State the reason clearly.
- Flag the deviation explicitly. Use a footnote, a dedicated paragraph, or a table note to mark each change from the plan.
Reporting Pre-Registered and Exploratory Analyses Together
A well-structured paper contains both types of analyses, clearly distinguished:
- Confirmatory analyses are those specified in the PAP. Present these as your primary results. They carry the strongest evidentiary weight because they were chosen before seeing the data.
- Exploratory analyses are everything else — additional subgroups, alternative specifications, new outcomes you discovered along the way. These results are valuable and merit reporting, but they must be clearly labeled as exploratory.
A clean structure looks like this:
- Main results section: Report the pre-registered primary specification and primary outcomes.
- Additional results or extensions: Report exploratory analyses, clearly labeled. Use language like "In exploratory analyses not included in our pre-analysis plan, we find that..."
- Robustness section: Show that results hold under alternative specifications, including both pre-registered robustness checks and additional ones.
Template language for exploratory findings:
The following analyses were not included in our pre-analysis plan and should be interpreted as exploratory. We report them for transparency and to motivate future pre-registered investigations.
How to Do It: Code
While pre-registration is primarily a planning exercise, several tools help structure and document your pre-analysis plan:
# DeclareDesign helps you formally declare your research design
# before collecting data
library(DeclareDesign)

design <- declare_model(
  N = 500, U = rnorm(N),
  potential_outcomes(Y ~ 0.3 * Z + U)
) +
  declare_inquiry(ATE = mean(Y_Z_1 - Y_Z_0)) +
  declare_assignment(Z = complete_ra(N, m = 250)) +
  declare_measurement(Y = reveal_outcomes(Y ~ Z)) +
  declare_estimator(Y ~ Z, inquiry = "ATE")

# Diagnose expected power before running the study
diagnose_design(design, sims = 500)

A Template Walkthrough
Here is a concrete example of what a pre-analysis plan looks like for a hypothetical study evaluating a job training program:
Title: The Effect of WorkReady Job Training on Employment and Earnings
Hypotheses: H1: WorkReady increases employment rates at 12 months (primary). H2: WorkReady increases quarterly earnings at 12 months (primary). H3: WorkReady increases employment rates at 24 months (secondary).
Design: Randomized controlled trial. 2,000 applicants randomized 1:1 to treatment/control.
Primary specification: Y_i = α + β·Treat_i + X_i′γ + ε_i, where X_i includes age, gender, education, and baseline earnings (all pre-specified). Standard errors: robust (HC1). Estimand: intent-to-treat (ITT).
Multiple testing: Primary outcomes (H1, H2) will be adjusted using Holm step-down. Secondary outcome (H3) reported with unadjusted p-values but flagged as secondary.
Subgroups: We will examine heterogeneity by gender and by baseline education (above/below median). These subgroup analyses are labeled as exploratory.
Attrition: If differential attrition exceeds 5 percentage points, we will compute Lee bounds.
Power: With N = 2,000, we can detect a 4-percentage-point increase in employment (from a control mean of 55%) with 80% power at the 5% level.
The Promises and Perils
Olken (2015) offers a thoughtful assessment of both the benefits and costs of pre-analysis plans.
Promises:
- Eliminates (or greatly reduces) data mining and specification searching
- Distinguishes confirmatory from exploratory analyses
- Increases the credibility of significant results
- Provides a clear record of what was planned vs. what was discovered
Perils:
- Can be excessively rigid, preventing researchers from learning from the data
- May discourage creative exploration that leads to genuine discoveries
- A badly written PAP can lock you into a bad specification
- Some researchers write vague PAPs that do not actually constrain anything
The resolution is straightforward: pre-register your confirmatory analyses, clearly label any deviations, and conduct exploratory analyses freely — just call them exploratory. A paper can contain both pre-registered and exploratory results — this is the norm, not the exception. The key is transparency about which is which.
Concept Check
A researcher pre-registers a study with one primary outcome (test scores) and five secondary outcomes. The primary outcome shows p = 0.04 and two secondary outcomes show p = 0.03 each. What should the researcher report?
Paper Library
Foundational (5)
Nosek, B. A., Ebersole, C. R., DeHaven, A. C., & Mellor, D. T. (2018). The Preregistration Revolution.
Nosek and colleagues made the case for widespread adoption of pre-registration, arguing that it distinguishes confirmatory from exploratory analyses, reduces publication bias, and increases the credibility of empirical research. This paper helped catalyze the pre-registration movement across the social sciences.
Miguel, E., Camerer, C., Casey, K., Cohen, J., Esterling, K. M., Gerber, A., Glennerster, R., Green, D. P., Humphreys, M., Imbens, G., Laitin, D., Madon, T., Nelson, L., Nosek, B. A., Petersen, M., Sedlmayr, R., Simmons, J. P., Simonsohn, U., & Van der Laan, M. (2014). Promoting Transparency in Social Science Research.
A coalition of leading social scientists called for greater transparency in research, including pre-registration of studies and analysis plans, open data, and replication. This short but influential piece in Science helped establish the norms and infrastructure for pre-registration in social science.
Simmons, J. P., Nelson, L. D., & Simonsohn, U. (2011). False-Positive Psychology: Undisclosed Flexibility in Data Collection and Analysis Allows Presenting Anything as Significant.
Simmons, Nelson, and Simonsohn demonstrated how researcher degrees of freedom in data collection and analysis can inflate false-positive rates dramatically. Their paper, which proposed disclosure requirements and pre-registration as solutions, was one of the catalysts for the replication crisis and pre-registration movement.
Gelman, A., & Loken, E. (2013). The Garden of Forking Paths: Why Multiple Comparisons Can Be a Problem, Even When There Is No 'Fishing Expedition' or 'p-Hacking' and the Research Hypothesis Was Posited Ahead of Time.
Gelman and Loken argued that even without deliberate p-hacking, the multitude of defensible analytical choices creates a 'garden of forking paths' that inflates false-positive rates. This influential working paper provided a key intellectual motivation for pre-registration by showing that researcher degrees of freedom are unavoidable without pre-commitment.
Gelman, A., & Loken, E. (2014). The Statistical Crisis in Science.
Gelman and Loken summarized the statistical crisis in science, emphasizing how researcher degrees of freedom and the garden of forking paths lead to unreliable findings. This accessible piece extended their 2013 working paper and reinforced the case for pre-registration as a solution to the replication crisis.
Application (5)
Olken, B. A. (2015). Promises and Perils of Pre-Analysis Plans.
Olken provided a balanced assessment of pre-analysis plans in development economics, discussing both benefits (reduced specification searching, increased credibility) and costs (loss of flexibility, difficulty specifying analyses in advance). This paper is essential reading for understanding the practical tradeoffs of pre-registration.
Coffman, L. C., & Niederle, M. (2015). Pre-Analysis Plans Have Limited Upside, Especially Where Replications Are Feasible.
Coffman and Niederle offered a skeptical perspective on pre-analysis plans, arguing that their benefits are limited when replication is feasible and that rigid adherence to pre-specified analyses can prevent researchers from learning from the data. This paper provides important counterarguments in the pre-registration debate.
Aguinis, H., Ramani, R. S., & Alabduljader, N. (2018). What You See Is What You Get? Enhancing Methodological Transparency in Management Research.
Aguinis, Ramani, and Alabduljader reviewed methodological transparency in management research and advocated for pre-registration, open data, and open materials. They documented the extent of undisclosed analytical flexibility in management studies and proposed concrete steps for improvement.
Christensen, G., & Miguel, E. (2018). Transparency, Reproducibility, and the Credibility of Economics Research.
Christensen and Miguel surveyed the transparency and reproducibility landscape in economics, documenting the growing adoption of pre-registration through the AEA RCT Registry and other platforms. They presented evidence on the prevalence of specification searching and publication bias, and made the case that pre-registration combined with pre-analysis plans substantially improves the credibility of empirical findings.
Casey, K., Glennerster, R., & Miguel, E. (2012). Reshaping Institutions: Evidence on Aid Impacts Using a Preanalysis Plan.
Casey, Glennerster, and Miguel provided one of the most prominent examples of a pre-analysis plan in economics, pre-registering their analysis of a community-driven development program in Sierra Leone. They demonstrated both the benefits of pre-commitment and the practical challenges of adhering to a pre-specified plan.
Survey (1)
Haven, T. L., & Van Grootel, L. (2019). Preregistering Qualitative Research.
Haven and Van Grootel explored extending pre-registration to qualitative research, discussing what elements of qualitative studies can and should be pre-registered. This paper broadens the pre-registration conversation beyond quantitative experimental designs.