Four programs. One learning infrastructure.

ES supports municipalities, schools, agencies, and civic institutions in designing, running, measuring, and sharing policy experiments. Good first pilots are boring in the best sense: practical, measurable, low-risk, and locally useful.

Civic Experiment Sprints

Run 90-day experiments with public institutions. Every sprint produces a public report with the question, design, result, limitations, equity check, and replication notes.

  • Does SMS increase library attendance?
  • Which permit workflow reduces processing delays?
  • Which outreach increases school program participation?
  • Which park design increases usage of underused spaces?

Prioritize pilots with

  • A clear public-service question
  • A reversible intervention
  • A measurable outcome within 30–90 days
  • Low downside risk
  • An implementation owner
  • Available baseline data
  • No serious equity or rights concern
  • A result that another municipality could plausibly replicate

Avoid early pilots involving

  • Policing
  • Welfare eligibility
  • School discipline
  • Criminal justice
  • Medical treatment
  • Housing access
  • Immigration
  • Any setting where randomization could materially affect rights, safety, or access to essential services

Recommended first wedge

Libraries are likely the strongest first pilot category. Civic trust is high, interventions are usually low-risk, outcomes are measurable, participation is voluntary, local staff are mission-aligned, and findings can travel across municipalities.

Public Experiment Registry

An open repository of civic experiments. Every record includes hypothesis, design, outcomes, effect size, limitations, and replication status. Null, negative, inconclusive, and harmful results are included — that is one of the main public goods.

View the registry schema →

Municipal Fellows

Train local leaders in the skills required to run rigorous civic experiments. Build local experimentation capacity that persists after the fellowship ends.

Curriculum

Causal inference
Experimentation
Ethics
Measurement
Decision science

Evidence Commons

Shared infrastructure for civic experimentation: experiment templates, dashboards, analysis tools, and implementation guides. ES can run on spreadsheets and lightweight tools for early pilots. Advanced workflows may integrate with DoOperator and Decision Process infrastructure while communities retain full data ownership.

Three ready-to-run designs.

Good first pilots are boring in the best sense. These templates are practical, measurable, low-risk, and locally useful.

Template 1

Library Attendance Sprint

Which outreach message increases attendance at free library programs?

Design

  • Randomly assign eligible patrons to message variants
  • Measure registrations, attendance, opt-outs, and complaints
  • Publish message text, sample size, effect estimate, and limitations
  • Include replication guidance

Template 2

Permit Workflow Sprint

Does a clearer checklist reduce incomplete permit applications?

Design

  • Compare current instructions against a revised checklist for new applicants
  • Measure completion rate, staff follow-up time, and processing delay
  • Measure applicant satisfaction
  • Avoid withholding required information from anyone

Template 3

Parks Usage Sprint

Which low-cost signage or programming prompt increases use of an underused public space?

Design

  • Rotate signage or programming prompts by location or week
  • Measure foot traffic, QR scans, and event participation
  • Check whether effects differ by neighborhood or time of day
  • Publish maintenance and equity findings

Every experiment produces a public record.

The public should be able to understand the conclusion without trusting a black box. Every completed experiment produces a short public report in this format.

01

Question

What uncertainty did the institution want to reduce?

02

Context

Where was the experiment run, and why did it matter locally?

03

Design

Who or what was assigned, what varied, and over what period?

04

Outcomes

What was measured, including guardrails and complaints.

05

Result

Effect size, uncertainty, and practical interpretation.

06

Limitations

What the result cannot prove.

07

Equity check

Whether effects or burdens differed across groups.

08

Decision

Continue, expand, revise, stop, or replicate.

09

Replication notes

What another community would need to try it.

10

Data availability

What aggregate data, code, or templates can be shared.

Four phases. One pilot at a time.

Phase 0

Discovery

Months 0–3

  • Website
  • Concept memo
  • 20 stakeholder interviews
  • Identify 1 pilot

One signed pilot

Phase 1

Pilot

Months 3–6

  • Experiment charter
  • Implementation
  • Public report

One measurable improvement

Phase 2

Early Network

Months 6–18

  • 5–10 experiments
  • Annual report
  • Registry launch

Evidence of replication

Phase 3

Institutionalization

Years 2–3

  • Nonprofit launch
  • Fellowship cohort
  • Annual conference

Independent municipalities adopting methods

Ready to run your first experiment?

We are in Phase 0. One signed pilot, one public result. If you lead a library, parks department, school district, or civic agency — let’s talk.