Programs

Four programs. One learning infrastructure.

ES supports municipalities, schools, agencies, and civic institutions in designing, running, measuring, and sharing policy experiments. Good first pilots are boring in the best sense: practical, measurable, low-risk, and locally useful.

Program 1

Civic Experiment Sprints

Run 90-day experiments with public institutions. Every sprint produces a public report with the question, design, result, limitations, equity check, and replication notes.

Example questions

Does SMS increase library attendance?
Which permit workflow reduces processing delays?
Which outreach increases school program participation?
Which park design increases usage of underused spaces?

Pilot selection

Prioritize pilots with

✓A clear public-service question
✓A reversible intervention
✓A measurable outcome within 30–90 days
✓Low downside risk
✓An implementation owner
✓Available baseline data
✓No serious equity or rights concern
✓A result that another municipality could plausibly replicate

Avoid early pilots involving

✕Policing
✕Welfare eligibility
✕School discipline
✕Criminal justice
✕Medical treatment
✕Housing access
✕Immigration
✕Any setting where randomization could materially affect rights, safety, or access to essential services

Recommended first wedge

Libraries are likely the strongest first pilot category. Civic trust is high, interventions are usually low-risk, outcomes are measurable, participation is voluntary, local staff are mission-aligned, and findings can travel across municipalities.

Program 2

Public Experiment Registry

An open repository of civic experiments. Every record includes hypothesis, design, outcomes, effect size, limitations, and replication status. Null, negative, inconclusive, and harmful results are included — that is one of the main public goods.

View the registry schema →

Program 3

Municipal Fellows

Train local leaders in the skills required to run rigorous civic experiments. Build local experimentation capacity that persists after the fellowship ends.

Curriculum

Causal inference

Experimentation

Ethics

Measurement

Decision science

Program 4

Evidence Commons

Shared infrastructure for civic experimentation: experiment templates, dashboards, analysis tools, and implementation guides. ES can run on spreadsheets and lightweight tools for early pilots. Advanced workflows may integrate with DoOperator and Decision Process infrastructure while communities retain full data ownership.

First Pilot Templates

Three ready-to-run designs.

Good first pilots are boring in the best sense. These templates are practical, measurable, low-risk, and locally useful.

Template 1

Library Attendance Sprint

Which outreach message increases attendance at free library programs?

Design

Randomly assign eligible patrons to message variants
Measure registrations, attendance, opt-outs, and complaints
Publish message text, sample size, effect estimate, and limitations
Include replication guidance

Template 2

Permit Workflow Sprint

Does a clearer checklist reduce incomplete permit applications?

Design

Compare current instructions against a revised checklist for new applicants
Measure completion rate, staff follow-up time, and processing delay
Measure applicant satisfaction
Avoid withholding required information from anyone

Template 3

Parks Usage Sprint

Which low-cost signage or programming prompt increases use of an underused public space?

Design

Rotate signage or programming prompts by location or week
Measure foot traffic, QR scans, and event participation
Check whether effects differ by neighborhood or time of day
Publish maintenance and equity findings

Public Report Format

Every experiment produces a public record.

The public should be able to understand the conclusion without trusting a black box. Every completed experiment produces a short public report in this format.

Question

What uncertainty did the institution want to reduce?

Context

Where was the experiment run, and why did it matter locally?

Design

Who or what was assigned, what varied, and over what period?

Outcomes

What was measured, including guardrails and complaints.

Result

Effect size, uncertainty, and practical interpretation.

Limitations

What the result cannot prove.

Equity check

Whether effects or burdens differed across groups.

Decision

Continue, expand, revise, stop, or replicate.

Replication notes

What another community would need to try it.

Data availability

What aggregate data, code, or templates can be shared.

Deployment Roadmap

Four phases. One pilot at a time.

Phase 0

Discovery

Months 0–3

Website
Concept memo
20 stakeholder interviews
Identify 1 pilot

✓ One signed pilot

Phase 1

Pilot

Months 3–6

Experiment charter
Implementation
Public report

✓ One measurable improvement

Phase 2

Early Network

Months 6–18

5–10 experiments
Annual report
Registry launch

✓ Evidence of replication

Phase 3

Institutionalization

Years 2–3

Nonprofit launch
Fellowship cohort
Annual conference

✓ Independent municipalities adopting methods

Ready to run your first experiment?

We are in Phase 0. One signed pilot, one public result. If you lead a library, parks department, school district, or civic agency — let’s talk.

Get in touch →Explore the registry