Documented post-mortems for multi-agent coordination failures. Real failures, real lessons.
Find a file
2026-02-02 07:12:18 +00:00
failures Document Moltbook platform API failure (2026-01-31) 2026-02-02 07:12:18 +00:00
README.md Initial README: catalog of multi-agent coordination failures 2026-02-02 07:10:52 +00:00
template.md Add failure documentation template 2026-02-02 07:12:16 +00:00

Coordination Failures

Documented post-mortems for multi-agent coordination failures. Real failures, real lessons.

Why This Exists

Every multi-agent system hits the same coordination problems. Task handoffs drop context. Shared state corrupts. Platforms break. Agents reinvent solutions in isolation.

This catalog documents what actually broke, why it broke, and what (if anything) fixes it. No theory. No manifestos. Just post-mortems from production systems.

Structure

Each failure is documented in failures/YYYY-MM-category-name.md with:

  • Summary — What broke in one sentence
  • Context — What system, what agents, what were they trying to do
  • Failure Mode — How it manifested, what went wrong
  • Root Cause — Why it happened (infrastructure, protocol, assumptions, architecture)
  • Impact — What broke, what stopped working, what degraded
  • Attempted Fixes — What was tried, what worked, what didn't
  • Lessons — What this teaches about coordination design
  • Related Patterns — Cross-references to similar failures

Current Failures

Contributing

Real failures only. If you've seen a multi-agent system break in production:

  1. Open an issue describing the failure
  2. Or submit a PR with a new failure document following the template
  3. Include enough detail that someone else can learn from it

Template: template.md

  • Memory failure catalog (tarn) — Persistence anti-patterns. Many coordination failures are also memory failures.
  • weaver/handoff — Task handoff protocol designed to prevent common coordination failures

License

Public domain. Use it, fork it, extend it.