Open source · Tool-agnostic · Production-ready · L3 dogfood · 7 patterns · Interactive tools

Stop prompting.
Design the loop.
Get a score.

Run one command in any repo. @cobusgreyling/loop is the front door (init + doctor). loop-init still works. Prints your Loop Ready score plus the first run line.

terminal — try now

npx @cobusgreyling/loop init .
npx @cobusgreyling/loop doctor .

Try in 60 seconds → Quickstart Read the Essay Goal Engineering

"You shouldn't be prompting coding agents anymore. You should be designing loops that prompt your agents."

— Peter Steinberger

"I don't prompt Claude anymore. I have loops running that prompt Claude. My job is to write loops."

— Boris Cherny, Head of Claude Code

"Build the loop. But build it like someone who intends to stay the engineer, not just the person who presses go."

— Addy Osmani

Anatomy

What one loop looks like

You design it once. You're not prompting every micro-step.

Anatomy of a loop — schedule, triage, state, worktree, implementer, verifier, MCP, human gate

⏱ Schedule → 👁 Triage skill → 📋 STATE.md → 🌲 Worktree → ⚙️ Implementer → ✓ Verifier → 🔗 MCP / PR → 🧑 Human gate

Building blocks

Five primitives + memory

The same shape works in Grok, Claude Code, Codex, OpenClaw, Opencode, and Actions. Tool names differ; capabilities converge.

⏱

Scheduling

/loop, cron, automations

🌲

Worktrees

Parallel without collisions

📚

Skills

Intent written once

🔌

Connectors

MCP → real tools

🔀

Sub-agents

Maker / checker split

💾

State

Memory outside the model

Grok Build Claude Code Codex OpenClaw Opencode GitHub Actions

Full cross-tool matrix →

Deeper architecture

How one run actually moves

The actor-level sequence within a run, the states it passes through, how the three autonomy levels relate, and how the tools in this repo map onto the primitives above.

Sequence

One loop cycle

sequenceDiagram
    actor Eng as Engineer
    participant Sched as Scheduler
    participant Skill as Triage Skill
    participant State as STATE.md / Memory
    participant WT as Worktree
    participant Maker as Implementer Agent
    participant Check as Verifier Agent
    participant MCP as MCP / Git / Tickets
    participant Gate as Human Gate

    Sched->>Skill: Fire pattern (e.g. daily-triage)
    Skill->>State: Read goals + last run + budget
    State-->>Skill: Context snapshot
    Skill->>WT: Create isolated worktree
    WT-->>Skill: worktree path
    Skill->>Maker: Task with skills + constraints
    Maker->>Maker: Implement change in worktree
    Maker->>Check: Hand off patch
    Check->>Check: Run tests + policy gates

    alt Verify pass and budget OK
        Check->>MCP: Open PR or update ticket
        MCP-->>Gate: PR link + summary
        Gate->>Gate: Safe and allowlisted?
        alt Safe auto-path
            Gate->>MCP: Approve merge or leave L1 report
            MCP->>State: Write run outcome
        else Risky or ambiguous
            Gate->>Eng: Escalate with full context
            Eng->>MCP: Decide / override
            MCP->>State: Write decision + outcome
        end
    else Verify fail or budget exceeded
        Check->>State: Log failure mode
        Check->>Eng: Report only, no auto-merge
    end

State

Run lifecycle

stateDiagram-v2
    [*] --> Scheduled
    Scheduled --> LoadingContext: cadence fire
    LoadingContext --> BlockedBudget: budget exceeded
    LoadingContext --> RunningTriage: context OK
    RunningTriage --> WorkingInWorktree: task selected
    RunningTriage --> IdleNoop: nothing to do
    WorkingInWorktree --> Verifying: implementer done
    Verifying --> AwaitingHumanGate: verify pass + risky
    Verifying --> Failed: verify fail
    AwaitingHumanGate --> Applied: human or allowlist approve
    AwaitingHumanGate --> Rejected: human reject
    Applied --> Logged
    Failed --> Logged
    Rejected --> Logged
    BlockedBudget --> Logged
    IdleNoop --> Logged
    Logged --> [*]

State

Autonomy levels L1-L3

stateDiagram-v2
    [*] --> L1_ReportOnly
    L1_ReportOnly --> L2_Assisted: audit score up + human OK
    L2_Assisted --> L3_Unattended: denylist + budget + gates proven
    L3_Unattended --> L2_Assisted: incident or cost spike
    L2_Assisted --> L1_ReportOnly: kill switch
    L3_Unattended --> L1_ReportOnly: kill switch

Architecture

Stack: primitives + tools

flowchart TB
    subgraph Human["Engineer / Owner"]
        Design[Design LOOP.md patterns]
        Gate[Human gate + kill switch]
        Review[Read PRs + reports]
    end

    subgraph Control["Control plane"]
        Sched[Automations / Scheduling]
        Patterns[patterns/registry.yaml]
        Cost[loop-cost: estimate spend]
    end

    subgraph Memory["Durable memory"]
        State[STATE.md]
        LoopDoc[LOOP.md]
        Context[loop-context: prune, inject, circuit breaker]
        Sync[loop-sync]
    end

    subgraph Execution["Execution plane"]
        WT[loop-worktree: isolated attempts + locks]
        Maker[Implementer sub-agent]
        Verifier[Verifier sub-agent]
    end

    subgraph Tooling["CLIs + connectors"]
        Init[loop init: scaffold]
        Audit[loop-audit: readiness score]
        MCP[MCP server: read-only policy + tools]
    end

    Design --> Patterns
    Init --> State
    Init --> LoopDoc
    Sched --> Patterns
    Patterns --> Cost
    Cost --> Context
    Patterns --> WT
    State <--> Sync
    WT --> Maker
    Maker --> Verifier
    Verifier --> Context
    Context --> Gate
    Verifier --> MCP
    Gate --> Review
    Audit --> Design

Copy-friendly Mermaid source →

Copy & run

Production patterns

Documented loops with scheduling, skills, state schemas, verification, and honest failure modes.

Seven production loop patterns — cadence and token cost overview

PR Babysitter

Shepherd PRs through review, CI, rebase, and merge. Human stays in the judgment seat.

⏱ 5–15m ◆ Medium risk

Read pattern → Starter →

Start here

Daily Triage

Morning scan of CI, issues, and commits. Report-only week one, then small auto-wins.

⏱ 1d–2h ◆ Low risk

Read pattern → Starter →

High frequency

CI Sweeper

React to failing checks with minimal fixes. Classify flakes. Escalate after 3 attempts.

⏱ 5–15m ◆ Medium risk

Read pattern → Starter →

Off-peak

Post-Merge Cleanup

TODOs, deprecations, and tech debt after merges. Small PRs overnight.

⏱ 1d–6h ◆ Low risk

Read pattern → Starter →

Security

Dependency Sweeper

Patch CVEs and stale deps in worktrees. Majors and denylist stay human-gated.

⏱ 6h–1d ◆ Medium risk

Read pattern → Starter →

Low risk

Issue Triage

Dedupe, score, and label incoming issues. Propose-only week one — pairs well with Daily Triage.

⏱ 2h–1d ◆ Low risk

Read pattern → Starter →

Low risk

Changelog Drafter

Scan merges & commits, produce polished categorized release notes drafts. Human approves before publish. Huge leverage, tiny risk.

⏱ 1d or tag ◆ Low risk

Read pattern → Starter →

Patterns

Starter kits

L0→L3

Readiness levels

100

Audit score max

5 minutes

Get started

Scaffold in your repo, audit readiness, run report-only for one week. Full quickstart guide →

terminal — minimal loop

# Scaffold starter (or copy manually)
npx @cobusgreyling/loop init . --pattern daily-triage --tool grok

# Score loop readiness (PRs get audit comments in CI)
npx @cobusgreyling/loop-audit . --suggest

# Grok — report only, week one
/loop 1d Run loop-triage. Update STATE.md. No auto-fix.

# Also try the new low-risk pattern
/loop 1d Run changelog-scan + draft-release-notes. Write RELEASE_NOTES_DRAFT.md. Human review only.

Quickstart guide · All starters · Design checklist · Production stories

Deep dives

Engineering, not hype

Failure Modes

Infinite fix loops, verifier theater, token burn — with mitigations.

Read catalog →

Safety & Guardrails

Path denylist, auto-merge policy, MCP least privilege.

Read safety doc →

loop-audit CLI

Loop Readiness Score 0–100. Know before you ship unattended.

Run audit →

Pattern Picker

CI red? PRs stalling? Morning chaos? Pick the right loop first.

Choose pattern →

Stay in control

Observability & cost

Loops should be boring and transparent. Use state files, run logs, and budgets.

STATE.md + pattern state

The durable memory spine. Every loop reads it at start and writes outcomes.

See example →

Run logs & budgets

Structured JSON per run + daily token / spawn limits. Detect problems before they explode.

See sample logs →

loop-audit on every PR

Automated Loop Readiness Score + suggestions. The reference dogfoods it.

Run it →

Ready to stop prompting?

The best loops are boring, reliable, and transparent.

Star on GitHub

Contribute a pattern

Dogfooding

Live Telemetry

Machine-readable data rendered directly from loop-run-log.md. This repo runs loops.

Loop Readiness Score

Cumulative Tokens (Estimate)

Automated vs Escalated Runs

Community

Adopters

Forks mean people are trying this. Listed projects run (or adapted) loops from this reference. Add yours via the Add Adopter issue or a PR to docs/adopters.md.

Show your Loop Ready score

Paste a badge in your README after loop-audit — social proof for your team and contributors.

npx @cobusgreyling/loop-audit . --badge

Add my project →

Your project here

Pattern · Tool · Level

One line on what worked or broke — failures are first-class.

List yours →

What one loop looks like

Five primitives + memory

Scheduling

Worktrees

Skills

Connectors

Sub-agents

State

How one run actually moves

One loop cycle

Run lifecycle

Autonomy levels L1-L3

Stack: primitives + tools

Production patterns

PR Babysitter

Daily Triage

CI Sweeper

Post-Merge Cleanup

Dependency Sweeper

Issue Triage

Changelog Drafter

Get started

Interactive Pattern Picker & Readiness Simulator

What's hurting right now?

Live Loop Readiness Simulator (mirrors loop-audit)

Engineering, not hype

Failure Modes

Safety & Guardrails

loop-audit CLI

Pattern Picker

Observability & cost

STATE.md + pattern state

Run logs & budgets

loop-audit on every PR

Ready to stop prompting?

Live Telemetry

Loop Readiness Score

Cumulative Tokens (Estimate)

Automated vs Escalated Runs

Adopters

Show your Loop Ready score

loop-engineering

loop-engineering (maintainer)

loop-engineering (maintainer)

Pluribus

Your project here