Loop engineering moves you from "I prompt → agent responds → I prompt again" to "I design the autonomous system that discovers work, verifies results, and drives progress."
Or try instantly: npx @cobusgreyling/loop-init . --pattern daily-triage --tool grok
npx @cobusgreyling/loop-audit . --suggest
"You shouldn't be prompting coding agents anymore. You should be designing loops that prompt your agents."— Peter Steinberger
"I don't prompt Claude anymore. I have loops running that prompt Claude. My job is to write loops."— Boris Cherny, Head of Claude Code
"Build the loop. But build it like someone who intends to stay the engineer, not just the person who presses go."— Addy Osmani
Anatomy
You design it once. You're not prompting every micro-step.
Building blocks
The same shape works in Grok, Claude Code, and Codex. Tool names differ; capabilities converge.
/loop, cron, automations
Parallel without collisions
Intent written once
MCP → real tools
Maker / checker split
Memory outside the model
Copy & run
Documented loops with scheduling, skills, state schemas, verification, and honest failure modes.
PR Babysitter: Shepherd PRs through review, CI, rebase, and merge. The human stays in the judgment seat.
Morning scan of CI, issues, and commits. Report-only in week one, then small auto-wins.
React to failing checks with minimal fixes. Classify flakes. Escalate after 3 attempts.
Sweep TODOs, deprecations, and tech debt after merges. Small PRs overnight.
Patch CVEs and stale deps in worktrees. Major upgrades and anything on the denylist stay human-gated.
Scan merges and commits to produce polished, categorized release-note drafts. A human approves before publishing. High leverage, low risk.
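The CI-fixing pattern above caps retries ("Escalate after 3 attempts"). The following is a minimal sketch of that cap, not the starter's actual implementation. The names `run_fix_loop`, `apply_fix`, `checks_pass`, and `Outcome` are hypothetical, and the two callables stand in for whatever the agent and CI actually do.

```python
from dataclasses import dataclass
from typing import Callable

MAX_ATTEMPTS = 3  # taken from the pattern description: escalate after 3 attempts


@dataclass
class Outcome:
    status: str    # "fixed" or "escalated"
    attempts: int  # how many fix attempts were made


def run_fix_loop(apply_fix: Callable[[int], None],
                 checks_pass: Callable[[], bool],
                 max_attempts: int = MAX_ATTEMPTS) -> Outcome:
    """Try a minimal fix up to max_attempts times, then hand off to a human."""
    for attempt in range(1, max_attempts + 1):
        apply_fix(attempt)   # maker: propose and apply one small change
        if checks_pass():    # checker: independent verification, e.g. CI status
            return Outcome("fixed", attempt)
    # The cap is what prevents an infinite fix loop.
    return Outcome("escalated", max_attempts)


if __name__ == "__main__":
    # Simulated CI that fails once, then passes.
    results = iter([False, True])
    print(run_fix_loop(lambda n: None, lambda: next(results)))
```

Keeping the checker as a separate callable mirrors the maker / checker split: the code that proposes a fix never decides whether the fix worked.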
5 minutes
Clone a starter, audit your project, run report-only for one week.
# Scaffold starter (or copy manually)
npx @cobusgreyling/loop-init . --pattern daily-triage --tool grok

# Score loop readiness (PRs get audit comments in CI)
npx @cobusgreyling/loop-audit . --suggest

# Grok, report only, week one
/loop 1d Run loop-triage. Update STATE.md. No auto-fix.

# Also try the new low-risk pattern
/loop 1d Run changelog-scan + draft-release-notes. Write RELEASE_NOTES_DRAFT.md. Human review only.
Play with it
Pick your pain. See the exact loop + commands. Simulate your score live.
Check the boxes you already have (or plan to add). Score updates live.
loop-audit v1.4 scoring. L3 requires verifier + state + cost observability + proven activity.
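The L3 requirement above can be read as a simple gate: all four signals must be present. The sketch below illustrates that reading only. It is not loop-audit's scoring code, and the names `LoopSignals`, `meets_l3`, and `missing_for_l3` are hypothetical.

```python
from dataclasses import asdict, dataclass
from typing import List


@dataclass(frozen=True)
class LoopSignals:
    """The four signals the page lists as required for L3 (field names are assumed)."""
    has_verifier: bool
    has_state: bool
    has_cost_observability: bool
    has_proven_activity: bool


def missing_for_l3(signals: LoopSignals) -> List[str]:
    """Return the names of the required signals that are absent."""
    return [name for name, present in asdict(signals).items() if not present]


def meets_l3(signals: LoopSignals) -> bool:
    """L3 is treated here as all-or-nothing: no missing signals."""
    return not missing_for_l3(signals)


if __name__ == "__main__":
    s = LoopSignals(True, True, False, True)
    print(meets_l3(s), missing_for_l3(s))
```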
Deep dives
Infinite fix loops, verifier theater, and token burn, each with mitigations.
Path denylist, auto-merge policy, MCP least privilege.
Loop Readiness Score 0–100. Know before you ship unattended.
CI red? PRs stalling? Morning chaos? Pick the right loop first.
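A path denylist, as mentioned above, can gate auto-merge with a few lines of glob matching. This is a sketch under assumptions: the patterns in `DENYLIST` and the function names are hypothetical, not the project's actual policy or format.

```python
from fnmatch import fnmatchcase
from typing import Iterable, List

# Hypothetical patterns; a real project would define its own.
# Note: fnmatch's "*" also matches "/" so "infra/*" covers nested paths.
DENYLIST = [".github/workflows/*", "infra/*", "*.pem", "migrations/*"]


def blocked_paths(changed: Iterable[str],
                  denylist: Iterable[str] = DENYLIST) -> List[str]:
    """Return the changed paths that match any denylist pattern."""
    patterns = list(denylist)
    return [p for p in changed
            if any(fnmatchcase(p, pat) for pat in patterns)]


def may_auto_merge(changed: Iterable[str]) -> bool:
    """Auto-merge is allowed only when no changed path is denylisted."""
    return not blocked_paths(changed)


if __name__ == "__main__":
    diff = ["src/app.py", "infra/prod/main.tf", "certs/key.pem"]
    print(blocked_paths(diff))
    print(may_auto_merge(["src/app.py"]))
```

A change that touches any blocked path would fall back to human review rather than fail outright, which keeps the loop useful while the sensitive paths stay gated.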
Stay in control
Loops should be boring and transparent. Use state files, run logs, and budgets.
The state file (for example, STATE.md) is the durable memory spine. Every loop reads it at start and writes outcomes.
Structured JSON per run, plus daily token and spawn limits. Detect problems before they explode.
Automated Loop Readiness Score with suggestions. The reference dogfoods it.
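To make "structured JSON per run" and daily limits concrete, here is a minimal sketch. The record fields, the limit values, and the names `RunRecord`, `to_log_line`, and `within_budget` are all assumptions for illustration; the project's real log schema may differ.

```python
import json
from dataclasses import asdict, dataclass
from typing import Iterable

DAILY_TOKEN_LIMIT = 200_000  # hypothetical budget
DAILY_SPAWN_LIMIT = 20       # hypothetical budget


@dataclass
class RunRecord:
    loop: str     # which loop ran, e.g. "daily-triage"
    day: str      # ISO date of the run, e.g. "2025-01-01"
    tokens: int   # tokens consumed by this run
    spawns: int   # sub-agents spawned by this run
    outcome: str  # short result label, e.g. "report"


def to_log_line(record: RunRecord) -> str:
    """Serialize one run as a single JSON line for an append-only log."""
    return json.dumps(asdict(record), sort_keys=True)


def within_budget(records: Iterable[RunRecord], day: str,
                  token_limit: int = DAILY_TOKEN_LIMIT,
                  spawn_limit: int = DAILY_SPAWN_LIMIT) -> bool:
    """Check one day's total tokens and spawns against the limits."""
    todays = [r for r in records if r.day == day]
    return (sum(r.tokens for r in todays) <= token_limit
            and sum(r.spawns for r in todays) <= spawn_limit)


if __name__ == "__main__":
    runs = [
        RunRecord("daily-triage", "2025-01-01", 150_000, 5, "report"),
        RunRecord("changelog-scan", "2025-01-01", 60_000, 3, "draft"),
    ]
    print(to_log_line(runs[0]))
    print(within_budget(runs, "2025-01-01"))
```

A loop would call `within_budget` before starting work and stop early when it returns `False`, so overspending shows up as a skipped run in the log instead of a surprise bill.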
The best loops are boring, reliable, and transparent.