← All casesDAG Orchestration + Kill GatesAI

Autonomous Code Factory

Multi-agent pipeline with kill gates — Flutter app of 25-40 files in 15-30 min, 3× faster than sequential generation

At a glance

27/27 judge points — PASS 100/100, 553 tests, 0 failures

Architect & sole developer

5 мес · solo

Python
asyncio
Claude CLI
PostgreSQL
Redis Streams
FastAPI

Quality gates — critic & review thresholds

Expand the deep-dive ↓

Problem

What doesn't work

Single-call LLM code generation produces files that don't compile and aren't connected. Each error requires regenerating the entire project — expensive ($0.50–2.00) and slow (30+ min). Without quality evaluation, bad ideas pass through expensive stages.

Solution

Architectural approach

Multi-agent orchestration with built-in self-evaluation. Generation: PlannerAgent → PlanReviewPanel (4 models, ≥3/4 quorum) → DAG sort → TaskExecutor parallel → TaskReviewer retry ≤3×. Evaluation: Critic (0-10, 5 criteria) + 3 kill gates (min_critic_score, compliance_risk, cannibalization) kill weak ideas before expensive stages. Judge (26 checkpoints, 100 points) verifies final quality.

My role & contribution

Architect & sole developer

Designed and built from scratch: PlannerAgent, DAG scheduler with topological sort, TaskExecutor/TaskReviewer, PlanReviewPanel (4 models), kill gates in orchestrator, Redis pub/sub for real-time progress. 11 microservices, 553 tests. Entire codebase is my project.

Autonomous Code Factory

What doesn't work

Architectural approach

Ready to discuss?