Floor Agents
Your AI dev team. Plugs into your tools. Ships code while you sleep.
Overview
Floor Agents is an autonomous AI engineering platform that gives software teams specialized AI agents capable of decomposing tasks, writing code, creating pull requests, reviewing code, and writing tests. It integrates with your existing tools: GitHub, Linear or GitHub Issues, and a choice of cloud or local LLM providers.
How It Works
A configurable team of AI agents collaborates through a structured workflow:
| Agent | Role |
|---|---|
| PM | Decomposes tasks into sub-issues |
| Backend Dev | Writes backend code, creates PRs |
| Frontend Dev | Writes frontend code, creates PRs |
| CTO | Reviews PRs, approves or requests changes |
| QA | Writes tests, validates functionality |
The workflow follows a state machine:

    Backlog → Triage → In Progress → In Review → QA → Done
                            ↑            │
                            └── Changes Requested (max 3 cycles → escalate to human)

Every PR is reviewed by the CTO agent. If changes are requested, the dev agent revises the PR, for up to 3 review cycles; after that, the task is escalated to a human. All work is visible in your issue tracker and PRs. No black box.
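The review loop can be pictured as a small transition function. The following is a minimal sketch, not the project's actual orchestrator code: the `Task` shape, the `advance` function, and the `Escalated` state are hypothetical names, and it assumes "up to 3 cycles" means the dev agent may revise three times before the next change request escalates.

```typescript
// Workflow states from the diagram, plus a hypothetical "Escalated"
// state standing in for the hand-off to a human.
type State =
  | "Backlog" | "Triage" | "In Progress" | "In Review" | "QA" | "Done"
  | "Escalated";

type ReviewOutcome = "approved" | "changes_requested";

const MAX_REVIEW_CYCLES = 3; // from the text: up to 3 cycles before escalation

interface Task {
  state: State;
  reviewCycles: number; // revisions already requested by the reviewer
}

// Advance a task by one transition. `outcome` is only used while In Review.
function advance(task: Task, outcome?: ReviewOutcome): Task {
  switch (task.state) {
    case "Backlog": return { ...task, state: "Triage" };
    case "Triage": return { ...task, state: "In Progress" };
    case "In Progress": return { ...task, state: "In Review" };
    case "In Review": {
      if (outcome === undefined) throw new Error("review outcome required");
      if (outcome === "approved") return { ...task, state: "QA" };
      // Assumption: three revisions are allowed; a fourth request escalates.
      if (task.reviewCycles >= MAX_REVIEW_CYCLES) {
        return { ...task, state: "Escalated" };
      }
      return { state: "In Progress", reviewCycles: task.reviewCycles + 1 };
    }
    case "QA": return { ...task, state: "Done" };
    case "Done":
    case "Escalated": return task; // terminal states
  }
}
```

Keeping the cycle counter on the task itself means the escalation decision depends only on persisted task data, not on in-memory agent state.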
Tool Agnostic
Floor Agents connects to the tools you already use:
- Git platforms — GitHub (GitLab, Bitbucket planned)
- Task managers — Linear, GitHub Issues (Jira planned)
- LLM providers — Claude (Anthropic), OpenAI, Google Gemini, LM Studio, Ollama, Groq, Together
Use cheap local models for routine dev work, expensive cloud models for critical reviews. Mix and match per agent.
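As an illustration of per-agent model selection, the sketch below maps each agent role to a model and estimates spend. Everything in it is an assumption made for the example: the `ModelChoice` shape, the model identifiers, and the per-token rates are placeholders, not the project's real YAML schema or any provider's real pricing.

```typescript
type AgentRole = "pm" | "backend-dev" | "frontend-dev" | "cto" | "qa";

interface ModelChoice {
  provider: string;            // a provider from the list above
  model: string;               // placeholder identifier, not a real model name
  usdPerMillionTokens: number; // placeholder rate; 0 for local models
}

// Hypothetical choices: a free local model and a paid cloud model.
const localCoder: ModelChoice = {
  provider: "ollama", model: "local-coder", usdPerMillionTokens: 0,
};
const cloudReviewer: ModelChoice = {
  provider: "anthropic", model: "cloud-reviewer", usdPerMillionTokens: 10,
};

// Routine work goes to the local model; reviews go to the cloud model.
const team: Record<AgentRole, ModelChoice> = {
  "pm": localCoder,
  "backend-dev": localCoder,
  "frontend-dev": localCoder,
  "cto": cloudReviewer,
  "qa": localCoder,
};

// Estimated spend for one agent call, given a token count.
function estimateCostUsd(role: AgentRole, tokens: number): number {
  return (tokens / 1_000_000) * team[role].usdPerMillionTokens;
}
```

With these placeholder rates, only calls made by the `cto` role would count toward a spending limit.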
Architecture
Built as a Bun monorepo with 10 packages:
- @floor-agents/core — Types, config loader, YAML validation
- @floor-agents/orchestrator — Main loop, state machine, guardrails, cost tracking
- @floor-agents/context-builder — File selection via import tracing, prompt rendering, token budgets
- @floor-agents/anthropic — Claude API adapter
- @floor-agents/claude-code — Claude Code CLI adapter (native worktree execution)
- @floor-agents/openai — OpenAI-compatible adapter
- @floor-agents/gemini — Google Gemini adapter
- @floor-agents/lmstudio — Local model adapter
- @floor-agents/github — GitHub REST API (branches, commits, PRs)
- @floor-agents/task — Task manager adapters
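Several of the packages above are adapters for different LLM providers. One way such adapters could share a common shape is sketched below. The `LlmAdapter` interface, the request and result types, the registry functions, and the `EchoAdapter` stand-in are all hypothetical and are not taken from the project's source.

```typescript
interface CompletionRequest {
  system: string;
  prompt: string;
  maxTokens: number;
}

interface CompletionResult {
  text: string;
  inputTokens: number;
  outputTokens: number;
  costUsd: number;
}

// Hypothetical contract that each provider package would implement.
interface LlmAdapter {
  readonly name: string;
  complete(req: CompletionRequest): Promise<CompletionResult>;
}

// Stand-in adapter that makes no network calls; it echoes the prompt
// and reports zero cost, as a local model would.
class EchoAdapter implements LlmAdapter {
  readonly name = "echo";
  async complete(req: CompletionRequest): Promise<CompletionResult> {
    const words = req.prompt.split(/\s+/).filter(Boolean).length;
    return { text: `echo: ${req.prompt}`, inputTokens: words, outputTokens: words, costUsd: 0 };
  }
}

const adapters = new Map<string, LlmAdapter>();

function registerAdapter(adapter: LlmAdapter): void {
  adapters.set(adapter.name, adapter);
}

function getAdapter(adapterName: string): LlmAdapter {
  const adapter = adapters.get(adapterName);
  if (!adapter) throw new Error(`unknown adapter: ${adapterName}`);
  return adapter;
}

registerAdapter(new EchoAdapter());
```

Because callers only depend on the interface, a new provider can be added by registering another implementation without changing the calling code.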
Key Design Decisions
- Config-driven — A single YAML file defines team composition, models, guardrails, cost limits, and workflow states
- Native worktree execution — Dev agents spawn Claude Code CLI directly in git worktrees with full file access, not just tool-use
- Crash-recoverable — 10-step execution state machine with file-based persistence. Every step is idempotent
- Cost controls — Per-task and per-day spending limits. Local models report $0
- Guardrails — File count/size limits, blocked paths (`.env`, `.pem`, CI configs), path traversal detection
- Pluggable adapters — Task managers, git platforms, and LLM providers all behind interfaces. Add new integrations without touching core logic
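The guardrails in the list above can be illustrated with a small validation pass over an agent's proposed file changes. This is a sketch under stated assumptions: the `GuardrailConfig` shape, the numeric limits, the function names, and the exact patterns (including `.github/workflows/` as an example of a CI config path) are hypothetical.

```typescript
interface GuardrailConfig {
  maxFiles: number;
  maxFileBytes: number;
  blockedPatterns: RegExp[];
}

interface FileChange {
  path: string;  // repo-relative path proposed by an agent
  bytes: number; // size of the new file content
}

// Hypothetical defaults; real limits would come from the YAML config.
const defaultGuardrails: GuardrailConfig = {
  maxFiles: 20,
  maxFileBytes: 100_000,
  blockedPatterns: [
    /(^|\/)\.env(\..*)?$/,    // .env and variants such as .env.local
    /\.pem$/,                 // private keys and certificates
    /^\.github\/workflows\//, // one example of a CI config location
  ],
};

// Normalize a repo-relative path; return null if it is absolute
// or climbs above the repository root (path traversal).
function normalizeRepoPath(input: string): string | null {
  const unified = input.replace(/\\/g, "/");
  if (unified.startsWith("/") || /^[A-Za-z]:\//.test(unified)) return null;
  const parts: string[] = [];
  for (const segment of unified.split("/")) {
    if (segment === "" || segment === ".") continue;
    if (segment === "..") {
      if (parts.length === 0) return null;
      parts.pop();
    } else {
      parts.push(segment);
    }
  }
  return parts.join("/");
}

// Return a list of human-readable violations; empty means the change set passes.
function checkChanges(changes: FileChange[], cfg: GuardrailConfig): string[] {
  const violations: string[] = [];
  if (changes.length > cfg.maxFiles) {
    violations.push(`too many files: ${changes.length} > ${cfg.maxFiles}`);
  }
  for (const change of changes) {
    const path = normalizeRepoPath(change.path);
    if (path === null) {
      violations.push(`path escapes repository: ${change.path}`);
      continue;
    }
    if (change.bytes > cfg.maxFileBytes) violations.push(`file too large: ${path}`);
    if (cfg.blockedPatterns.some((re) => re.test(path))) violations.push(`blocked path: ${path}`);
  }
  return violations;
}
```

Normalizing before matching matters: a path such as `src/../.env` only hits the blocked pattern once the `..` segment has been resolved.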
Tech Stack
| Component | Choice |
|---|---|
| Language | TypeScript (strict) |
| Runtime | Bun |
| Config | YAML |
| State | File-based JSON (PostgreSQL planned) |
| Tests | 116 tests across 22 files |
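To show how file-based JSON state and idempotent steps can combine into crash recovery, here is a minimal sketch. It is not the project's 10-step state machine: the step names, the `ExecutionState` shape, and the helper functions are hypothetical, and only standard `node:fs`, `node:os`, and `node:path` calls are used.

```typescript
import { existsSync, mkdtempSync, readFileSync, renameSync, writeFileSync } from "node:fs";
import { tmpdir } from "node:os";
import { join } from "node:path";

interface ExecutionState {
  taskId: string;
  completedSteps: string[]; // names of steps that have finished
}

type Step = { name: string; run: () => void };

function loadState(file: string, taskId: string): ExecutionState {
  if (!existsSync(file)) return { taskId, completedSteps: [] };
  return JSON.parse(readFileSync(file, "utf8")) as ExecutionState;
}

// Write to a temp file, then rename, so a crash mid-write
// does not leave a half-written state file behind.
function saveState(file: string, state: ExecutionState): void {
  const tmp = `${file}.tmp`;
  writeFileSync(tmp, JSON.stringify(state, null, 2));
  renameSync(tmp, file);
}

// Run steps in order, skipping any already recorded as complete.
// A crash after run() but before saveState() repeats that step on
// the next start, which is why each step has to be idempotent.
function runSteps(file: string, taskId: string, steps: Step[]): void {
  const state = loadState(file, taskId);
  for (const step of steps) {
    if (state.completedSteps.includes(step.name)) continue;
    step.run();
    state.completedSteps.push(step.name);
    saveState(file, state);
  }
}

// Demo with hypothetical step names and one simulated crash.
const stateFile = join(mkdtempSync(join(tmpdir(), "floor-agents-")), "task-1.json");
const log: string[] = [];
let crashOnce = true;
const steps: Step[] = [
  { name: "create-branch", run: () => { log.push("create-branch"); } },
  { name: "commit", run: () => {
      if (crashOnce) { crashOnce = false; throw new Error("simulated crash"); }
      log.push("commit");
    } },
  { name: "open-pr", run: () => { log.push("open-pr"); } },
];

try { runSteps(stateFile, "task-1", steps); } catch { /* simulated crash */ }
runSteps(stateFile, "task-1", steps); // resumes at "commit"
```

In the demo, the second call skips `create-branch` because the state file already records it, then finishes the remaining two steps.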