Floor Agents #

In DevelopmentUpdated Apr 9, 2026

Your AI dev team. Plugs into your tools. Ships code while you sleep.

Overview #

Floor Agents is an autonomous AI engineering platform that gives software teams specialized AI agents capable of decomposing tasks, writing code, creating pull requests, reviewing code, and writing tests. It integrates with your existing tools — GitHub, Linear, and any LLM provider.

How It Works #

A configurable team of AI agents collaborates through a structured workflow:

Agent	Role
PM	Decomposes tasks into sub-issues
Backend Dev	Writes backend code, creates PRs
Frontend Dev	Writes frontend code, creates PRs
CTO	Reviews PRs, approves or requests changes
QA	Writes tests, validates functionality

The workflow follows a state machine:

Backlog → Triage → In Progress → In Review → QA → Done
                        ↑               │
                        └── Changes Requested (max 3 cycles → escalate to human)

Every PR is reviewed by the CTO agent. If changes are needed, the dev agent revises — up to 3 cycles before escalating to a human. All work is visible in your issue tracker and PRs. No black box.

Tool Agnostic #

Floor Agents connects to the tools you already use:

Git platforms — GitHub (GitLab, Bitbucket planned)
Task managers — Linear, GitHub Issues (Jira planned)
LLM providers — Claude (Anthropic), OpenAI, Google Gemini, LM Studio, Ollama, Groq, Together

Use cheap local models for routine dev work, expensive cloud models for critical reviews. Mix and match per agent.

Architecture #

Built as a Bun monorepo with 10 packages:

@floor-agents/core — Types, config loader, YAML validation
@floor-agents/orchestrator — Main loop, state machine, guardrails, cost tracking
@floor-agents/context-builder — File selection via import tracing, prompt rendering, token budgets
@floor-agents/anthropic — Claude API adapter
@floor-agents/claude-code — Claude Code CLI adapter (native worktree execution)
@floor-agents/openai — OpenAI-compatible adapter
@floor-agents/gemini — Google Gemini adapter
@floor-agents/lmstudio — Local model adapter
@floor-agents/github — GitHub REST API (branches, commits, PRs)
@floor-agents/task — Task manager adapters

Key Design Decisions #

Config-driven — A single YAML file defines team composition, models, guardrails, cost limits, and workflow states
Native worktree execution — Dev agents spawn Claude Code CLI directly in git worktrees with full file access, not just tool-use
Crash-recoverable — 10-step execution state machine with file-based persistence. Every step is idempotent
Cost controls — Per-task and per-day spending limits. Local models report $0
Guardrails — File count/size limits, blocked paths (.env, .pem, CI configs), path traversal detection
Pluggable adapters — Task managers, git platforms, and LLM providers all behind interfaces. Add new integrations without touching core logic

Tech Stack #


Language	TypeScript (strict)
Runtime	Bun
Config	YAML
State	File-based JSON (PostgreSQL planned)
Tests	116 tests across 22 files