Spec Reviewer Agent
Validates code against product requirements, designs, and expected behavior - catching gaps and deviations before they ship.
Baz reviews the plan before the code exists, and turns the pull request into a confirmation.
AI coding is foundational to how software gets built, and teams need a new verification layer - Baz is defining it.
Baz is more contextual, scalable, and dependable - refreshing for teams shipping AI-native software.
Planner
Post-coding review catches mistakes after you’ve paid to make them. Planner moves the review to the plan: drafted from the ticket, checked against your architecture, gated by risk, and approved before an agent writes code. Teams report up to 65% less rework after merge, measured by reverts and hotfixes.
event_ingested_at (processing time) onto every row in stg_orders.fct_orders incremental model off event_ingested_at with a 3-day lookback.--full-refresh in prod.event_time silently drops late events - they land after the watermark. Switched the predicate to event_ingested_at so no rows are lost, and the lookback re-merges late arrivals idempotently.Agents
Whatever still reaches a pull request gets a final check from purpose-built agents with full context: behavior, requirements, APIs, architecture, production systems, and security.
Validates code against product requirements, designs, and expected behavior - catching gaps and deviations before they ship.
Reasons across auth and network boundaries, infrastructure, pipelines, and application code to uncover vulnerabilities.
Correlates repository changes with production telemetry to identify reliability, performance, and observability risks - then proposes fixes.
Automatically applies and validates safe code changes in an isolated runtime, turning review feedback into tested commits.
What's new
At AI Engineer World's Fair, Baz announced Baz Planner - a gateway that eliminates entire classes of bugs and vulnerabilities in code planning - alongside $9M in new funding, bringing total raised to $17M.
How Baz uses Datadog LLM Observability to run autonomous agents safely in production - with end-to-end trace coverage, human-in-the-loop validation, and ~80% faster root cause analysis.
Six classes of vulnerability slipped past traditional tooling for years. A study of 28 high-confidence findings from advanced security review shows why cyber-capable models now catch them at review time - before they ship.
Session Logs give every Baz run a structured execution record. They show how a run was triggered, where it executed, its current state, major completed stages, outcome, and cost. Instead of exposing raw internal logs, Baz presents a timeline of product events so teams can inspect reviewer runs, fixer runs, scheduled scans, benchmark runs, and future Codex or Claude Code workflows from one place.
We're hiring
Help us build the platform that reviews the plan, not just the code.

Get started
Start free on a real repository, or talk to us and we’ll run Baz against a change from your backlog.