Evaluation
Benchmarks agent performance over time and across all reviews: PRs reviewed, accepted suggestions, and rejection rate, plus accepted suggestions by agent type and an acceptance-rate trend.
Product
| Updated | Baz Reviewer | Agents | User name | User interaction |
|---|---|---|---|---|
| 51 minutes ago | Removing `lightColor`/`darkColor` from the desktop `RecoloredMark` caus… | Logical Bugs | --- | Lukas Brandt / Unaddressed Baz Suggestion |
| 1 hour ago | `new Date(value).toLocaleDateString("en-US", { month: "short", day: "num… | Logical Bugs | baz-reviewer… | Sofia Keller / Addressed Baz Suggestion |
| 1 hour ago | `normalizeDate()` only trims/slices frontmatter, so `resource.publishedAt` … | Type Inconsisten… | --- | Lukas Brandt / Unaddressed Baz Suggestion |
| 1 hour ago | `employmentType` is built from freeform `Job.employmentType` via `toUp… | Type Inconsisten… | --- | Lukas Brandt / Unaddressed Baz Suggestion |
| 1 hour ago | `breadcrumbSchema` is built inline in both the docs page and `src/app/res… | Code Dedup and… | --- | Lukas Brandt / Unaddressed Baz Suggestion |
| 1 hour ago | `leadImage()` dereferences `image.src` without validating the media entry,… | Type Inconsisten… | --- | Lukas Brandt / Unaddressed Baz Suggestion |
| 1 hour ago | Lines 214–225 duplicate the `Finding` mapping in `findings/fetchers/ai_fin… | Code Dedup and… | baz-reviewer… | Marco Rossi / Unaddressed Baz Suggestion |
| 1 hour ago | `impact` is inserted into `### {single_line_impact}` after only `\n`/`\r` norm… | Basic Security P… | --- | Marco Rossi / Addressed Baz Suggestion |
| 2 hours ago | `CombineSuggestionsRequestItem`/`CombineSuggestionsResultItem` dro… | Logical Bugs | baz-reviewer… | Marco Rossi / Addressed Baz Suggestion |
| 2 hours ago | `getCommentExcerpt` passes raw HTML/markdown into `CommentRender… | Basic Security P… | baz-reviewer… | Anna Novak / Unaddressed Baz Suggestion |
What's inside
Filter every view by agent, user, and time period to focus on the slices of data that matter to your team.
Benchmarks agent performance over time and across all reviews: PRs reviewed, accepted suggestions, and rejection rate, plus accepted suggestions by agent type and an acceptance-rate trend.
A detailed table of every agent - accepted suggestions, unaddressed suggestions, and rejection rate - making it easy to see which agents add the most value and where practices may need adjusting.
Logs each user interaction with a Baz reviewer comment - the user, their reply or reaction, the original comment, the reviewer title, and a timestamp - so you can see which reviewers are useful and where confusion arises.
A consolidated view of how Baz influences the stability and reliability of your codebase, opening with a carousel of notable bugs Baz surfaced paired with user reactions.
Tracks code-correctness bugs detected by the Logical Bugs, Breaking Changes, and Type Inconsistency reviewers against bugs reported after release in your ticketing system, giving you a read on post-release quality.
Median time for a human reviewer to first interact with a PR, and median time a PR spends from creation to merge - surfacing review responsiveness, cycle time, and delivery velocity.
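To make the arithmetic behind these numbers concrete, here is a minimal sketch of how the two medians and a simple acceptance share could be computed from raw review data. It is an illustration under stated assumptions, not how Baz computes its dashboards: the `PullRequest` record, its field names, and the `acceptance_share` definition (addressed suggestions divided by addressed plus unaddressed) are all hypothetical, and Baz's own definitions of acceptance and rejection rate are not given on this page.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import median
from typing import Optional


@dataclass
class PullRequest:
    # Hypothetical record shape for illustration; not Baz's actual schema.
    created_at: datetime
    first_human_interaction_at: Optional[datetime] = None  # None: no human interaction yet
    merged_at: Optional[datetime] = None  # None: not merged


def _median_delta(deltas: list[timedelta]) -> Optional[timedelta]:
    """Median of a list of durations, or None when the list is empty."""
    if not deltas:
        return None
    return timedelta(seconds=median([d.total_seconds() for d in deltas]))


def median_time_to_first_interaction(prs: list[PullRequest]) -> Optional[timedelta]:
    """Median of (first human interaction - creation), skipping PRs nobody has touched."""
    return _median_delta([
        pr.first_human_interaction_at - pr.created_at
        for pr in prs
        if pr.first_human_interaction_at is not None
    ])


def median_time_to_merge(prs: list[PullRequest]) -> Optional[timedelta]:
    """Median of (merge - creation), skipping PRs that are not merged."""
    return _median_delta([
        pr.merged_at - pr.created_at
        for pr in prs
        if pr.merged_at is not None
    ])


def acceptance_share(addressed: int, unaddressed: int) -> Optional[float]:
    """Assumed definition: addressed / (addressed + unaddressed); None if there are no suggestions."""
    total = addressed + unaddressed
    return addressed / total if total else None
```

Under that assumed definition, the ten sample rows in the table above (three addressed, seven unaddressed) would give `acceptance_share(3, 7) == 0.3`. Medians are used rather than means so that a single long-lived PR does not dominate the figure.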
Get started
No setup required - metrics are computed automatically from review activity. Sign up to see your own numbers, or reach out for a guided read of the dashboards.