H

homelab dev

Dashboard About Install Tools Config Verify Events Perf Bench Grafana ↗

Benchmarks

AI coding agent evaluation — full LLM input/output traces with performance KPIs.

Start Benchmark Run

Agent

API Key Label

Task Description

Category

Priority

Idle Delay (seconds)

Runs

Select a run from the list to view its LLM call trace.

homelab hub Dashboard About Install Tools Config Verify Events Grafana ↗