Skip to content

Health

Suggest Edits

What’s each agent costing you? Which tool got hammered today? Whose context window is about to wrap? Whose dispatches are silently failing? Your runtime keeps the receipts but it doesn’t read them back to you. This dashboard does.

Live token counts, session costs, MCP and REST volumes, error rates, plugin status, search-engine state, the full diagnostic sweep. Without Bakin you’d be grepping JSONL transcripts to answer any of it.

The health dashboard: cost and context cards, usage tabs, doctor results, system status, all in one feed.

Two cards anchor the top of the dashboard side by side. They cover the questions that pile up fastest when you’ve got a roster running: what each agent is costing you, and whose context is about to overflow. Glance at them on the way past, look closer when something looks off.

Context usage on the left (tokens in each agent's latest session), session cost on the right with the day's running total.
CardWhat it answers
Context UsageTotal tokens in each agent’s latest session, bar-charted. Spot who’s pushing the model’s window before they hit it.
Session CostInput, output, cache-read, cache-write, and total per agent with the day’s running total at the top. Pulled from the runtime’s posted rates, directional not invoice-grade. Answers “is Roscoe’s day eating my budget?” without making you go ask the gateway.

Three tabs sit below the cost cards, all feeding from the same in-memory recorder. Same activity, sliced three ways. When something looks off in the system, this is usually the first place you’ll see it.

The usage panel: tool, endpoint, and agent tabs, windowed to 5m, 1h, or 24h.
TabWhat it counts
Tool UsageEvery MCP exec tool call, by name (e.g. bakin_exec_tasks_create). Count + error rate per window.
Endpoint UsageEvery REST endpoint hit, by path. Count + error rate.
Agent UsageCalls + errors per agent. Quick read on who’s busy and who’s stuck.

A green-light scan of every moving part in the stack. Agent roster, runtime adapter, search adapter, taskboard, assets, channel approvals, restart-recovery candidates, the works. Red means broken, yellow means drifting, and most rows have a one-click auto-fix so you don’t have to know what went wrong to fix it.

Run it before you start the day or any time something feels off. Results cache so the dashboard reads fast; the refresh button (or bakin doctor from the CLI) forces a fresh sweep.

Live state of the Bakin process and what it’s connected to:

SectionWhat’s there
Server statsPort, PID, memory in use, uptime, node version.
MCP sessionsAgents currently connected, open session count per agent, when they connected.
Plugin registryEvery plugin loaded, with route count and source (built-in vs user-installed).
Search engineAntfly status and row counts per bakin_* table.

Quick sanity check for “did everything actually start up?”

Any plugin can register a health check that surfaces here alongside the built-ins. It picks up the same color coding (red / yellow / green) and the same auto-fix scaffolding. If a plugin owns external state worth watching (an API key, a queue, a cache, a daemon), wire a check.

SettingTypeDefaultWhat it does
Refresh interval (seconds)number30How often to poll for updated metrics
Detailed metricsbooleantrueShow per-plugin and per-tool breakdowns
CommandPurpose
bakin statusShow dispatch and server status.
bakin doctorRun health checks.

Full surface in the CLI reference.

HTTP API surface for this plugin: see the API reference.

Agents can self-check via MCP exec tools.

  • bakin_exec_health_doctor: Run system diagnostics (agent roster, skill sync, runtime, taskboard, assets, etc.). Returns detailed check results. Use fresh=true to force a full re-check instead of returning cached results.
  • bakin_exec_health_status: Get a quick system health summary — uptime, memory, active MCP sessions, and doctor error/warning counts. Useful for checking system state before starting work.

Full schemas in the Exec tools reference.

  • Models: pick cheaper models for the agents that show up loudest in Cost
  • Essentials → System Status: the always-on dot in the header is backed by these checks
  • Daily Operation: start, stop, restart, update, the lifecycle commands that affect health