CLI

`llmprof up`

Start the profiling proxy and dashboard.

llmprof up [--host HOST] [--port PORT] [--upstream URL]

--host (default 127.0.0.1) - bind address.
--port (default 4000) - bind port. If the port is taken, llmprof tells you and exits, so you can pick another (--port 4001).
--upstream (default OpenAI) - the provider to forward to.

Each flag also reads from its LLMPROF_* environment variable. See Configuration.

On startup it prints the base URL to point your client at, and the dashboard URL.

`llmprof traces`

Print recent captured calls as a table in the terminal - handy for a quick look without opening the dashboard.

llmprof traces [--limit N]

Reads the same local database the dashboard uses (LLMPROF_HOME / LLMPROF_DB_URL apply).

`llmprof reset`

Delete all captured traces for a clean slate (the dashboard goes empty).

llmprof reset        # prompts for confirmation
llmprof reset --yes  # skip the prompt

Useful after changing models or upgrading: analysis (cost, reclaimable) is stored per trace at capture time and is not recomputed, so old traces keep their old numbers. Resetting gives the aggregates a clean baseline.

`llmprof selftest`

Replay request/response fixtures through the real pipeline and check the recorded trace: token capture, cost, and invariants like cached <= prompt and reclaimable <= spend. Catches data-correctness regressions without a live API.

llmprof selftest                 # built-in synthetic fixtures (one per wire)
llmprof selftest --corpus ./fix  # also replay fixtures you captured (see below)

The ground truth is the upstream response’s own usage block, which the proxy sees directly, so no API key is needed. Exits non-zero if any check fails.

To build your own corpus, run the proxy with LLMPROF_CAPTURE pointing at a directory; every call is saved there as a replayable fixture:

LLMPROF_CAPTURE=./fixtures llmprof up
# ...use it normally, then:
llmprof selftest --corpus ./fixtures