Health Status

Live Connections

All currently active proxy connections with queue state, streaming mode, and live metrics.

Backends

Inspect backend status, load, errors, and concurrency settings in one place.

Table columns: Backend | Status | Load | Models | Metrics | Last Error | Controls

Chat Debugger

Chat directly through the proxy, much like the built-in llama.cpp web UI, with live token streaming, adjustable sampler parameters, and raw response inspection.

Request JSON

// request appears here
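As a concrete illustration, the request pane for a chat turn would hold an OpenAI-style JSON body of the kind llama.cpp serves; the sketch below builds one in Python, with a placeholder model name and illustrative sampler values:

```python
import json

# Illustrative OpenAI-compatible chat request; the model name and
# sampler parameters are placeholder values, not proxy requirements.
request_body = {
    "model": "local-model",
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ],
    "temperature": 0.7,   # sampler parameter
    "top_p": 0.9,         # sampler parameter
    "stream": True,       # ask the backend to stream tokens
}

print(json.dumps(request_body, indent=2))
```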

Response / Stream

// response appears here
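When streaming is enabled, the response pane shows server-sent events, one `data:` line per token delta. A minimal sketch of assembling the streamed text, using illustrative chunk payloads in the OpenAI-compatible format that llama.cpp emits:

```python
import json

# Illustrative SSE lines as the proxy would relay them from an
# OpenAI-compatible backend; real streams end with "data: [DONE]".
sse_lines = [
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    'data: {"choices": [{"delta": {"content": "lo!"}}]}',
    "data: [DONE]",
]

text = ""
for line in sse_lines:
    payload = line.removeprefix("data: ")
    if payload == "[DONE]":  # end-of-stream sentinel
        break
    chunk = json.loads(payload)
    delta = chunk["choices"][0]["delta"]
    text += delta.get("content", "")

print(text)  # accumulated completion text
```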

Recent Requests

A live history of recent requests, showing queue time, target backend, and outcome.
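One way to picture a history entry is as a small record holding those three fields plus the request path; the field names and values below are illustrative, not the proxy's actual schema:

```python
from dataclasses import dataclass

# Hypothetical shape of one request-history entry; field names and
# sample values are illustrative, not the proxy's actual schema.
@dataclass
class RequestRecord:
    path: str
    backend: str     # target backend that served the request
    queue_ms: float  # time spent waiting in the queue
    status: int      # HTTP outcome

history = [
    RequestRecord("/v1/chat/completions", "backend-a", 12.5, 200),
    RequestRecord("/v1/chat/completions", "backend-a", 480.0, 499),
]

# e.g. flag requests that waited unusually long in the queue
slow = [r for r in history if r.queue_ms > 100]
print(len(slow))
```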