lmcache ping#

The lmcache ping command is a liveness check for an LMCache KV cache server or a vLLM serving engine.

# Ping the KV cache server (default: http://localhost:8080)
lmcache ping kvcache

# Ping the serving engine (default: http://localhost:8000)
lmcache ping engine --url http://localhost:8000
======= Ping KV Cache ========
Status:                   OK
Round trip time (ms):     3.42
==============================

ping kvcache checks the /healthcheck endpoint; ping engine checks /health.

Options#

Flag

Description

kvcache | engine

Target to ping (positional, required).

--url

Server URL. Defaults to http://localhost:8080 for kvcache, http://localhost:8000 for engine.

--format

Output format: terminal (default) or json.

--output PATH

Save metrics to a file (format follows --format).

-q / --quiet

Suppress stdout output. Exit code only.

JSON Output#

lmcache ping kvcache --format json
{
  "title": "Ping KV Cache",
  "metrics": {
    "status": "OK",
    "round_trip_time_ms": 3.42
  }
}

Exit Codes#

Code

Meaning

0

Server is reachable (HTTP 200).

1

Connection failure or non-200 response.