Distributed KV Cache#

Sharing and coordinating KV cache across multiple LMCache servers and inference instances – disaggregated prefill, peer-to-peer sharing, multi-server coordination, and cache management.