Skip to content

⌘ K

Getting Started

KV Cache Operations

Recipes

Secondary KV Storage
- Supported Backends
- KV Cache Compression
  - CacheGen

Distributed KV Cache

Use LMCache in Production

Observability
- Metrics
- Logging
- Tracing

Community
- Community meetings
- Blogs

KV Cache Optimizations
- CacheBlend
- Segmented Prefill

Developer Guide

Non-KV Caching
- Encodings
- Hidden states

Legacy (In-Process Mode)

/

Use LMCache in Production

Use LMCache in Production#

Deploying, scaling, and operating LMCache in production.

Deployment Guide
Kubernetes Deployment
Kubernetes Operator
Runtime Plugins
Dynamo Integration

KV Cache Management

Deployment Guide

© 2024, The LMCache Team Built with Sphinx 8.2.3