lmcache.experimental.storage_backend package#
Subpackages#
Submodules#
lmcache.experimental.storage_backend.abstract_backend module#
- class lmcache.experimental.storage_backend.abstract_backend.StorageBackendInterface(dst_device: str = 'cuda')[source]#
-
- abstract contains(key: CacheEngineKey) bool [source]#
Check whether key is in the storage backend.
- abstract exists_in_put_tasks(key: CacheEngineKey) bool [source]#
Check whether key is in the ongoing put tasks.
- abstract get_blocking(key: CacheEngineKey) MemoryObj | None [source]#
A blcocking function to get the kv cache from the storage backend.
- Parameters:
key (CacheEngineKey) – The key of the MemoryObj.
- Returns:
MemoryObj. None if the key does not exist.
- abstract submit_prefetch_task(key: CacheEngineKey) Future | None [source]#
An async function to get the MemoryObj from the storage backend.
- Parameters:
key (CacheEngineKey) – The key of the MemoryObj.
- Returns:
a future object. None if the key does not exist.
- abstract submit_put_task(key: CacheEngineKey, obj: MemoryObj) Future | None [source]#
An async function to put the MemoryObj into the storage backend.
- Parameters:
key (CacheEngineKey) – The key of the MemoryObj.
obj (MemoryObj) – The MemoryObj to be stored.
- Returns:
a future object
lmcache.experimental.storage_backend.local_disk_backend module#
- class lmcache.experimental.storage_backend.local_disk_backend.LocalDiskBackend(config: LMCacheEngineConfig, loop: AbstractEventLoop, memory_allocator: MemoryAllocatorInterface, dst_device: str = 'cuda', lookup_server: LookupServerInterface | None = None)[source]#
Bases:
StorageBackendInterface
- async async_load_bytes_from_disk(path: str, dtype: dtype, shape: Size) MemoryObj | None [source]#
Async load bytearray from disk.
- async_save_bytes_to_disk(key: CacheEngineKey, memory_obj: MemoryObj) None [source]#
Convert KV to bytes and async store bytes to disk.
- contains(key: CacheEngineKey) bool [source]#
Check whether key is in the storage backend.
- exists_in_put_tasks(key: CacheEngineKey) bool [source]#
Check whether key is in the ongoing put tasks.
- get_blocking(key: CacheEngineKey) MemoryObj | None [source]#
Blocking get function.
- insert_key(key: CacheEngineKey, memory_obj: MemoryObj) None [source]#
- load_bytes_from_disk(path: str, dtype: dtype, shape: Size) MemoryObj | None [source]#
Load bytearray from disk.
- load_disk(path: str, backend: str = 'bytes', dtype: dtype | None = None, shape: Size | None = None) MemoryObj | None [source]#
Load KV from disk.
- remove(key: CacheEngineKey) None [source]#
- submit_prefetch_task(key: CacheEngineKey) Future | None [source]#
An async function to get the MemoryObj from the storage backend.
- Parameters:
key (CacheEngineKey) – The key of the MemoryObj.
- Returns:
a future object. None if the key does not exist.
- submit_put_task(key: CacheEngineKey, memory_obj: MemoryObj) Future | None [source]#
An async function to put the MemoryObj into the storage backend.
- Parameters:
key (CacheEngineKey) – The key of the MemoryObj.
obj (MemoryObj) – The MemoryObj to be stored.
- Returns:
a future object
lmcache.experimental.storage_backend.storage_manager module#
- class lmcache.experimental.storage_backend.storage_manager.StorageManager(config: LMCacheEngineConfig, metadata: LMCacheEngineMetadata, allocator: MemoryAllocatorInterface, lookup_server: LookupServerInterface | None = None)[source]#
The StorageManager is responsible for managing the storage backends.
- allocate(shape: Size, dtype: dtype, eviction=True) MemoryObj | None [source]#
Allocate memory object with memory allocator. Use LRU evictor if eviction is enabled.
- contains(key: CacheEngineKey, search_range: List[str] | None = None) bool [source]#
Check whether the key exists in the storage backend.
- Parameters:
key (CacheEngineKey) – The key to check.
search_range (Optional[List[str]]) – The range of storage backends
to search in. Should be a subset of [“Hot”, “LocalDiskBackend”] for now. If None, search in all backends.
return: True if the key exists in the specified storage backends.
- get(key: CacheEngineKey) MemoryObj | None [source]#
Blocking function to get the memory object from the storages.
- prefetch(key: CacheEngineKey) None [source]#
Launch a prefetch request in the storage backend. Non-blocking
- put(key: CacheEngineKey, memory_obj: MemoryObj) None [source]#
Non-blocking function to put the memory object into the storages. Do not store if the same object is being stored (handled here by storage manager) or has been stored (handled by storage backend).
Module contents#
- lmcache.experimental.storage_backend.CreateStorageBackends(config: LMCacheEngineConfig, metadata: LMCacheEngineMetadata, loop: AbstractEventLoop, memory_allocator: MemoryAllocatorInterface, dst_device: str = 'cuda', lookup_server: LookupServerInterface | None = None) OrderedDict[str, StorageBackendInterface] [source]#