Documentation ¶
Overview ¶
Package modelcache provides an LRU model file cache for pre-caching GGUF models on Kubernetes nodes via a DaemonSet.
Experimental — this package is not yet wired into the main framework.
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type Cache ¶
type Cache struct {
// contains filtered or unexported fields
}
Cache manages local model files with LRU eviction.
func NewCache ¶
NewCache creates a cache that stores files under dir with a maximum aggregate size of maxSize bytes.
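To make "maximum aggregate size" concrete, the following sketch sums the sizes of all regular files under a directory and compares the total against a byte budget. It does not call modelcache; dirSize, the temporary directory, the placeholder file name, and the 512-byte budget are all assumptions made for illustration, and the package may measure usage differently.

```go
package main

import (
	"fmt"
	"io/fs"
	"os"
	"path/filepath"
)

// dirSize returns the total size in bytes of all regular files under dir.
// Hypothetical helper for illustration; it is not part of modelcache.
func dirSize(dir string) (int64, error) {
	var total int64
	err := filepath.WalkDir(dir, func(path string, d fs.DirEntry, err error) error {
		if err != nil {
			return err
		}
		if !d.Type().IsRegular() {
			return nil // skip directories, symlinks, and other non-regular entries
		}
		info, err := d.Info()
		if err != nil {
			return err
		}
		total += info.Size()
		return nil
	})
	return total, err
}

func main() {
	dir, err := os.MkdirTemp("", "modelcache-sketch")
	if err != nil {
		panic(err)
	}
	defer os.RemoveAll(dir)

	// Write a 1 KiB placeholder standing in for a GGUF model file.
	if err := os.WriteFile(filepath.Join(dir, "model.gguf"), make([]byte, 1024), 0o644); err != nil {
		panic(err)
	}

	const maxSize = 512 // bytes; assumed budget for the example
	used, err := dirSize(dir)
	if err != nil {
		panic(err)
	}
	fmt.Printf("used=%d maxSize=%d overBudget=%v\n", used, maxSize, used > maxSize)
}
```

When the measured total exceeds the budget, as it does here, a cache of this kind would need to evict entries before it is back within its limit.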
func (*Cache) Evict ¶
func (c *Cache) Evict()
Evict removes least-recently-used entries until the total cache size is at or below maxSize.
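The eviction rule described above can be sketched with a doubly linked list ordered by recency plus a map for lookup. This is a minimal illustration under stated assumptions, not the package's implementation: the lru and entry types, the touch method, and the remove hook are hypothetical names, and the real Cache keeps its fields unexported.

```go
package main

import (
	"container/list"
	"fmt"
)

// entry records one cached file. Hypothetical type for illustration.
type entry struct {
	name string
	size int64
}

// lru is a hypothetical in-memory index; the real Cache fields are
// unexported and may differ.
type lru struct {
	maxSize int64
	total   int64
	order   *list.List               // front = most recently used
	items   map[string]*list.Element // name -> element in order
	remove  func(name string) error  // assumed hook that deletes the backing file
}

func newLRU(maxSize int64, remove func(string) error) *lru {
	return &lru{
		maxSize: maxSize,
		order:   list.New(),
		items:   map[string]*list.Element{},
		remove:  remove,
	}
}

// touch records a use of name, inserting it if it is not yet tracked.
func (l *lru) touch(name string, size int64) {
	if el, ok := l.items[name]; ok {
		e := el.Value.(*entry)
		l.total += size - e.size
		e.size = size
		l.order.MoveToFront(el)
		return
	}
	l.items[name] = l.order.PushFront(&entry{name: name, size: size})
	l.total += size
}

// evict drops least-recently-used entries until total <= maxSize and
// returns the names it removed.
func (l *lru) evict() []string {
	var evicted []string
	for l.total > l.maxSize {
		el := l.order.Back()
		if el == nil {
			break
		}
		e := el.Value.(*entry)
		if err := l.remove(e.name); err != nil {
			break // stop and keep the entry tracked if the file could not be deleted
		}
		l.order.Remove(el)
		delete(l.items, e.name)
		l.total -= e.size
		evicted = append(evicted, e.name)
	}
	return evicted
}

func main() {
	c := newLRU(10, func(string) error { return nil }) // no-op remover for the demo
	c.touch("a.gguf", 6)
	c.touch("b.gguf", 6)
	c.touch("a.gguf", 6) // a.gguf is now the most recently used
	fmt.Println(c.evict(), c.total) // prints "[b.gguf] 6"
}
```

Passing the file deletion in as a function keeps the bookkeeping testable without touching disk; a real cache would more likely call os.Remove on a path under its directory.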