Documentation
¶
Index ¶
- func DeleteOrphanedRunnersDir(c config.OllamaConfig) error
- func SetEnvVarsFromConfig(c config.OllamaConfig) error
- type Manager
- func (m *Manager) CreateNewModelOfGGUF(modelName string, spec *manager.ModelSpec) error
- func (m *Manager) DownloadAndCreateNewModel(modelName string, resp *mv1.GetBaseModelPathResponse) error
- func (m *Manager) IsReady() (bool, string)
- func (m *Manager) PullBaseModel(modelName string) error
- func (m *Manager) Run() error
- func (m *Manager) UpdateModelTemplateToLatest(modelName string) error
- func (m *Manager) WaitForReady() error
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func DeleteOrphanedRunnersDir ¶ added in v0.256.0
func DeleteOrphanedRunnersDir(c config.OllamaConfig) error
DeleteOrphanedRunnersDir deletes orphaned payload directories.
func SetEnvVarsFromConfig ¶ added in v0.233.0
func SetEnvVarsFromConfig(c config.OllamaConfig) error
SetEnvVarsFromConfig sets environment variables from the given configuration.
Types ¶
type Manager ¶
type Manager struct {
// contains filtered or unexported fields
}
Manager manages the Ollama service.
TODO(kenji): Refactor this class once we completely switch to the one-odel-per-pod implementation where inference-manager-engine doesn't directly run vLLM or Ollama.
func (*Manager) CreateNewModelOfGGUF ¶ added in v0.273.0
CreateNewModelOfGGUF creates a new model with the given name and spec that uses a GGUF model file.
func (*Manager) DownloadAndCreateNewModel ¶ added in v0.273.0
func (m *Manager) DownloadAndCreateNewModel(modelName string, resp *mv1.GetBaseModelPathResponse) error
DownloadAndCreateNewModel downloads the model from the given path and creates a new model.
func (*Manager) IsReady ¶ added in v0.212.0
IsReady returns true if the processor is ready. If not, it returns a message describing why it is not ready.
func (*Manager) PullBaseModel ¶ added in v0.7.0
PullBaseModel pulls the base model from the given path.
func (*Manager) UpdateModelTemplateToLatest ¶ added in v0.222.0
UpdateModelTemplateToLatest updates the model template to the latest.
func (*Manager) WaitForReady ¶ added in v0.7.0
WaitForReady waits for the Ollama service to be ready.