Documentation
¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func ChatTemplate ¶ added in v0.253.0
ChatTemplate returns the chat template for the given model.
func ModelFilePath ¶ added in v0.253.0
ModelFilePath returns the file path of the model.
Types ¶
type Manager ¶
type Manager struct {
// contains filtered or unexported fields
}
Manager manages the Ollama service.
TODO(kenji): Refactor this class once we completely switch to the one-odel-per-pod implementation where inference-manager-engine doesn't directly run vLLM or Ollama.
func (*Manager) CreateNewModel ¶
CreateNewModel creates a new model with the given name and spec.
func (*Manager) IsReady ¶ added in v0.212.0
IsReady returns true if the processor is ready. If not, it returns a message describing why it is not ready.
func (*Manager) UpdateModelTemplateToLatest ¶ added in v0.222.0
UpdateModelTemplateToLatest updates the model template to the latest.
func (*Manager) WaitForReady ¶
WaitForReady waits for the vllm service to be ready.
Click to show internal directories.
Click to hide internal directories.