embed

package

Version: v0.2.15 | Published: Jun 26, 2026 | License: MPL-2.0 | Imports: 6 | Imported by: 0

Repository

github.com/loremlabs/thanks-computer

Documentation ¶

Overview ¶

Package embed is the chassis-owned `ai://embed` exec dispatch surface.

Like chassis/chat, it is a thin registry: backends (ollama-local first, OpenAI-direct next) self-register via init() in their subpackages; the chassis activates one with a blank import. Per-call selection is handled by Resolve (provider override, else first-registered).

**Separation of concerns.** embed turns text into a vector and persists NOTHING. Storing and searching vectors is a separate primitive (txco://vector). This mirrors the design doc's split: embedding belongs to AI; vector storage/retrieval belongs to infrastructure.

**Boundary of trust.** Same as chat: RequiredSecrets() declares which standardized names a backend needs (the ollama backend needs none — it talks to a local, keyless endpoint); the ExecAI handler materializes them through the per-tenant store with optional env fallback; cleartext rides only in the *secrets.SecretBag passed to Embed.

**v1 is intentionally boring.** Batch is first-class (one round-trip embeds many texts), but there is no caching, no automatic chunking of over-long inputs, and no cross-backend capability routing. Each escalation is its own focused change.

Index ¶

func Register(name string, c Constructor)
func Registered() []string
type Backend
- func Open(name string, cfg Config) (Backend, error)
- func Resolve(providerHint string, cfg Config) (Backend, string, error)
type CodedError
type Config
type Constructor
type InvalidWithError
- func (e *InvalidWithError) Code() string
- func (e *InvalidWithError) Error() string
type MissingSecretError
- func (e *MissingSecretError) Code() string
- func (e *MissingSecretError) Error() string
type NoBackendError
- func (e *NoBackendError) Code() string
- func (e *NoBackendError) Error() string
type ProviderHTTPError
- func (e *ProviderHTTPError) Code() string
- func (e *ProviderHTTPError) Error() string
type ProviderNetError
- func (e *ProviderNetError) Code() string
- func (e *ProviderNetError) Error() string
type ProviderParseError
- func (e *ProviderParseError) Code() string
- func (e *ProviderParseError) Error() string
type Request
type Response

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

func Register ¶

func Register(name string, c Constructor)

Register adds a backend constructor. Called from a backend package's init(); the chassis activates a backend with a blank import.

Re-registering an existing name overwrites its constructor (test support) but does not change its position: a name keeps the order slot it received when it was first registered.
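The overwrite and ordering rules above can be sketched with a small registry. This is a minimal illustration, not the package's actual implementation: the `registry` type, its `First` method, the `stub` backend, and the names "zeta" and "alpha" are all hypothetical, and `Config`, `Backend`, and `Constructor` are local stand-ins for the documented types.

```go
package main

import (
	"fmt"
	"sort"
	"sync"
)

// Config, Backend and Constructor are local stand-ins for the package's types.
type Config struct{}

type Backend interface{ Name() string }

type Constructor func(Config) (Backend, error)

// registry is a hypothetical internal structure: a map for lookup plus a
// slice that records the order in which names were first registered.
type registry struct {
	mu    sync.RWMutex
	ctors map[string]Constructor
	order []string
}

func newRegistry() *registry {
	return &registry{ctors: make(map[string]Constructor)}
}

// Register stores c under name. A repeated name replaces the constructor
// but keeps the position recorded the first time the name was seen.
func (r *registry) Register(name string, c Constructor) {
	r.mu.Lock()
	defer r.mu.Unlock()
	if _, seen := r.ctors[name]; !seen {
		r.order = append(r.order, name)
	}
	r.ctors[name] = c
}

// Registered returns a sorted copy of the registered names.
func (r *registry) Registered() []string {
	r.mu.RLock()
	defer r.mu.RUnlock()
	names := append([]string(nil), r.order...)
	sort.Strings(names)
	return names
}

// First returns the first-registered name, or "" when nothing is registered.
func (r *registry) First() string {
	r.mu.RLock()
	defer r.mu.RUnlock()
	if len(r.order) == 0 {
		return ""
	}
	return r.order[0]
}

type stub struct{ name string }

func (s stub) Name() string { return s.name }

func stubCtor(name string) Constructor {
	return func(Config) (Backend, error) { return stub{name: name}, nil }
}

func main() {
	r := newRegistry()
	r.Register("zeta", stubCtor("zeta"))
	r.Register("alpha", stubCtor("alpha"))
	r.Register("zeta", stubCtor("zeta")) // overwrites; "zeta" stays first
	fmt.Println(r.Registered())          // [alpha zeta]
	fmt.Println(r.First())               // zeta
}
```

The sketch keeps the sorted listing and the first-registered default as separate views of the same data, which is why the two can disagree.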

func Registered ¶

func Registered() []string

Registered returns the names of currently registered backends, sorted.

Types ¶

type Backend ¶

type Backend interface {
	// Name returns the registered name (must match the Register key).
	Name() string

	// Capabilities returns descriptive labels recorded on the trace event
	// for observability. v1 does NOT route on them.
	Capabilities() []string

	// DefaultModel is the model used when a request omits WITH model.
	DefaultModel() string

	// RequiredSecrets are the standardized secret names this backend needs.
	// The ollama backend returns nil (local, unauthenticated); the OpenAI
	// backend will return []string{"OPENAI_KEY"}.
	RequiredSecrets() []string

	// Embed returns one vector per input text, in the same order. The bag
	// carries cleartext for every name in RequiredSecrets(); implementations
	// read via bag.Get and keep cleartext in local variables only.
	Embed(ctx context.Context, req Request, bag *secrets.SecretBag) (Response, error)
}

Backend is the chassis-facing interface every embedding backend implements.

Lifecycle: backends register by name via Register() in an init(); the chassis resolves one per ai://embed EXEC. Embed must be safe for concurrent use — the chassis dispatches embed ops in parallel on WHEN/EMIT fan-out.

func Open ¶

func Open(name string, cfg Config) (Backend, error)

Open constructs the named backend. An unknown name returns an error that lists the registered backends.

func Resolve ¶

func Resolve(providerHint string, cfg Config) (Backend, string, error)

Resolve picks a backend for one ai://embed dispatch. v1 logic mirrors chat.Resolve:

- providerHint non-empty → look up by name; NoBackendError if unknown. routing_decision = "provider-override".
- providerHint empty → first-registered backend. routing_decision = "default". With a single v1 backend (ollama), this is unambiguous.
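The two branches above can be sketched as follows. This is a simplified illustration and not the package's source: the lowercase `register` and `resolve` helpers, the package-level maps, the `stub` backend, and the error message format are assumptions, and the stand-in `NoBackendError` omits its `Code` method.

```go
package main

import (
	"fmt"
	"sort"
)

// Local stand-ins for the documented types.
type Config struct{}

type Backend interface{ Name() string }

type Constructor func(Config) (Backend, error)

type NoBackendError struct {
	ProviderHint string
	Registered   []string
}

// The message format is an assumption made for this sketch.
func (e *NoBackendError) Error() string {
	return fmt.Sprintf("embed: no backend for provider %q (registered: %v)", e.ProviderHint, e.Registered)
}

var (
	ctors = map[string]Constructor{} // lookup by name
	order []string                   // first-registration order
)

func register(name string, c Constructor) {
	if _, seen := ctors[name]; !seen {
		order = append(order, name)
	}
	ctors[name] = c
}

func sortedNames() []string {
	names := append([]string(nil), order...)
	sort.Strings(names)
	return names
}

// resolve returns the backend plus the routing decision label.
func resolve(providerHint string, cfg Config) (Backend, string, error) {
	name, decision := providerHint, "provider-override"
	if providerHint == "" {
		if len(order) == 0 {
			return nil, "", &NoBackendError{Registered: sortedNames()}
		}
		name, decision = order[0], "default"
	}
	ctor, ok := ctors[name]
	if !ok {
		return nil, "", &NoBackendError{ProviderHint: providerHint, Registered: sortedNames()}
	}
	b, err := ctor(cfg)
	if err != nil {
		return nil, "", err
	}
	return b, decision, nil
}

type stub string

func (s stub) Name() string { return string(s) }

func main() {
	register("ollama", func(Config) (Backend, error) { return stub("ollama"), nil })
	for _, hint := range []string{"", "ollama", "missing"} {
		b, decision, err := resolve(hint, Config{})
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Println(b.Name(), decision)
	}
}
```

With one registered backend, the empty hint and the explicit "ollama" hint reach the same backend and differ only in the recorded routing decision.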

type CodedError ¶

type CodedError interface {
	error
	Code() string
}

CodedError is implemented by embed errors that carry a stable txco_embed_* code. The ExecAI handler surfaces the code on the response envelope's top-level `embed.error.code` so rule authors can dispatch uniformly with `WHEN @embed.error EXEC ...`.

type Config ¶

type Config struct {
	// HTTPClient is the chassis-owned http.Client (egress-guarded). Every
	// outbound call from a backend MUST use it; backends never build their
	// own transport.
	HTTPClient *http.Client

	// OllamaBaseURL is the base URL for the ollama backend (e.g.
	// http://localhost:11434). Empty → the backend's localhost default.
	OllamaBaseURL string
}

Config carries embed-package construction options resolved from chassis config. Backends extend it with their own fields without breaking callers (same convention as chat.Config).

type Constructor ¶

type Constructor func(Config) (Backend, error)

Constructor builds a Backend from resolved config.

type InvalidWithError ¶

type InvalidWithError struct {
	Reason string
}

InvalidWithError flags a malformed WITH clause (the rule author's bug).

func (*InvalidWithError) Code ¶

func (e *InvalidWithError) Code() string

func (*InvalidWithError) Error ¶

func (e *InvalidWithError) Error() string

type MissingSecretError ¶

type MissingSecretError struct {
	Backend string
	Secret  string
}

MissingSecretError is returned when a backend's RequiredSecrets() name is absent from the per-tenant store (and env fallback, when enabled).

func (*MissingSecretError) Code ¶

func (e *MissingSecretError) Code() string

func (*MissingSecretError) Error ¶

func (e *MissingSecretError) Error() string
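The lookup that produces a MissingSecretError (per-tenant store first, then optional env fallback) can be sketched as below. Everything beyond the struct fields is an assumption: `materialize` is a hypothetical helper, a plain map stands in for both the per-tenant store and the secret bag, and the error message is invented.

```go
package main

import (
	"fmt"
	"os"
)

// MissingSecretError mirrors the documented fields; the message is assumed.
type MissingSecretError struct {
	Backend string
	Secret  string
}

func (e *MissingSecretError) Error() string {
	return fmt.Sprintf("embed: backend %q is missing secret %q", e.Backend, e.Secret)
}

// materialize is a hypothetical helper. It resolves every required name
// from the tenant store, then from the environment when envFallback is
// on, and fails on the first name found in neither place.
func materialize(backend string, required []string, store map[string]string, envFallback bool) (map[string]string, error) {
	bag := make(map[string]string, len(required))
	for _, name := range required {
		if v, ok := store[name]; ok && v != "" {
			bag[name] = v
			continue
		}
		if envFallback {
			if v, ok := os.LookupEnv(name); ok && v != "" {
				bag[name] = v
				continue
			}
		}
		return nil, &MissingSecretError{Backend: backend, Secret: name}
	}
	return bag, nil
}

func main() {
	// A keyless backend (RequiredSecrets() == nil) needs nothing.
	bag, err := materialize("ollama", nil, nil, false)
	fmt.Println(len(bag), err) // 0 <nil>

	// A backend with a required name and an empty store fails.
	_, err = materialize("openai", []string{"OPENAI_KEY"}, map[string]string{}, false)
	fmt.Println(err)
}
```

Only the count of materialized secrets is printed here, in keeping with the rule that cleartext stays out of logs and traces.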

type NoBackendError ¶

type NoBackendError struct {
	ProviderHint string
	Registered   []string
}

NoBackendError is returned by Resolve when a provider hint names an unregistered backend, or when no backend is registered at all.

func (*NoBackendError) Code ¶

func (e *NoBackendError) Code() string

func (*NoBackendError) Error ¶

func (e *NoBackendError) Error() string

type ProviderHTTPError ¶

type ProviderHTTPError struct {
	StatusCode int
	Body       string
}

ProviderHTTPError carries a non-2xx provider response (body already truncated/sanitized by the backend).

func (*ProviderHTTPError) Code ¶

func (e *ProviderHTTPError) Code() string

func (*ProviderHTTPError) Error ¶

func (e *ProviderHTTPError) Error() string

type ProviderNetError ¶

type ProviderNetError struct {
	Reason string
}

ProviderNetError carries a network/DNS/timeout failure reaching the provider (after the backend's retry budget is spent).

func (*ProviderNetError) Code ¶

func (e *ProviderNetError) Code() string

func (*ProviderNetError) Error ¶

func (e *ProviderNetError) Error() string

type ProviderParseError ¶

type ProviderParseError struct {
	Reason  string
	BodyLen int
}

ProviderParseError flags an empty or malformed provider response body.

func (*ProviderParseError) Code ¶

func (e *ProviderParseError) Code() string

func (*ProviderParseError) Error ¶

func (e *ProviderParseError) Error() string

type Request ¶

type Request struct {
	// Texts are the inputs to embed, in order. Single-text callers pass a
	// length-1 slice; batch is first-class.
	Texts []string `json:"texts"`

	// Model is the resolved model identifier; empty → backend default.
	Model string `json:"model,omitempty"`

	// Dimensions optionally requests a truncated embedding (Matryoshka).
	// Zero → the model's native dimension. Backends that can't truncate
	// ignore it; the chassis records the actual dimension returned.
	Dimensions int `json:"dimensions,omitempty"`

	// Intent is a trace-only label (e.g. "embed_book_profile").
	Intent string `json:"intent,omitempty"`
}

Request is the chassis-normalized embedding request. The handler decodes op.Meta (WITH-clause materialization) into this shape.
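The struct tags determine the JSON shape of a request. The sketch below copies the documented struct and round-trips it through encoding/json; the `encode` and `decode` helpers and the model name "example-model" are hypothetical, and how the handler actually decodes op.Meta is not shown in this documentation.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Request mirrors the documented struct and its JSON tags.
type Request struct {
	Texts      []string `json:"texts"`
	Model      string   `json:"model,omitempty"`
	Dimensions int      `json:"dimensions,omitempty"`
	Intent     string   `json:"intent,omitempty"`
}

// encode marshals r; Request contains only marshalable fields, so the
// error is always nil and is dropped.
func encode(r Request) string {
	b, _ := json.Marshal(r)
	return string(b)
}

// decode parses a JSON object into a Request.
func decode(raw string) (Request, error) {
	var r Request
	err := json.Unmarshal([]byte(raw), &r)
	return r, err
}

func main() {
	// Single-text callers pass a length-1 slice; zero-valued optional
	// fields are omitted from the output.
	fmt.Println(encode(Request{Texts: []string{"hello"}}))
	// {"texts":["hello"]}

	fmt.Println(encode(Request{
		Texts:      []string{"a", "b"},
		Model:      "example-model",
		Dimensions: 256,
		Intent:     "embed_book_profile",
	}))
	// {"texts":["a","b"],"model":"example-model","dimensions":256,"intent":"embed_book_profile"}

	r, err := decode(`{"texts":["a","b"],"dimensions":256}`)
	fmt.Println(len(r.Texts), r.Model == "", r.Dimensions, err) // 2 true 256 <nil>
}
```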

type Response ¶

type Response struct {
	// Vectors holds one embedding per input text, in input order.
	Vectors [][]float32 `json:"vectors"`

	// Provider, Model are observability fields surfaced in trace + envelope.
	Provider string `json:"provider"`
	Model    string `json:"model"`

	// Dimensions is the actual length of each returned vector (0 when no
	// vectors were produced, e.g. on error).
	Dimensions int `json:"dimensions"`

	// Tokens is the provider-reported input token count (0 if unreported).
	// Recorded in trace + `_embed` metadata; never charged to fuel.
	Tokens int64 `json:"tokens"`

	// LatencyMS is wall-clock from request build to response parse.
	LatencyMS int64 `json:"latency_ms"`

	// Retries is the count of provider retries performed (0 on first-try
	// success).
	Retries int `json:"retries"`
}

Response is the chassis-normalized embedding result. Backends translate the provider's on-wire shape into this struct.
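The field comments imply two invariants: one vector per input, and every vector as long as Dimensions. A caller or test could check them with a helper along these lines; `checkResponse` is a hypothetical function, and the `Response` here is trimmed to the two fields involved.

```go
package main

import "fmt"

// Response is a trimmed copy of the documented struct.
type Response struct {
	Vectors    [][]float32
	Dimensions int
}

// checkResponse is a hypothetical helper that verifies the documented
// shape: len(Vectors) equals the input count, and each vector has
// exactly Dimensions elements.
func checkResponse(inputs int, resp Response) error {
	if len(resp.Vectors) != inputs {
		return fmt.Errorf("embed: got %d vectors for %d inputs", len(resp.Vectors), inputs)
	}
	for i, v := range resp.Vectors {
		if len(v) != resp.Dimensions {
			return fmt.Errorf("embed: vector %d has length %d, want %d", i, len(v), resp.Dimensions)
		}
	}
	return nil
}

func main() {
	ok := Response{Vectors: [][]float32{{0.1, 0.2}, {0.3, 0.4}}, Dimensions: 2}
	fmt.Println(checkResponse(2, ok)) // <nil>

	short := Response{Vectors: [][]float32{{0.1, 0.2}}, Dimensions: 2}
	fmt.Println(checkResponse(2, short)) // embed: got 1 vectors for 2 inputs
}
```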

Directories ¶

Path	Synopsis
ollama	Package ollama is the v1 ai://embed backend: a local, keyless Ollama instance (https://ollama.com) speaking its /api/embed endpoint.