Documentation
¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type CallMetrics ¶
type CallMetrics struct {
Provider string `json:"provider"`
Model string `json:"model"`
StartAt time.Time `json:"start_at"`
FirstAt time.Time `json:"first_at,omitempty"`
EndAt time.Time `json:"end_at,omitempty"`
Bytes int `json:"bytes"`
Chunks int `json:"chunks"`
RequestBytes int `json:"request_bytes,omitempty"`
ResponseBytes int `json:"response_bytes,omitempty"`
HTTPStatus int `json:"http_status,omitempty"`
PromptTokens int `json:"prompt_tokens"`
CompletionTokens int `json:"completion_tokens"`
TotalTokens int `json:"total_tokens"`
Error string `json:"error,omitempty"`
}
CallMetrics reports lightweight performance indicators for a single LLM call.
func (CallMetrics) FirstTokenLatency ¶
func (m CallMetrics) FirstTokenLatency() time.Duration
FirstTokenLatency returns time to first delta/token if available.
func (CallMetrics) Latency ¶
func (m CallMetrics) Latency() time.Duration
Latency returns end-to-end latency (EndAt-StartAt) if available.
Click to show internal directories.
Click to hide internal directories.