genai

package

v0.2.0 Latest Latest Go to latest Published: Mar 19, 2026 License: MIT Imports: 12 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/skosovsky/metry

Links

Open Source Insights

Documentation ¶

Overview ¶

Package genai provides OpenLLMetry semantic convention constants and helpers for recording GenAI usage (tokens, cost, prompts) on OTel spans.

Index ¶

Constants
Variables
func RecordAgentStep(span trace.Span, agentName, agentRole, step string)
func RecordAsyncFeedback(ctx context.Context, tracer trace.Tracer, traceIDHex string, score float64, ...) error
func RecordCacheHit(span trace.Span, hit bool, source string)
func RecordInteraction(ctx context.Context, span trace.Span, payload GenAIPayload, usage GenAIUsage)
func RecordStreamingCompletion(ctx context.Context, modelName string, outputTokens int, ttftSeconds float64, ...)
func RecordTTFT(ctx context.Context, durationSeconds float64, modelName string)
func RecordToolResult(span trace.Span, resultJSON string, isError bool)
func StartToolSpan(ctx context.Context, toolName, toolID, argsJSON string) (context.Context, trace.Span)
type GenAIPayload
type GenAIUsage

Constants ¶

View Source

const (
	System          = "gen_ai.system"
	RequestModel    = "gen_ai.request.model"
	ResponseModel   = "gen_ai.response.model"
	Prompt          = "gen_ai.prompt"
	Completion      = "gen_ai.completion"
	InputTokens     = "gen_ai.usage.input_tokens"  // #nosec G101 -- OTel semantic convention name, not a credential
	OutputTokens    = "gen_ai.usage.output_tokens" // #nosec G101 -- OTel semantic convention name, not a credential
	CostUSD         = "gen_ai.usage.cost"
	AudioSeconds    = "gen_ai.usage.audio_seconds"
	ImageCount      = "gen_ai.usage.image_count"
	EvaluationScore = "gen_ai.evaluation.score"
	EvaluationText  = "gen_ai.evaluation.text"

	// Tool Calling (toolsy).
	ToolName   = "gen_ai.tool.name"
	ToolID     = "gen_ai.tool.id"
	ToolArgs   = "gen_ai.tool.args"   // Expected JSON string.
	ToolResult = "gen_ai.tool.result" // Tool output (truncated if over internal limit).
	ToolError  = "gen_ai.tool.error"  // True if the tool call failed.

	// RAG and Semantic Cache.
	RetrievalSource = "gen_ai.retrieval.source"
	CacheHit        = "gen_ai.cache.hit"
	EmbeddingModel  = "gen_ai.embedding.model"

	// Multi-agent and orchestration (flowy).
	AgentName    = "gen_ai.agent.name"
	AgentRole    = "gen_ai.agent.role"
	WorkflowStep = "gen_ai.workflow.step"
	PromptType   = "gen_ai.prompt.type"

	// Operation purpose for cost tracking (e.g. generation vs guard evaluation).
	OperationPurpose = "ai.operation.purpose"
)

OpenLLMetry semantic convention attribute keys. See https://openllmetry.io/ and OpenTelemetry GenAI semantic conventions.

Note on OpenTelemetry standard evolution: Currently, metry defines its own semantic conventions for GenAI (e.g. CostUSD = "gen_ai.usage.cost") tracking the OpenLLMetry project. The OpenTelemetry project is actively standardizing GenAI semantic conventions (semconv). When official OTel GenAI semconv mature and are included in the standard Go OTel packages (e.g., > 1.42), these constants should be updated to alias the official ones to ensure long-term ecosystem compatibility without breaking the public `metry/genai` API.

View Source

const (
	PurposeGeneration        = "generation"
	PurposeGuardEvaluation   = "guard_evaluation"
	PurposeQualityEvaluation = "quality_evaluation"
)

Standard values for OperationPurpose.

Variables ¶

View Source

var (
	SystemKey          = attribute.Key(System)
	RequestModelKey    = attribute.Key(RequestModel)
	ResponseModelKey   = attribute.Key(ResponseModel)
	PromptKey          = attribute.Key(Prompt)
	CompletionKey      = attribute.Key(Completion)
	InputTokensKey     = attribute.Key(InputTokens)
	OutputTokensKey    = attribute.Key(OutputTokens)
	CostUSDKey         = attribute.Key(CostUSD)
	AudioSecondsKey    = attribute.Key(AudioSeconds)
	ImageCountKey      = attribute.Key(ImageCount)
	EvaluationScoreKey = attribute.Key(EvaluationScore)
	EvaluationTextKey  = attribute.Key(EvaluationText)

	ToolNameKey   = attribute.Key(ToolName)
	ToolIDKey     = attribute.Key(ToolID)
	ToolArgsKey   = attribute.Key(ToolArgs)
	ToolResultKey = attribute.Key(ToolResult)
	ToolErrorKey  = attribute.Key(ToolError)

	RetrievalSourceKey = attribute.Key(RetrievalSource)
	CacheHitKey        = attribute.Key(CacheHit)
	EmbeddingModelKey  = attribute.Key(EmbeddingModel)

	AgentNameKey    = attribute.Key(AgentName)
	AgentRoleKey    = attribute.Key(AgentRole)
	WorkflowStepKey = attribute.Key(WorkflowStep)
	PromptTypeKey   = attribute.Key(PromptType)

	// Operation purpose for cost tracking (e.g. generation vs guard evaluation).
	OperationPurposeKey = attribute.Key(OperationPurpose)
)

Attribute keys as attribute.Key for type-safe span recording.

Functions ¶

func RecordAgentStep ¶ added in v0.1.2

func RecordAgentStep(span trace.Span, agentName, agentRole, step string)

RecordAgentStep records one agent step as a span event (ReAct loops: Thought -> Action -> Observation). Event name gen_ai.agent.step and attributes follow OTel GenAI semantic conventions for dashboards. Call from flowy on each state transition; multiple calls on the same span produce a chronological event list.

func RecordAsyncFeedback ¶ added in v0.2.0

func RecordAsyncFeedback(
	ctx context.Context,
	tracer trace.Tracer,
	traceIDHex string,
	score float64,
	feedbackText string,
) error

RecordAsyncFeedback attaches delayed evaluation data to an existing trace.

func RecordCacheHit ¶

func RecordCacheHit(span trace.Span, hit bool, source string)

RecordCacheHit records cache hit and retrieval source on the span. Call from RAG layer before LLM request.

func RecordInteraction ¶

func RecordInteraction(ctx context.Context, span trace.Span, payload GenAIPayload, usage GenAIUsage)

RecordInteraction records the payload and usage for a single GenAI interaction.

func RecordStreamingCompletion ¶ added in v0.2.0

func RecordStreamingCompletion(
	ctx context.Context,
	modelName string,
	outputTokens int,
	ttftSeconds float64,
	totalDurationSeconds float64,
)

RecordStreamingCompletion records aggregate streaming quality metrics for a completed generation.

func RecordTTFT ¶

func RecordTTFT(ctx context.Context, durationSeconds float64, modelName string)

RecordTTFT records the Time To First Token (in seconds) as a histogram metric with model dimension. modelName is recorded as an attribute so dashboards can show TTFT per LLM (e.g. gpt-4o vs claude-3-5). Metrics are registered automatically when metry.Init is called with a metric exporter.

func RecordToolResult ¶ added in v0.1.2

func RecordToolResult(span trace.Span, resultJSON string, isError bool)

RecordToolResult records the result of a tool call on its own span (from StartToolSpan). resultJSON is truncated; isError sets span status to Error for dashboard visibility.

func StartToolSpan ¶ added in v0.1.3

func StartToolSpan(ctx context.Context, toolName, toolID, argsJSON string) (context.Context, trace.Span)

StartToolSpan creates a child span for a tool invocation. Caller MUST call span.End() (e.g. via defer). Use for parallel tool calls so each has its own span and timing.

Types ¶

type GenAIPayload ¶ added in v0.2.0

type GenAIPayload struct {
	System     string
	Prompt     string
	Completion string
}

GenAIPayload captures the text payloads attached to an interaction span.

type GenAIUsage ¶ added in v0.2.0

type GenAIUsage struct {
	InputTokens  int
	OutputTokens int
	CostUSD      float64
	AudioSeconds float64
	ImageCount   int
	Purpose      string
}

GenAIUsage captures token/cost and multimodal usage recorded on an interaction span.

Source Files ¶

View all Source files

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL