confidential-websearch

command module

Version: v0.2.6 (latest) | Published: Mar 10, 2026 | License: AGPL-3.0

Repository

github.com/tinfoilsh/confidential-websearch

Confidential Web Search Proxy

A proxy that augments LLM responses with real-time web search results, running inside a secure enclave.

Requests flow through a pipeline of specialized models that handle search decisions, safety filtering, and response generation:

Agent Model: A small, fast model that decides whether a search is needed and generates queries
Safeguard Model: Filters PII from outgoing queries and detects prompt injection in search results
Responder Model: The user's requested model, which generates the final response using search context

The proxy uses the Tinfoil Go SDK for secure, attested communication with Tinfoil enclaves.

Architecture

User Request
  │ model: "<responder-model>"
  │ Authorization: Bearer <api-key>
  ▼
┌─────────────────────┐
│    Agent Model      │ ──► Decides: search needed?
│   (small, fast)     │     Generates search queries
└─────────────────────┘
          │
          ▼ search queries
┌─────────────────────┐
│   PII Filter        │ ──► Blocks queries with sensitive data
│ (Safeguard Model)   │     (SSN, bank accounts, medical IDs)
└─────────────────────┘
          │
          ▼ filtered queries
┌─────────────────────┐
│       Exa API       │ ──► Returns search results
└─────────────────────┘
          │
          ▼ search results
┌─────────────────────┐
│  Injection Filter   │ ──► Removes results with prompt injection
│ (Safeguard Model)   │     (instruction overrides, jailbreaks)
└─────────────────────┘
          │
          ▼ clean results
    ──► SSE: web_search_call events (query + status)
          │
          ▼
┌─────────────────────────────────────┐
│         Responder Model             │
│      (from user request)            │
└─────────────────────────────────────┘
          │
          ▼ (streaming)
    1. metadata chunk (annotations + reasoning)
    2. response content chunks...

Quick Start

# Set required API key
export EXA_API_KEY="your-exa-api-key"

# Run the proxy
go run .

# With verbose logging
go run . -v

Environment Variables

Variable	Default	Description
`EXA_API_KEY`	-	Exa AI Search API key (required)
`AGENT_MODEL`	-	Model for search decisions (small, fast)
`SAFEGUARD_MODEL`	-	Model for safety filtering
`ENABLE_INJECTION_CHECK`	`false`	Default for prompt injection detection (can be overridden per-request via tools)
`LISTEN_ADDR`	`:8089`	Address to listen on
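
The table above can be read as a small configuration loader. The following is a minimal sketch, not the proxy's actual `config` package: the `Config` struct, its field names, and `loadConfig` are hypothetical, and only the variable names and the two documented defaults (`false` and `:8089`) come from the table. The environment lookup is passed in as a function so the demo runs without touching the real environment; in a real program you would pass `os.Getenv`.

```go
package main

import (
	"errors"
	"fmt"
	"strconv"
)

// Config mirrors the environment variables in the table above.
// The struct and field names are illustrative, not the proxy's own types.
type Config struct {
	ExaAPIKey            string
	AgentModel           string
	SafeguardModel       string
	EnableInjectionCheck bool
	ListenAddr           string
}

// loadConfig reads settings through getenv so the lookup can be faked.
func loadConfig(getenv func(string) string) (Config, error) {
	cfg := Config{
		ExaAPIKey:      getenv("EXA_API_KEY"),
		AgentModel:     getenv("AGENT_MODEL"),
		SafeguardModel: getenv("SAFEGUARD_MODEL"),
		ListenAddr:     ":8089", // documented default
	}
	if cfg.ExaAPIKey == "" {
		return Config{}, errors.New("EXA_API_KEY is required")
	}
	if v := getenv("LISTEN_ADDR"); v != "" {
		cfg.ListenAddr = v
	}
	// ENABLE_INJECTION_CHECK defaults to false when unset.
	if v := getenv("ENABLE_INJECTION_CHECK"); v != "" {
		b, err := strconv.ParseBool(v)
		if err != nil {
			return Config{}, fmt.Errorf("ENABLE_INJECTION_CHECK: %w", err)
		}
		cfg.EnableInjectionCheck = b
	}
	return cfg, nil
}

func main() {
	// A fake environment for the demo; pass os.Getenv in a real program.
	env := map[string]string{"EXA_API_KEY": "example-key"}
	cfg, err := loadConfig(func(k string) string { return env[k] })
	if err != nil {
		fmt.Println("config error:", err)
		return
	}
	fmt.Printf("listen=%s injection_check=%v\n", cfg.ListenAddr, cfg.EnableInjectionCheck)
}
```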

API Endpoints

This server provides an OpenAI-compatible API with custom search and safety tools. Standard OpenAI SDKs can make requests, but custom streaming events and response fields are extensions that require additional client handling.

Chat Completions

POST /v1/chat/completions

curl http://localhost:8089/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $TINFOIL_API_KEY" \
  -d '{
    "model": "<responder-model>",
    "messages": [{"role": "user", "content": "What is the latest news about SpaceX?"}],
    "stream": true
  }'

Response includes standard OpenAI fields plus custom extensions:

choices[0].message.content - The generated response
choices[0].message.annotations - URL citations from search results
choices[0].message.search_reasoning - Agent's reasoning for search decisions (extension)
choices[0].message.blocked_searches - Queries blocked by safety filters (extension)
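
Because the extension fields are not part of the standard OpenAI schema, a client may need its own response type to read them. The sketch below decodes only the four fields listed above. The `chatResponse` type and `parseChatResponse` helper are hypothetical, `search_reasoning` is assumed to be a string, and the shapes of `annotations` and `blocked_searches` are not specified here, so they are kept as raw JSON. The sample payload is hand-written, not captured from a server.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// chatResponse decodes only the fields listed above. Extension payloads
// whose shape is unspecified are kept as raw JSON.
type chatResponse struct {
	Choices []struct {
		Message struct {
			Content         string          `json:"content"`
			Annotations     json.RawMessage `json:"annotations"`
			SearchReasoning string          `json:"search_reasoning"` // assumed to be a string
			BlockedSearches json.RawMessage `json:"blocked_searches"`
		} `json:"message"`
	} `json:"choices"`
}

// parseChatResponse unmarshals a non-streaming chat completion body.
func parseChatResponse(body []byte) (chatResponse, error) {
	var r chatResponse
	err := json.Unmarshal(body, &r)
	return r, err
}

func main() {
	// Hand-written sample payload for illustration only.
	sample := []byte(`{"choices":[{"message":{
		"content":"Hello",
		"annotations":[],
		"search_reasoning":"No search needed",
		"blocked_searches":[]}}]}`)

	r, err := parseChatResponse(sample)
	if err != nil || len(r.Choices) == 0 {
		fmt.Println("unexpected response:", err)
		return
	}
	m := r.Choices[0].Message
	fmt.Println("content:", m.Content)
	fmt.Println("reasoning:", m.SearchReasoning)
	fmt.Println("blocked:", string(m.BlockedSearches))
}
```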

Streaming: In addition to standard content chunks, the server streams custom web_search_call events that carry search status. These events use a chat.completion.chunk envelope so that standard SDKs do not fail on them, but their payload is custom.

Responses API

POST /v1/responses

curl http://localhost:8089/v1/responses \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $TINFOIL_API_KEY" \
  -d '{
    "model": "<responder-model>",
    "input": "What is the latest news about SpaceX?",
    "tools": [{"type": "web_search"}],
    "stream": true
  }'

Response includes structured output with web_search_call items and message content with annotations.

Streaming: When stream is true, the server emits OpenAI-conformant response.* events:

response.created, response.in_progress - Lifecycle events
response.web_search_call.in_progress/completed - Search status
response.output_text.delta - Content chunks
response.output_text.annotation.added - URL citations
response.completed - Final event

Health Check

GET /health - Returns {"status":"ok"}

Safety Features

PII Detection

Blocks search queries that would leak sensitive personally identifiable information:

Social Security Numbers, Tax IDs, Passport numbers
Bank account numbers, Credit card numbers
Medical record numbers, Health insurance IDs

Email addresses and phone numbers are also blocked, since they uniquely identify individuals. Names, addresses, and dates on their own are NOT blocked: they are commonly searched and do not uniquely identify someone. Combinations that identify a specific person (e.g., "John Smith, DOB 03/15/1985") are blocked.
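
To make the blocked categories concrete, here is a rough pattern-based illustration. This is not how the proxy works: the proxy delegates the decision to the Safeguard Model, whereas this sketch swaps in plain regular expressions for two structured identifiers (US-style SSNs and email addresses). The `piiPatterns` table and `detectPII` function are hypothetical, and the patterns are far from exhaustive. Regular expressions also cannot judge identifying combinations such as a name plus a date of birth, which is one reason a model is used.

```go
package main

import (
	"fmt"
	"regexp"
	"sort"
)

// piiPatterns covers two of the structured identifiers named above.
// These regular expressions are illustrative and far from exhaustive.
var piiPatterns = map[string]*regexp.Regexp{
	"ssn":   regexp.MustCompile(`\b\d{3}-\d{2}-\d{4}\b`),
	"email": regexp.MustCompile(`[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}`),
}

// detectPII returns the sorted names of every pattern that matches the query.
// An empty result means none of the patterns matched.
func detectPII(query string) []string {
	var hits []string
	for name, re := range piiPatterns {
		if re.MatchString(query) {
			hits = append(hits, name)
		}
	}
	sort.Strings(hits)
	return hits
}

func main() {
	queries := []string{
		"John Smith Seattle",          // a name alone is not flagged
		"records for SSN 123-45-6789", // structured identifier
		"contact jane@example.com",    // email address
	}
	for _, q := range queries {
		fmt.Printf("%q -> matched categories: %v\n", q, detectPII(q))
	}
}
```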

Prompt Injection Detection

Filters search results that contain prompt injection attempts:

Instruction overrides ("ignore previous instructions")
Role manipulation ("you are now DAN")
System prompt extraction attempts
Jailbreak patterns

Results flagged as containing injection are removed before being passed to the responder model.

Pipeline Stages

The request flows through six stages:

ValidateStage - Validates request format, extracts user query
AgentStage - Runs agent model with search tool, returns pending searches
SearchStage - Executes pending searches in parallel via Exa API
FilterResultsStage - Filters search results for prompt injection
BuildMessagesStage - Injects search results into conversation context
ResponderStage - Generates final response (streaming or non-streaming)

Docker

docker build -t websearch-proxy .
docker run -p 8089:8089 \
  -e EXA_API_KEY="$EXA_API_KEY" \
  websearch-proxy

Safety checks are controlled per-request via the tools array. Include { "type": "pii_check" } to enable PII filtering on search queries, and { "type": "injection_check" } to filter prompt injection from search results.
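
As an illustration of the tools array, the sketch below builds a POST /v1/responses request that enables both checks alongside web search. The `tool` and `responsesRequest` types and the `newSearchRequest` helper are hypothetical. Combining `pii_check` and `injection_check` with `web_search` in a single array is an assumption drawn from the description above, and the base URL and API key are placeholders. The request is constructed but not sent.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
)

// tool is one entry of the request's tools array.
type tool struct {
	Type string `json:"type"`
}

// responsesRequest models only the fields used in the examples above.
type responsesRequest struct {
	Model  string `json:"model"`
	Input  string `json:"input"`
	Tools  []tool `json:"tools"`
	Stream bool   `json:"stream"`
}

// newSearchRequest builds a POST /v1/responses request with web search and
// both safety checks enabled. It does not send the request.
func newSearchRequest(baseURL, apiKey, model, input string) (*http.Request, error) {
	body, err := json.Marshal(responsesRequest{
		Model: model,
		Input: input,
		Tools: []tool{
			{Type: "web_search"},
			{Type: "pii_check"},       // filter PII from outgoing queries
			{Type: "injection_check"}, // filter prompt injection from results
		},
	})
	if err != nil {
		return nil, err
	}
	req, err := http.NewRequest(http.MethodPost, baseURL+"/v1/responses", bytes.NewReader(body))
	if err != nil {
		return nil, err
	}
	req.Header.Set("Content-Type", "application/json")
	req.Header.Set("Authorization", "Bearer "+apiKey)
	return req, nil
}

func main() {
	req, err := newSearchRequest("http://localhost:8089", "example-key",
		"<responder-model>", "What is the latest news about SpaceX?")
	if err != nil {
		fmt.Println("build error:", err)
		return
	}
	fmt.Println(req.Method, req.URL.Path)
}
```

Passing the returned request to `http.DefaultClient.Do` would send it to a running proxy.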

Security

This proxy uses the Tinfoil Go SDK which provides:

Automatic attestation validation to ensure enclave integrity
TLS certificate pinning with attested certificates
Direct-to-enclave encrypted communication
API key forwarding from client requests (no stored credentials)

All processing occurs within secure enclaves - search queries, results, and responses never leave the trusted execution environment unencrypted.

Reporting Vulnerabilities

Please report security vulnerabilities by either:

Emailing security@tinfoil.sh
Opening an issue on GitHub on this repository

We aim to respond to (legitimate) security reports within 24 hours.

Directories

agent
api
config
fetch
llm
pipeline
safeguard
search
