Store and access your full request and response data for debugging, analytics, and compliance.

Data Retention

LLM Gateway offers configurable data retention policies that allow you to store full request and response payloads. This enables powerful debugging capabilities, detailed analytics, and compliance with data governance requirements.

Retention Levels

LLM Gateway supports two retention levels that can be configured per organization:

Level	Description	Storage Cost
Metadata Only	Stores request metadata (timestamps, model, tokens, costs) without full payloads. Default.	Free
Retain All Data	Stores complete request and response payloads including messages, tool calls, and attachments.	$0.01/1M tokens

Metadata-only retention is enabled by default and provides usage analytics without additional storage costs.

Storage Pricing

When full data retention is enabled, storage is billed at $0.01 per 1 million tokens. This rate applies to:

Input tokens (prompt)
Cached input tokens
Output tokens (completion)
Reasoning tokens

Storage costs are calculated per request and displayed in the cost_usd_data_storage field of the response. See Cost Breakdown for details on tracking costs programmatically.

Example Cost Calculation

For a request with:

1,000 input tokens
500 output tokens
1,500 total tokens

Storage cost = 1,500 / 1,000,000 × $0.01 = $0.000015

Configuring Retention

Data retention is configured at the organization level in your dashboard settings:

Navigate to Organization Settings → Policies
Select your preferred Data Retention Level
Save changes

Changing retention settings applies to new requests only. Existing stored data follows the retention period active when it was created.

Retention Periods

Data is retained for 30 days for all users. Enterprise plans can have custom retention periods. After the retention period expires, data is automatically deleted.