Changelog

Released, dated, accounted for.

Every release that touches an externally-visible surface is logged here. Internal-only changes (replica rebalances, observability plumbing) live in our SRE log.

Gemini 3.1 Pro joins the catalog.

The 1M-context Gemini frontier model is now generally available across all routed regions. Streaming, tools, vision, and structured JSON all supported on day one.

  • new gemini-3.1-pro in the model catalog
  • new 1M-token context window across all regions
  • change Default vision tier on Gemini moved from 3-Flash to 3.1-Pro

Zero-retention routes generally available.

Set idc-no-retention: true on any request to force a route where prompt and completion bytes never leave volatile memory. Available across all frontier and balanced-tier models.

  • new Zero-retention header support
  • change Audit log entries now include a retention_class field

Claude Opus 4.7 ships.

The new Anthropic frontier model is live across all routed regions. Claude Sonnet 4.6 (shipped February) remains our recommended balanced default for agentic loops thanks to its tool-use stability across long multi-turn sessions.

  • new claude-opus-4.7
  • change Default frontier-tier suggestion in docs updated to Opus 4.7

APAC · Sydney region open for production traffic.

Sixth routed region. Australian and New Zealand customers now route locally instead of via Singapore — typical p50 latency improvement is 60–90 ms.

  • new ap-syd region hint
  • new AU data-residency lock available

Per-tenant rate budgets.

Scale and Enterprise plans can now define rate budgets per downstream tenant. Useful for multi-tenant SaaS products that want to prevent one heavy account from starving the rest.

  • new Management API: POST /v1/management/tenants
  • new Dashboard: tenant budget editor
  • fix Token counts on cached prompt responses are now exact, not estimated

OpenTelemetry export.

Stream per-request traces directly to your existing OTLP/gRPC collector. No proprietary dashboard, no parallel observability stack.

  • new OTLP/gRPC trace export
  • change Webhook payloads now include the parent span ID
  • fix Region label normalization in S3 sink (was us-east-1, now us-east)

DeepSeek-V3.2 and Qwen3-Max added.

Two more open-weights flagships join the efficient tier. Both are available on shared capacity and as dedicated replicas in EU and APAC.

  • new deepseek-v3.2
  • new qwen3-max

BYO-provider routing.

Bring your own upstream key under our schema, routing, and observability layer. Keep your negotiated rates with the provider; pay us only the per-request gateway fee.

  • new BYO-provider key management
  • new Hybrid routing: idclinks pools + your keys, picked by quota
  • fix Improved error mapping for upstream 5xx → gateway 502/503

Want the changelog in your inbox?

One email per release, never marketing. Reply to subscribe — we don't run a list signup form.