Is OpenRouter Reliable? An Honest Review for Production Use (2026)

TL;DR: OpenRouter works well for prototyping and low-stakes workloads. For production, the honest answer is: no SLA, three documented outages in eight months (35-50 minutes each), a 5.5% credit purchase fee, and misleading error codes during infrastructure failures. Whether that’s acceptable depends entirely on your uptime requirements.

OpenRouter is the easiest way to access 300+ models through one API — but “easy” and “production-grade” aren’t the same thing, and the gap matters when your app is down at 6 AM.

What OpenRouter Actually Promises

OpenRouter does not publish a formal SLA. There is no contractual uptime guarantee, no credits for downtime, and no tiered reliability offering. Their status page at status.openrouter.ai shows incident history, but that’s documentation after the fact, not a commitment.

What they do offer: automatic failover to alternate providers when an upstream model is rate-limited or unavailable, continuous provider health monitoring, and an OpenAI-compatible API across 300+ models from 60+ providers. For most developers, that’s genuinely useful. The question is whether it’s enough when things go wrong.

The Outage Record: Three Incidents in Eight Months

This is the part most OpenRouter reviews skip. Let’s go through the documented incidents.

August 28, 2025 — 50-minute database outage. A database failure took the service down for roughly 50 minutes starting at 5:40 AM Eastern. The root cause was a database-layer failure, not an upstream provider issue. This was an OpenRouter infrastructure problem.

February 17, 2026 — 38-minute outage. A third-party caching dependency failed, dropping all database connections. From 5:27 to 5:40 AM UTC, about 20% of requests failed. Then from 5:40 to 6:05 AM UTC, failure rates jumped to 80-90%. Total impact: 38 minutes.

February 19, 2026 — 35-minute outage. Two days later, the same caching layer failed again. Partial failures from 7:36 to 7:42 AM UTC, then near-total downtime from 7:42 to 8:11 AM UTC.

The February incidents had a particularly frustrating failure mode: OpenRouter returned 401 “User not found” errors during what was actually an infrastructure failure. Developers spent time checking their API keys and account status when nothing on their side was wrong. OpenRouter has since fixed this — infrastructure failures now return 503 — but the incident illustrates what happens when a routing layer adds its own failure modes on top of the underlying providers.
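A practical client-side defense is to classify status codes before reacting: retry transient 5xx/429 responses with backoff, and treat a 401 during a suspected incident with more skepticism than usual. A minimal sketch; the retry policy here is an assumption, not OpenRouter guidance:

```python
import time

# Status codes worth retrying (assumed policy, not OpenRouter guidance).
RETRYABLE = {429, 500, 502, 503, 504}

def classify(status_code: int) -> str:
    """Map an HTTP status code to a coarse action for the caller."""
    if 200 <= status_code < 300:
        return "ok"
    if status_code in RETRYABLE:
        return "retry"          # transient: back off and try again
    if status_code == 401:
        return "check-auth"     # real key problem, or (historically) an infra failure
    return "fail"               # everything else is a hard error

def call_with_backoff(send, max_attempts: int = 4) -> int:
    """Retry send() (which returns an HTTP status code) on transient failures."""
    delay = 0.5
    for _ in range(max_attempts):
        status = send()
        if classify(status) != "retry":
            return status
        time.sleep(delay)
        delay *= 2              # exponential backoff
    return status
```

The key design choice is that a 401 gets its own bucket: before rotating keys at 6 AM, check the status page.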

Three outages in eight months, each lasting 35-50 minutes, with no SLA. If your application needs four-nines uptime, that’s a problem.

Latency Overhead: 25ms or 40ms?

OpenRouter’s homepage claims ~25ms of added latency at the edge. Their own documentation says ~40ms under typical production conditions. The honest answer is that independent benchmarks of API gateway overhead with meaningful sample sizes are scarce; most published numbers come from the vendors themselves.

What’s not in dispute: there is overhead. Every request goes through OpenRouter’s routing layer before reaching the model provider. For interactive applications where you’re already waiting 500ms-2s for a model response, 25-40ms is noise. For high-frequency, latency-sensitive workloads, it compounds.
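If those milliseconds matter for your workload, the only trustworthy number is one you measure yourself: time identical requests through the gateway and directly against the provider, then compare medians. A sketch of the harness; the request functions are whatever two calls you want to compare:

```python
import time
import statistics

def time_calls(fn, n: int = 20) -> list[float]:
    """Return n wall-clock timings of fn() in milliseconds."""
    samples = []
    for _ in range(n):
        start = time.perf_counter()
        fn()                      # e.g. the same completion request via either endpoint
        samples.append((time.perf_counter() - start) * 1000)
    return samples

def added_overhead_ms(via_gateway: list[float], direct: list[float]) -> float:
    """Median-to-median difference; medians resist outlier requests."""
    return statistics.median(via_gateway) - statistics.median(direct)
```

Run enough samples to smooth out network jitter; a single-digit sample size will swing more than the 15ms gap between the two published figures.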

The 5.5% Fee You Might Have Missed

OpenRouter charges a 5.5% fee on credit card purchases (5.0% on cryptocurrency), with a minimum charge of $0.80 per transaction. Credits expire after 365 days.

This isn’t hidden — it’s in their FAQ — but it’s easy to overlook when you’re focused on per-token pricing. On $1,000 of monthly API spend, that’s $55 in platform fees before you’ve made a single request. At $10,000/month, it’s $550.

The fee structure also means the “pass-through pricing” framing requires a footnote: yes, OpenRouter passes through provider token prices, but you’re paying 5.5% on top of every dollar you load into the system.
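The arithmetic is simple enough to sanity-check in a few lines, using the FAQ figures above:

```python
def credit_fee(load_amount: float, rate: float = 0.055, minimum: float = 0.80) -> float:
    """Fee in dollars for loading `load_amount` of credits (5.5% card rate, $0.80 floor)."""
    return round(max(load_amount * rate, minimum), 2)

# $1,000/month loaded by card: $55 in fees before the first request.
monthly = credit_fee(1000)       # 55.0
annual = monthly * 12            # 660.0 per year at that spend level
```

Note the $0.80 minimum makes small top-ups proportionally worse: loading $10 costs 8% in fees, not 5.5%.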

For comparison, ofox.ai charges no credit purchase fee — you pay for tokens used, not for loading credits. On the same $1,000/month spend, that’s $55 back in your pocket.

Model Availability: The Routing Layer Problem

OpenRouter’s value proposition is access to 300+ models through one endpoint. The flip side is that model availability depends on upstream providers, and the routing layer can introduce its own inconsistencies.

A documented pattern: open-source models accessed through OpenRouter sometimes produce lower-quality outputs than the same models accessed directly. The likely causes include provider-side load balancing (routing to lower-spec hardware under load), model version drift (the model ID you request may not always map to the same weights), and rate-limit handling that silently degrades rather than failing loudly.

This isn’t unique to OpenRouter — any routing layer has this problem. But it’s worth knowing before you benchmark a model through OpenRouter and assume the results reflect the model’s actual capability.
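One mitigation is OpenRouter’s provider routing preferences, which let you pin a request to specific upstream providers and disable silent fallbacks, so a benchmark run comes from a known backend. A sketch of the request payload; the provider slug is a placeholder, and the exact field names should be checked against OpenRouter’s provider routing documentation:

```python
# Hypothetical chat payload pinning one upstream provider, no fallbacks.
# The "provider" block follows OpenRouter's documented routing preferences;
# "example-provider" is a placeholder slug, not a real provider name.
payload = {
    "model": "meta-llama/llama-3.1-70b-instruct",   # illustrative model ID
    "messages": [{"role": "user", "content": "Hello"}],
    "provider": {
        "order": ["example-provider"],   # try this provider first
        "allow_fallbacks": False,        # fail loudly instead of silently rerouting
    },
}
```

Disabling fallbacks trades availability for consistency, which is exactly the trade you want during a benchmark and probably not the one you want in production.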

When OpenRouter Makes Sense

None of this means OpenRouter is bad. For the right use cases, it’s genuinely the fastest path to production:

  • Prototyping and experimentation — access to 300+ models with one API key, no per-provider account setup
  • Model comparison — easy to swap models in a single config change
  • Low-stakes workloads — internal tools, batch jobs, non-critical pipelines where 35 minutes of downtime per incident is acceptable
  • Teams without dedicated infrastructure — routing, failover, and provider management are handled for you

The problems emerge when you need production-grade reliability, cost predictability at scale, or tight latency budgets.

The Alternative: Direct Access via ofox

If OpenRouter’s reliability record or fee structure is a concern, the practical alternative is an API gateway that offers the same OpenAI-compatible interface without the credit fee and with a cleaner failure model.

ofox.ai provides access to Claude, GPT, Gemini, DeepSeek, Qwen, and other frontier models through a single endpoint at https://api.ofox.ai/v1. Migration from OpenRouter is a two-line change:

from openai import OpenAI

client = OpenAI(
    base_url="https://api.ofox.ai/v1",
    api_key="your-ofox-key"
)

No credit purchase fee, no expiring credits, and the same OpenAI SDK compatibility you’re already using. See the full model list for what’s currently available.
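After that change, requests keep the standard Chat Completions shape; only the model ID varies. A sketch, with an illustrative model ID rather than one confirmed from the model list:

```python
# Same request shape as any OpenAI-compatible endpoint; swap in a model ID
# from the gateway's model list ("claude-sonnet" here is illustrative).
request = {
    "model": "claude-sonnet",
    "messages": [
        {"role": "user", "content": "Summarize this changelog in one line."}
    ],
}
# With a client configured as shown above:
# response = client.chat.completions.create(**request)
# print(response.choices[0].message.content)
```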

For a broader comparison of OpenRouter alternatives including self-hosted options, see 7 Best OpenRouter Alternatives in 2026.

The Verdict

OpenRouter is reliable enough for most developers most of the time. Until it isn’t: during the February incidents, developers got a 401 error that made the failure look like their own fault.

Here’s the honest breakdown:

| Factor | OpenRouter | What it means |
| --- | --- | --- |
| SLA | None | No contractual uptime guarantee |
| Outages (8 months) | 3 incidents, 35-50 min each | ~2 hours of downtime |
| Latency overhead | 25-40ms | Acceptable for most use cases |
| Credit fee | 5.5% (card) / 5.0% (crypto) | $55/month on $1K spend |
| Credit expiry | 365 days | Unused credits are lost |
| Error codes | Fixed in Feb 2026 | Was returning 401 for infra failures |

For prototyping: use OpenRouter. The model breadth and ease of setup are hard to beat.

For production workloads where downtime has a cost: factor in the fee, the lack of SLA, and the outage history before committing. The AI API aggregation landscape has matured enough that you have real options.


Outage data sourced from OpenRouter’s official announcements and BigGo Finance reporting. Pricing data from OpenRouter’s FAQ and checkthat.ai, verified April 2026.