
OpenSLA

OpenSLA is a vendor-neutral specification for API service level agreements. It defines what you promise to your consumers — throughput, latency, quotas, and pricing — in a machine-readable format that the gateway enforces automatically.

An OpenSLA document describes the contract between an API producer and its consumers:

| Section | What It Specifies |
| --- | --- |
| Metadata | API name, version, provider details |
| Guarantees | P95/P99 latency targets, availability percentage |
| Quotas | Daily/monthly/quarterly RU limits |
| Rate limits | Requests per minute (RPM), transactions per second (TPS) |
| Cost | RU cost rate per operation, currency |

Producers define multiple SLA tiers to support different consumer needs:

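| Tier | Rate Limit | RU Quota | Latency P95 | Price |
| --- | --- | --- | --- | --- |
| Free | 10 RPM | 1,000 RU/month | Best effort | Free |
| Standard | 100 RPM | 50,000 RU/month | 200ms | Per-unit |
| Premium | 1,000 RPM | 500,000 RU/month | 100ms | Per-unit |
| Enterprise | Custom | Unlimited | 50ms | Custom |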

Tiers are ordered — the system knows which tier is “above” the current one, enabling automatic upgrade recommendations.
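As a minimal sketch of what "ordered" could mean in practice, the tiers from the table can be kept in a list sorted from lowest to highest, so that the tier "above" is simply the next element. The `Tier` class, its field names, and `next_tier` are hypothetical names chosen for illustration and are not part of the OpenSLA specification.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Tier:
    # Hypothetical field names; None stands for "custom", "unlimited" or "best effort".
    name: str
    rpm: Optional[int]
    ru_per_month: Optional[int]
    latency_p95_ms: Optional[int]


# Ordered lowest to highest, mirroring the tier table.
TIERS = [
    Tier("Free", 10, 1_000, None),
    Tier("Standard", 100, 50_000, 200),
    Tier("Premium", 1_000, 500_000, 100),
    Tier("Enterprise", None, None, 50),
]


def next_tier(current: str) -> Optional[Tier]:
    """Return the tier directly above `current`, or None if it is the top tier."""
    names = [t.name for t in TIERS]
    idx = names.index(current)  # raises ValueError for an unknown tier name
    return TIERS[idx + 1] if idx + 1 < len(TIERS) else None
```

With this layout, `next_tier("Standard")` yields the Premium tier, and the top tier has nothing above it.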

For producers:

  1. Define your tiers when creating or updating an API
  2. Set per-tier rate limits, RU quotas, latency guarantees, and pricing
  3. Each tier maps to a specific set of gateway enforcement rules

For consumers:

  1. Browse available tiers in the developer portal or marketplace
  2. Select a tier manually, or let Apiway recommend one based on estimated usage
  3. The selected tier drives the subscription’s rate limits, RU quota, and cost model
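How Apiway computes its recommendation is not described here. One plausible approach, shown purely as an assumption, is to pick the lowest tier whose rate limit and RU quota both cover the consumer's estimated usage. The function name `recommend_tier` and its parameters are hypothetical.

```python
# (name, RPM limit, monthly RU quota), lowest tier first.
# None means custom or unlimited, as in the Enterprise row of the tier table.
TIERS = [
    ("Free", 10, 1_000),
    ("Standard", 100, 50_000),
    ("Premium", 1_000, 500_000),
    ("Enterprise", None, None),
]


def recommend_tier(est_peak_rpm: int, est_monthly_ru: int) -> str:
    """Return the lowest tier that covers both usage estimates (assumed heuristic)."""
    for name, rpm, ru in TIERS:
        rpm_ok = rpm is None or est_peak_rpm <= rpm
        ru_ok = ru is None or est_monthly_ru <= ru
        if rpm_ok and ru_ok:
            return name
    return TIERS[-1][0]  # fall back to the top tier


print(recommend_tier(est_peak_rpm=50, est_monthly_ru=20_000))
```

A real recommender would probably also weigh latency targets and price, which this sketch ignores.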

When a consumer consistently hits their tier’s limits:

  1. After three or more budget or rate-limit breaches within 30 days, a ConsumerSlaUpgradeRecommended governance event is raised
  2. The event includes the recommended next tier
  3. The consumer can upgrade with one click, which starts a governance flow
  4. Capacity planning validates the new tier has headroom
  5. New limits take effect after governance approval
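The trigger in step 1 can be sketched as a count of breaches inside a rolling 30-day window. The event name comes from the text above. The payload fields (`currentTier`, `recommendedTier`, `breachCount`) and the function `upgrade_event` are assumptions made for illustration, not the documented event schema.

```python
from datetime import datetime, timedelta
from typing import Iterable, Optional

WINDOW = timedelta(days=30)
THRESHOLD = 3  # "3+ breaches" from the steps above
TIER_ORDER = ["Free", "Standard", "Premium", "Enterprise"]


def upgrade_event(current_tier: str,
                  breaches: Iterable[datetime],
                  now: datetime) -> Optional[dict]:
    """Build a recommendation event if enough recent breaches occurred, else None."""
    # Keep only breaches that fall inside the last 30 days.
    recent = [b for b in breaches if timedelta(0) <= now - b <= WINDOW]
    idx = TIER_ORDER.index(current_tier)
    if len(recent) < THRESHOLD or idx + 1 >= len(TIER_ORDER):
        return None  # too few breaches, or already on the top tier
    return {
        "type": "ConsumerSlaUpgradeRecommended",
        "currentTier": current_tier,              # hypothetical field
        "recommendedTier": TIER_ORDER[idx + 1],   # hypothetical field
        "breachCount": len(recent),               # hypothetical field
    }
```

Breaches older than 30 days simply drop out of the count, so a consumer who stops hitting limits no longer triggers the recommendation.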

OpenSLA isn’t just documentation — the gateway enforces it:

| SLA Element | Gateway Enforcement |
| --- | --- |
| Rate limit (RPM) | RateLimitStage: fixed window, 429 on breach |
| RU quota | RuMeteringStage: tracks per subscription, 402 on exhaustion |
| TPS (spike arrest) | RateLimitStage: per-second throttling |
| Latency | Measured and reported; Assurance validates against targets |
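To make the first two rows concrete, here is a minimal sketch of a fixed-window RPM check followed by RU metering. It is not the gateway's actual RateLimitStage or RuMeteringStage code. The `Subscription` class, the `admit` function, and the choice not to count rejected requests against the window are all assumptions.

```python
from dataclasses import dataclass


@dataclass
class Subscription:
    rpm_limit: int
    ru_quota: int
    ru_used: int = 0
    window_start: int = -1  # index of the current one-minute window
    window_count: int = 0   # requests admitted in that window


def admit(sub: Subscription, now_s: float, ru_cost: int = 1) -> int:
    """Return the HTTP status a gateway might answer with for one request."""
    window = int(now_s // 60)
    if window != sub.window_start:
        # Fixed window: the counter resets when a new minute starts.
        sub.window_start, sub.window_count = window, 0
    if sub.window_count >= sub.rpm_limit:
        return 429  # rate limit breached
    if sub.ru_used + ru_cost > sub.ru_quota:
        return 402  # RU quota exhausted
    sub.window_count += 1
    sub.ru_used += ru_cost
    return 200
```

A fixed window is simple to reason about but allows a burst of up to twice the limit across a window boundary, which is one reason a separate per-second spike arrest is useful.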

Response headers expose the current state to consumers:

RateLimit-Limit: 100
RateLimit-Remaining: 73
X-RU-Limit: 50000
X-RU-Remaining: 41200
X-RU-Cost: 1
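On the consumer side, these headers can be read to decide when to slow down. The sketch below assumes the five header names shown above carry plain integers. The function `quota_state` and the keys of its result are hypothetical.

```python
from typing import Mapping, Optional


def quota_state(headers: Mapping[str, str]) -> dict:
    """Summarise remaining rate-limit and RU budget from response headers."""
    h = {k.lower(): v for k, v in headers.items()}  # header names are case-insensitive
    rate_limit = int(h["ratelimit-limit"])
    rate_remaining = int(h["ratelimit-remaining"])
    ru_limit = int(h["x-ru-limit"])
    ru_remaining = int(h["x-ru-remaining"])
    ru_cost = int(h["x-ru-cost"])
    # How many more calls of the same cost fit into the remaining RU budget.
    calls_left: Optional[int] = ru_remaining // ru_cost if ru_cost > 0 else None
    return {
        "rate_remaining_fraction": rate_remaining / rate_limit,
        "ru_remaining_fraction": ru_remaining / ru_limit,
        "calls_left_at_this_cost": calls_left,
    }


state = quota_state({
    "RateLimit-Limit": "100",
    "RateLimit-Remaining": "73",
    "X-RU-Limit": "50000",
    "X-RU-Remaining": "41200",
    "X-RU-Cost": "1",
})
```

A client could, for example, start throttling itself once either fraction drops below a threshold of its choosing.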

OpenSLA tiers are the foundation of API economics:

  • Each tier has a defined RU cost rate
  • Revenue is tracked per tier — producers see which tiers generate the most revenue
  • The tier structure IS the monetisation plan
  • The Wealth Engine reports revenue by tier across the entire API portfolio