Rate Limits

Understand quotas, retry guidance, and headers returned by API v1.

Default Limits

ServoAgent enforces token-bucket rate limiting per API key, workspace, and endpoint class.

Burst capacity is available for short spikes, but sustained throughput follows your plan quota.

Every API response includes current rate state for adaptive clients.

X-RateLimit-Limit: 100
X-RateLimit-Remaining: 84
X-RateLimit-Reset: 1735689600
Retry-After: 9

On 429 responses, back off using Retry-After and apply jitter to avoid synchronized retries.

Write idempotent mutation requests where possible so retries are safe in network failure scenarios.