How it works
Every request goes through an ordered list of providers. Edgee tries each one in sequence, retrying transient failures before moving on. If all providers are exhausted without success, the error from the last attempt is returned to the caller.Provider ordering
Fallback order is determined automatically by each provider’s success rate, computed from recent request history. Providers with higher success rates are tried first. When multiple providers have the same score, they are shuffled randomly for load distribution. If you use BYOK keys, only your own provider keys are eligible — Edgee’s shared providers are not used as fallbacks. If no BYOK key is available for a model, shared providers are used instead.Retry behavior
Edgee distinguishes three categories of error:| Category | Errors | Behavior |
|---|---|---|
| Retry then fallback | Rate limit (429), Service unavailable (5xx) | Retry the same provider once, then fall back |
| Immediate fallback | Timeout (408, 504), Credential not found, Stream parse error | Skip retry, move to next provider immediately |
| Terminal | Invalid token (401), Configuration error | Return error immediately — no retry, no fallback |