Peter Lee
|
92648f9ba9
|
fix(agents): broaden 402 temporary-limit detection and allow billing cooldown probe (#38533)
Merged via squash.
Prepared head SHA: 282b9186c6f48fcdbf0c81c49f739e5e9ed2df23
Co-authored-by: xialonglee <22994703+xialonglee@users.noreply.github.com>
Co-authored-by: altaywtf <9790196+altaywtf@users.noreply.github.com>
Reviewed-by: @altaywtf
|
2026-03-08 10:27:01 +03:00 |
|
Peter Steinberger
|
2891c6c93c
|
refactor(agents): dedupe model fallback probe failure tests
|
2026-03-07 17:58:31 +00:00 |
|
Altay
|
6e962d8b9e
|
fix(agents): handle overloaded failover separately (#38301)
* fix(agents): skip auth-profile failure on overload
* fix(agents): note overload auth-profile fallback fix
* fix(agents): classify overloaded failures separately
* fix(agents): back off before overload failover
* fix(agents): tighten overload probe and backoff state
* fix(agents): persist overloaded cooldown across runs
* fix(agents): tighten overloaded status handling
* test(agents): add overload regression coverage
* fix(agents): restore runner imports after rebase
* test(agents): add overload fallback integration coverage
* fix(agents): harden overloaded failover abort handling
* test(agents): tighten overload classifier coverage
* test(agents): cover all-overloaded fallback exhaustion
* fix(cron): retry overloaded fallback summaries
* fix(cron): treat HTTP 529 as overloaded retry
|
2026-03-07 01:42:11 +03:00 |
|
Vignesh Natarajan
|
d45353f95b
|
fix(agents): honor explicit rate-limit cooldown probes in fallback runs
|
2026-03-05 20:03:06 -08:00 |
|
Ramez
|
acbb93be48
|
fix(agents): comprehensive quota fallback fixes - session overrides + surgical cooldown logic (#23816)
Merged via /review-pr -> /prepare-pr -> /merge-pr.
Prepared head SHA: e6f2b4742b82b9fe44a7e103170c2f96565b09c5
Co-authored-by: ramezgaberiel <844893+ramezgaberiel@users.noreply.github.com>
Co-authored-by: gumadeiras <5599352+gumadeiras@users.noreply.github.com>
Reviewed-by: @gumadeiras
|
2026-02-25 20:35:40 -05:00 |
|
Vignesh Natarajan
|
5c7c37a02a
|
Agents: infer auth-profile unavailable failover reason
|
2026-02-22 16:10:32 -08:00 |
|
Peter Steinberger
|
3c75bc0e41
|
refactor(test): dedupe agent and discord test fixtures
|
2026-02-22 20:04:51 +00:00 |
|
Peter Steinberger
|
ad1072842e
|
test: dedupe agent tests and session helpers
|
2026-02-22 17:11:54 +00:00 |
|
sebslight
|
d224776ffb
|
refactor(agents): extract cooldown probe decision helper
|
2026-02-16 08:10:52 -05:00 |
|
Ítalo Souza
|
39bb1b3322
|
fix: auto-recover primary model after rate-limit cooldown expires (#17478) (#18045)
Merged via /review-pr -> /prepare-pr -> /merge-pr.
Prepared head SHA: f7a7865727a9aee0aaa3d929cce65dc46c3db234
Co-authored-by: PlayerGhost <28265945+PlayerGhost@users.noreply.github.com>
Co-authored-by: sebslight <19554889+sebslight@users.noreply.github.com>
Reviewed-by: @sebslight
|
2026-02-16 08:03:35 -05:00 |
|