mirror of
https://github.com/tiennm99/litellm.git
synced 2026-06-26 19:08:09 +00:00
bb63de2f82
Concurrent requests via run_in_executor + asyncio.gather caused a race condition where more requests slipped through the rate limiter than expected, leading to flaky test failures (e.g. 3 successes instead of 2 with rpm_limit=2).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
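
A minimal sketch of how this kind of race can produce 3 successes with rpm_limit=2. NaiveLimiter, run_concurrent and run_sequential are hypothetical names used only for illustration and are not LiteLLM's implementation. The sketch also assumes the limiter does a check-then-increment without a lock, and uses asyncio.sleep(0) as a deterministic stand-in for the window that real threads started with run_in_executor would open between the check and the update.

```python
import asyncio


class NaiveLimiter:
    """Hypothetical check-then-increment limiter; not LiteLLM's code."""

    def __init__(self, rpm_limit: int) -> None:
        self.rpm_limit = rpm_limit
        self.count = 0

    async def allow(self) -> bool:
        # Step 1: read the counter and decide.
        under_limit = self.count < self.rpm_limit
        # Yield to the event loop. This stands in for the gap between the
        # check and the update, during which other requests can run.
        await asyncio.sleep(0)
        # Step 2: act on the (possibly stale) decision.
        if under_limit:
            self.count += 1
        return under_limit


async def run_concurrent(n: int, rpm_limit: int) -> int:
    # All requests are scheduled together, so every one of them performs
    # its check before any of them increments the counter.
    limiter = NaiveLimiter(rpm_limit)
    results = await asyncio.gather(*(limiter.allow() for _ in range(n)))
    return sum(results)


async def run_sequential(n: int, rpm_limit: int) -> int:
    # Each request finishes before the next starts, so the counter is
    # always up to date when it is checked.
    limiter = NaiveLimiter(rpm_limit)
    successes = 0
    for _ in range(n):
        successes += await limiter.allow()
    return successes


concurrent_successes = asyncio.run(run_concurrent(3, rpm_limit=2))
sequential_successes = asyncio.run(run_sequential(3, rpm_limit=2))
print(concurrent_successes, sequential_successes)
```

In this sketch the concurrent run lets all three requests through, while the sequential run stops at the limit. Issuing requests one at a time is one way to make such a test deterministic; guarding the check and the increment with a lock would be another.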