litellm

tiennm99/litellm

Fork 0

mirror of https://github.com/tiennm99/litellm.git synced 2026-08-02 20:22:10 +00:00

Files

T

History

533eab4dbd fix(tests/vcr): make Redis cassette cache replay deterministically (zero VCR misses on consecutive runs) (#28826 )

* test(vcr): make Redis-backed cassettes replay deterministically across runs

- Pin LITELLM_LOCAL_MODEL_COST_MAP=True in the shared VCR harness so the
  per-test importlib.reload(litellm) no longer fetches the model cost map
  from raw.githubusercontent.com. That live fetch was being recorded into
  cassettes; for tests that subsequently skip it was the only recorded
  episode, so the persister refused to save it (skipped tests don't persist)
  and the test re-recorded it live every run (MISS:NOT_PERSISTED).

- Compare-time symmetric matcher tolerance for Google OAuth (ya29.*) tokens,
  observability/telemetry payloads, credential-exchange bodies, and volatile
  UUID/timestamp tokens, so existing cassettes select a recorded episode
  instead of growing past the 50-episode cap and re-recording live.

- Don't record fire-and-forget telemetry (langfuse/arize/otel/...) into
  non-telemetry tests' cassettes. Several modules set litellm.success_callback
  at import time, so observability logging is globally enabled and an async
  flush from the background logging worker lands in an unrelated test's VCR
  window, saved as a spurious MISS:RECORDED (observed: a Langfuse batch from
  another completion landing on test_lowest_latency_routing_buffer). Such a
  request now passes through live (telemetry hosts aren't real-spend hosts);
  tests that actually assert on telemetry keep recording it.

- Dedupe + cap the VCR diagnostic dump so the classification summary survives
  CircleCI's ~400KB step-output truncation.

- Stabilize a non-deterministic rate-limit test body; mark AWS Secrets Manager
  lifecycle tests VCR-incompatible (uniquely-named secrets can't be replayed).

- Mark test_router_text_completion_client VCR-incompatible: it fires 300
  identical requests to verify async-client reuse, but vcrpy patches the HTTP
  transport so replay never exercises the real connection pool the test
  validates, and recording 300 near-identical episodes overflows the
  50-episode cap (MISS:OVERFLOW every run). It hits a free mock endpoint.

- Mark the Vertex AI MaaS Mistral OCR tests (vertex_ai/mistral-ocr-2505)
  VCR-incompatible: the MaaS model is not provisioned in the CI GCP project,
  so the live :rawPredict call fails and the test skips every run, leaving no
  cassette to record (MISS:NOT_PERSISTED every run). Sibling direct-Mistral
  and Azure OCR tests are unaffected and still replay from cache.

* fix(tests/vcr): refresh cassette TTL on read so replayed cassettes don't expire

The Redis VCR persister loaded cassettes with a plain GET, which does not
touch the key's TTL. A cassette that is only ever replayed (HIT/NOOP, never
re-recorded) therefore expired exactly 24h after its last *write*, no matter
how often it was read. Whichever CI run happened to cross that boundary
re-recorded the cassette live and surfaced a spurious VCR MISS on otherwise
deterministic cassettes — the residual per-run flakiness floor (a different
random subset of read-only cassettes expiring each run).

Slide the expiry forward on every successful load (best-effort EXPIRE), so
any cassette used at least once per TTL window stays alive indefinitely and
the 2nd/3rd run of a day replays cleanly.

* fix(tests/vcr): recover from spurious GET-None for existing cassette keys

Under concurrent CI load, the persister's load GET was observed returning
None for a cassette key that demonstrably existed on the (single, non-
clustered) Redis master — an external monitor saw the key present with a
healthy TTL at the same instant the in-process client read None. Because
None is a valid GET result (not a RedisError), the retry-on-error client
config never engaged, so the cassette re-recorded live (a phantom
MISS:RECORDED); for flaky/networked tests the failed live call then
triggered a pytest rerun, which is why a rotating subset of otherwise
deterministic tests missed each run.

On a None result, re-check EXISTS and re-read once. If the key really
exists, use the recovered value and log [vcr-transient-miss-recovered]
(also counted in cassette_cache_health). A genuinely absent key (a new
cassette) still falls through to CassetteNotFoundError.

* chore(tests/vcr): TEMP diagnostic for persistent-miss cassette load path

Logs GET/EXISTS at load time for the three cassettes that re-record every
run despite being present in Redis, to capture what the in-process client
sees. To be reverted before merge.

* chore(tests/vcr): write load diagnostic to Redis (truncation-proof)

CI stdout truncates to the last ~400KB, dropping the early loaddbg lines
for the alphabetically-first failing test. Push the load probe to a Redis
list instead so it survives. To be reverted before merge.

* fix(tests/vcr): don't drop stored telemetry episodes during cassette load

Root cause of the residual per-run misses on present cassettes: vcrpy's
Cassette._load() replays each *stored* interaction through Cassette.append(),
which runs before_record_request on it — and a None return there silently
drops that episode. The telemetry-leak suppressor (_should_drop_telemetry_record)
returns None for telemetry requests, so when a non-telemetry-named test (or the
alphabetically-first test in a worker, whose _current_test_nodeid is still empty)
loaded a cassette containing a Langfuse ingestion episode, the episode was
dropped on read — forcing an endless live re-record (a phantom MISS:RECORDED on
a cassette that was demonstrably present in Redis). Verified by reproducing
Cassette._load() against the real cassette: empty/non-telemetry nodeid -> 0
episodes survive; with the guard -> 1 survives.

Fix: guard the suppressor with a thread-local set around Cassette._load (via a
small idempotent monkeypatch), so the drop only ever stops *new* incidental
telemetry from being recorded and never filters the existing cassette on read.

Also drops the speculative GET-None recovery + its diagnostics from the previous
commits: the load diagnostic showed GET returns the cassette bytes fine
(get=1440B), so the persister never returned a spurious None — the loss happened
later in vcrpy's append. The proven TTL-refresh-on-read fix is retained.

* fix(tests/vcr): drop incidental telemetry export POSTs to stop rotating async-flush misses

litellm's observability loggers flush on a background thread, so a Langfuse
ingestion POST scheduled by one telemetry test can fire mid-way through a
*later* telemetry-named test (after that test's own httpx mock has exited) and
be recorded by VCR as a phantom episode — a non-deterministic MISS:RECORDED /
PARTIAL that rotates onto a different telemetry test from run to run.

Telemetry export POSTs are fire-and-forget; no test asserts on a *recorded*
export response except the pass-through proxy test (which forwards a client POST
to Langfuse ingestion and replays its 207). So _should_drop_telemetry_record now
drops incidental export POSTs for every test except that one. Dropping returns
None (live fire-and-forget, never stored), so it can only turn a phantom miss
into a harmless live call, never the reverse; recorded read-back GETs that
telemetry tests assert on are matched by method and left untouched.

* fix(tests/vcr): restore assertion in test_banner_silent_when_vcr_disabled

The assertion that the banner is suppressed when VCR is disabled was
inadvertently moved into test_diagnostic_log_silent_when_no_dir when
the diagnostic-log tests were added, leaving the disabled-VCR test
verifying nothing.

Co-authored-by: Yassin Kortam <yassin@berri.ai>

---------

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
Co-authored-by: Yassin Kortam <yassin@berri.ai>

2026-05-26 11:30:44 -07:00

.litellm_cache

…

auto_router

[Feat] Backend Router - Add Auto-Router powered by semantic-router (#12955 )

2025-07-24 18:32:56 -07:00

example_config_yaml

test: test

2026-03-28 19:17:38 -07:00

test_configs

test: test

2026-03-28 19:17:38 -07:00

test_model_response_typing

…

azure_fine_tune.jsonl

…

azure_speech.mp3

[Feat] Add Azure AVA TTS integration (#15749 )

2025-10-20 16:52:23 -07:00

batch_job_results_furniture.jsonl

…

cache_unit_tests.py

fix: use fastuuid helper (#14903 )

2025-09-25 15:47:01 -07:00

conftest.py

fix(tests/vcr): make Redis cassette cache replay deterministically (zero VCR misses on consecutive runs) (#28826 )

2026-05-26 11:30:44 -07:00

create_mock_standard_logging_payload.py

chore(ci): modernize model references in tests and configs (#27856 )

2026-05-15 15:44:28 -07:00

data_map.txt

…

eagle.wav

…

example.jsonl

…

gettysburg.wav

…

large_text.py

…

model_cost.json

…

openai_batch_completions_router.jsonl

…

openai_batch_completions.jsonl

…

speech_vertex.mp3

…

stream_chunk_testdata.py

…

test_acompletion_fallbacks.py

…

test_acompletion.py

…

test_acooldowns_router.py

test: test

2026-03-28 19:17:38 -07:00

test_add_function_to_prompt.py

…

test_add_update_models.py

fix(tests): skip remaining real prisma DB tests in CI and related test suites

2026-02-20 13:25:42 -03:00

test_aim_guardrails.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_alangfuse.py

test: update key names

2026-03-28 21:13:16 -07:00

test_amazing_vertex_completion.py

test(vertex_ai): tolerate transient 500 in google maps grounding test (#28503 )

2026-05-21 17:01:49 -07:00

test_anthropic_prompt_caching.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_arize_ai.py

test: rename env var

2026-03-28 20:27:39 -07:00

test_arize_phoenix.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_assistants.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_async_fn.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_auth_utils.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_azure_anthropic_sync_post.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_azure_content_safety.py

…

test_azure_openai.py

test: test

2026-03-28 19:17:38 -07:00

test_azure_perf.py

test: test

2026-03-28 19:17:38 -07:00

test_basic_python_version.py

[Test] CI: add v2 migration resolver coverage with local Postgres

2026-04-21 14:40:11 -07:00

test_batch_completion_return_exceptions.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_batch_completions.py

replace retired claude-3-haiku-20240307 with claude-haiku-4-5-20251001 in local_testing part1 and router fallback tests

2026-04-20 16:10:45 -07:00

test_blocked_user_list.py

fix(tests): skip remaining real prisma DB tests in CI and related test suites

2026-02-20 13:25:42 -03:00

test_braintrust.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_budget_manager.py

…

test_cache_preset_key.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_caching_handler.py

fix(caching): replay openai/responses bridge cache hits as chat streams (#28158 )

2026-05-18 16:27:06 -07:00

test_caching_ssl.py

Merge main and resolve conflict in test_router_client_init.py

2026-03-30 18:44:33 -07:00

test_caching.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_class.py

test: test

2026-03-28 19:17:38 -07:00

test_completion_cost.py

test(fireworks): mock remaining live smoke tests

2026-05-15 22:28:27 -07:00

test_completion_with_retries.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_completion.py

chore(tests): migrate Bedrock CI to AWS account 941277531214 (#28728 )

2026-05-25 12:03:17 -07:00

test_config.py

test: test

2026-03-28 19:17:38 -07:00

test_cost_calc.py

Revert "Fix xdist test isolation: capture true defaults and poll instead of sleep"

2026-03-15 22:57:39 -07:00

test_custom_api_logger.py

…

test_custom_callback_input.py

fix(tests): replace shut-down gpt-4o-audio-preview with gpt-audio-1.5 (#28281 )

2026-05-19 14:48:30 -07:00

test_custom_llm.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_custom_logger.py

Mark test_redis_cache_completion_stream as flaky with retries

2026-03-15 20:44:18 -07:00

test_disk_cache_unit_tests.py

…

test_docker_no_network_on_deploy.py

build: migrate packaging, CI, and Docker from Poetry to uv (#25007 )

2026-04-09 11:46:23 -07:00

test_dual_cache.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_dynamic_rate_limit_handler.py

fix: use fastuuid helper (#14903 )

2025-09-25 15:47:01 -07:00

test_dynamodb_logs.py

…

test_embedding.py

[Test] Tests: Stop parametrizing API keys into pytest test IDs (#27249 )

2026-05-05 17:21:18 -07:00

test_exceptions.py

fix: cleanup tests

2026-03-30 16:24:35 -07:00

test_file_types.py

…

test_function_call_parsing.py

chore(tests): migrate Bedrock CI to AWS account 941277531214 (#28728 )

2026-05-25 12:03:17 -07:00

test_function_calling.py

chore(tests): migrate Bedrock CI to AWS account 941277531214 (#28728 )

2026-05-25 12:03:17 -07:00

test_function_setup.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_gcs_bucket.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_gcs_cache_unit_tests.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_gemini_reasoning_content.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_get_llm_provider.py

test: drop duplicate openrouter prefix-strip test

2026-04-25 18:06:25 -03:00

test_get_model_file.py

Revert "Merge pull request #16590 from Chesars/refactor/remove-backup-file-dry-principle"

2026-04-25 17:10:41 -03:00

test_get_model_info.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_get_optional_params_embeddings.py

fix(embeddings): allow dimensions param passthrough via allowed_openai_params for non-text-embedding-3 OpenAI models

2026-02-26 09:59:37 +05:30

test_get_optional_params_functions_not_supported.py

…

test_google_ai_studio_gemini.py

…

test_guardrails_ai.py

…

test_helicone_integration.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_http_parsing_utils.py

…

test_img_resize.py

…

test_lakera_ai_prompt_injection.py

…

test_langchain_ChatLiteLLM.py

…

test_langsmith.py

fix: use fastuuid helper (#14903 )

2025-09-25 15:47:01 -07:00

test_least_busy_routing.py

test: fixes because azure deactivated our account

2025-10-25 15:10:45 -07:00

test_litellm_max_budget.py

…

test_llm_guard.py

…

test_load_test_router_s3.py

fix tests

2025-10-25 10:19:24 -07:00

test_loadtest_router.py

test: test

2026-03-28 19:17:38 -07:00

test_logfire.py

…

test_logging.py

…

test_longer_context_fallback.py

…

test_lowest_cost_routing.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_lowest_latency_routing.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_lunary.py

…

test_max_tpm_rpm_limiter.py

…

test_mem_leak.py

…

test_mem_usage.py

fix tests

2025-10-25 10:19:24 -07:00

test_mock_request.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_model_alias_map.py

fix(test): scope ERROR log assertion to LiteLLM logger in test_model_alias_map

2026-04-29 03:48:41 +00:00

test_model_max_token_adjust.py

…

test_multiple_deployments.py

[Fix] TogetherAIConfig.get_supported_openai_params recursion

2026-04-16 17:20:58 -07:00

test_ollama_local_chat.py

…

test_ollama_local.py

…

test_ollama.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_openai_moderations_hook.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_opik.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_pass_through_endpoints.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_profiling_router.py

…

test_prometheus_service.py

[Release Fix] (#22411 )

2026-02-28 09:46:35 -08:00

test_prompt_caching.py

claude-sonnet-4-5-20250929 fix

2025-10-31 18:20:52 -07:00

test_prompt_injection_detection.py

test: test

2026-03-28 19:17:38 -07:00

test_promptlayer_integration.py

…

test_provider_specific_config.py

Litellm fix update bedrock models (#24947 )

2026-04-01 19:22:54 -07:00

test_pydantic_namespaces.py

…

test_pydantic.py

…

test_redis_batch_optimizations.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_register_model.py

Revert "test_update_model_cost_map_url"

2025-12-22 12:41:30 +05:30

test_responses_stream_cache_keys.py

fix(cache): persist and replay streamed Responses API requests (#24580 )

2026-05-01 11:55:36 +05:30

test_router_auto_router.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_router_batch_completion.py

test fix

2025-09-01 17:04:47 -07:00

test_router_budget_limiter.py

test: test

2026-03-28 19:17:38 -07:00

test_router_caching.py

test: test

2026-03-28 19:17:38 -07:00

test_router_client_init.py

test_router_init_azure_service_principal_with_secret_with_environment_variables

2026-03-30 21:15:53 -07:00

test_router_cooldown_handlers.py

test: test

2026-03-28 19:17:38 -07:00

test_router_custom_routing.py

Optimize CI: parallelize router and guardrails test jobs, fix test isolation

2026-03-14 22:54:44 -07:00

test_router_debug_logs.py

feat: routing groups ui

2026-05-04 18:09:14 -07:00

test_router_fallback_handlers.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_router_fallbacks.py

replace retired claude-3-haiku-20240307 with claude-haiku-4-5-20251001 in local_testing part1 and router fallback tests

2026-04-20 16:10:45 -07:00

test_router_get_deployments.py

Fix:add async_get_available_deployment_for_pass_through in code tests

2026-01-16 16:37:44 +05:30

test_router_max_parallel_requests.py

fix(tests/vcr): make Redis cassette cache replay deterministically (zero VCR misses on consecutive runs) (#28826 )

2026-05-26 11:30:44 -07:00

test_router_pattern_matching.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_router_retries.py

fix(tests): read CI_CD_DEFAULT_ANTHROPIC_MODEL env var instead of hardcoding model (#21781 )

2026-02-21 10:46:49 -08:00

test_router_timeout.py

Litellm fix update bedrock models (#24947 )

2026-04-01 19:22:54 -07:00

test_router_utils.py

test: test

2026-03-28 19:17:38 -07:00

test_router_with_fallbacks.py

…

test_router.py

test(vcr): drop dead 'from respx import MockRouter' imports

2026-05-13 00:32:03 +00:00

test_rules.py

…

test_sagemaker_nova_integration.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_sagemaker.py

chore(tests): migrate Bedrock CI to AWS account 941277531214 (#28728 )

2026-05-25 12:03:17 -07:00

test_scheduler.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_secret_detect_hook.py

…

test_spend_calculate_endpoint.py

test fix

2025-09-01 17:04:47 -07:00

test_stream_chunk_builder.py

fix(tests): replace shut-down gpt-4o-audio-preview with gpt-audio-1.5 (#28281 )

2026-05-19 14:48:30 -07:00

test_streaming.py

chore(tests): migrate Bedrock CI to AWS account 941277531214 (#28728 )

2026-05-25 12:03:17 -07:00

test_supabase_integration.py

…

test_team_config.py

…

test_text_completion.py

style: run black formatter on files from main merge

2026-04-17 13:02:59 -07:00

test_timeout.py

Litellm fix update bedrock models (#24947 )

2026-04-01 19:22:54 -07:00

test_together_ai.py

…

test_tpm_rpm_routing_v2.py

fix: drain logging worker in test_router_caching_ttl to remove flake

2026-04-23 14:48:02 -07:00

test_traceloop.py

…

test_ui_sso_helper_utils.py

…

test_unit_test_caching.py

style: black format test_unit_test_caching.py

2026-04-15 18:19:04 -07:00

test_update_spend.py

fix(tests): skip remaining real prisma DB tests in CI and related test suites

2026-02-20 13:25:42 -03:00

test_validate_environment.py

…

test_wandb.py

…

user_cost.json

…

vertex_ai.jsonl

…

vertex_batch_completions.jsonl

…

vertex_key.json

test: update to new vertex ai keys

2026-03-28 20:19:05 -07:00

whitelisted_bedrock_models.txt

Litellm fix update bedrock models (#24947 )

2026-04-01 19:22:54 -07:00