litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-07-04 15:07:22 +00:00

Author	SHA1	Message	Date
Krrish Dholakia	e17acd6b69	test: make tests smarter	2026-03-30 18:14:44 -07:00
Ishaan Jaffer	facb230fee	test_create_vertex_fine_tune_jobs_mocked	2026-03-30 18:01:23 -07:00
Krrish Dholakia	c7e2bfc577	fix: cleanup tests	2026-03-30 16:24:35 -07:00
Krrish Dholakia	4c00a14ce0	fix: fix ci/cd + handle oidc jwt tokens	2026-03-30 16:12:58 -07:00
Ishaan Jaffer	87debe6254	test_litellm_overhead_non_streaming	2026-03-30 15:20:53 -07:00
Krrish Dholakia	1fb677702d	test: update to new vertex ai keys	2026-03-28 20:19:05 -07:00
Krrish Dholakia	cdcab8a243	refactor: cleanup deprecated models	2026-03-28 19:39:11 -07:00
Krrish Dholakia	bc829d51f2	test: test	2026-03-28 19:17:38 -07:00
Seokjun Yang	d3afaf613d	Update tests/litellm_utils_tests/test_bedrock_token_counter.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2026-03-20 22:21:22 +09:00
Seokjun Yang	eb733702fc	Update tests/litellm_utils_tests/test_bedrock_token_counter.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2026-03-20 22:21:15 +09:00
stias	bc4608e718	fix(bedrock): respect api_base and aws_bedrock_runtime_endpoint in count_tokens endpoint The /v1/messages/count_tokens endpoint was hardcoding the Bedrock runtime URL, ignoring api_base and aws_bedrock_runtime_endpoint settings. This aligns it with invoke/converse handlers by using the existing get_runtime_endpoint() method for consistent endpoint resolution. Signed-off-by: stias <seokjun.yang@mycraft.kr>	2026-03-20 17:58:20 +09:00
yuneng-jiang	27d0ffea44	Fix flaky AWS secret manager tests by skipping on ThrottlingException ThrottlingException is a transient AWS rate-limit error unrelated to code correctness. Skip the test instead of failing the CI pipeline. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-15 18:53:18 -07:00
yuneng-jiang	3d45ba3edf	Fix flaky vertex_ai overhead test by mocking auth and HTTP calls Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-15 18:01:54 -07:00
yuneng-jiang	0fbfb8e772	fix(tests): remove test_oidc_circle_v1_with_amazon_fips that depends on external infra Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 10:33:52 -07:00
yuneng-jiang	b98ecd8c1d	fix(tests): quarantine flaky test_oidc_circle_v1_with_amazon The test fails with InvalidIdentityToken because the OIDC provider is no longer configured in the third-party AWS account (ai.moda). This matches the existing quarantine on test_oidc_circleci_with_azure. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 09:04:42 -07:00
Cursor Agent	177edb06ae	fix: stabilize 5 CI test failures - Vertex AI batch cost tests: replace removed gemini-1.5-flash-001 model with gemini-2.0-flash-001 in pricing lookups - MCP test_executes_tool_when_allowed: add server_id and auth_type attrs to StubServer to match new _resolve_allowed_mcp_servers_with_ip_filter - MCP M2M tests: infer oauth2_flow='client_credentials' in _execute_with_mcp_client when client_id/client_secret/token_url present (NewMCPServerRequest lacks oauth2_flow field) - Team list test: update mock find_many to filter by team_id per the current per-team query pattern in list_team - Azure DALL-E 3 health check: skip test due to 410 ModelDeprecated Co-authored-by: yuneng-jiang <yuneng-jiang@users.noreply.github.com>	2026-03-13 01:03:35 +00:00
yuneng-jiang	c9f7075690	Replace additional deprecated models across test files - tests/local_testing/test_completion_cost.py: - claude-3-5-sonnet-20240620 -> claude-sonnet-4-6 - gemini/gemini-1.5-flash-001 -> gemini/gemini-2.5-flash - tests/test_litellm/test_utils.py: - claude-3-5-sonnet-20240620 -> claude-sonnet-4-6 (VertexAI config test, proxy tests) - gemini-1.5-pro -> gemini-2.5-pro (pre_process_non_default_params) - gemini/gemini-1.5-pro -> gemini/gemini-2.5-pro (proxy tests) - tests/litellm_utils_tests/test_utils.py: - claude-3-opus-20240229 -> claude-sonnet-4-6 (trimming, vision tests) - gemini-pro -> gemini-2.5-pro (function calling test) - gemini-pro-vision -> gemini-2.5-flash (vision test) - gemini-1.5-pro -> gemini-2.5-pro (response schema test) - gemini/gemini-1.5-flash -> gemini/gemini-2.5-flash (function calling test) - gemini-1.5-pro -> gemini-2.5-pro (vision gemini test) - gpt-4-vision-preview -> gpt-4o (vision test) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 17:03:54 -07:00
yuneng-jiang	b314e8d20a	Merge pull request #20688 from BerriAI/litellm_budget_tier_enforcement_for_keys [Fix] Budget-linked keys never had spend reset	2026-03-06 20:44:58 -08:00
Sameer Kankute	f68cbc4c95	Fix test_perform_health_check_filters_by_model_id	2026-02-26 10:43:05 +05:30
Alejandro Tapia	80c09295ad	Merge upstream/main - resolve health check conflicts	2026-02-24 17:09:05 -08:00
Sean Marsh Glover	4652c73259	feat(proxy): limit concurrent health checks with health_check_concurrency (#20584 ) * staged first pass * black * Update litellm/proxy/health_check.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * simpler * restore cached logo * fix tests for perform_health_check max_concurrency arg * implement pr suggestion * and the helm chart * add configureable resources and probes to the deployment in the helm chart * more helm chart unittests * move some background healthcheck loggin to debug --------- Co-authored-by: Sean Glover <sglover@athenahealth.com> Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2026-02-24 08:16:59 -08:00
Ryan Crabbe	ea32ad72c6	Merge origin/main into perf/callback-registration-routing Resolve conflicts: - logging_callback_manager.py: keep PR's MAX_CALLBACKS, _is_async_callable, Callable type - test_utils.py: keep both TestCallbackAsyncSyncSeparation and TestMetadataNoneHandling	2026-02-21 12:40:23 -08:00
Ishaan Jaff	afdf70be73	fix(test): update claude model name in test_get_valid_models_from_dynamic_api_key (#21771 )	2026-02-21 10:15:23 -08:00
Shivam Rawat	c45a17bad4	Merge branch 'main' into litellm_budget_tier_enforcement_for_keys	2026-02-17 16:00:04 -08:00
Tomu Hirata	43ba7d0a07	Add test case for Databricks Meta LLaMA 3.1 70B instruct model in content parsing tests Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>	2026-02-17 15:36:00 +09:00
shivam	9f7f190670	resolved greptile comment	2026-02-16 10:24:06 -08:00
shivam	0117b35a6b	added more tests, fixed tests	2026-02-16 09:59:19 -08:00
Alejandro Tapia	82f6d0fe43	healthcheck-model_id-fix: There was quite a bit of code that needed to be changed since health checks were entirely keyed by model name. This includes, proxy logic, the dashboard, and even networking, because model name was the identifier everywhere. All changed files had tests added to them, which are passing with no regressions.	2026-02-12 14:47:09 -08:00
Ryan Crabbe	aaaf7f3b6c	perf: move async/sync callback separation from per-request to registration time The three loops in function_setup that called is_async_callable() on every callback each request were redundant after the first request. Move the async/sync routing into LoggingCallbackManager.add_litellm_*_callback() so it happens once at registration time instead of on every request.	2026-02-07 12:10:38 -08:00
Yuta Saito	695fbf4ec5	feat: hashicorp vault rotate support	2026-01-23 17:32:55 +09:00
Yuta Saito	898cc3ff4f	test: update langfuse trace_id tests to use litellm_trace_id	2026-01-22 06:19:43 +09:00
Sameer Kankute	e315e4cd7c	fix litellm/tests/litellm_utils_tests/test_utils.py	2026-01-21 18:50:02 +05:30
Ishaan Jaff	ddebdd47bc	[Feat] Add Support for Claude Code Max/OAuth 2 on LiteLLM AI Gateway (#19453 ) * fix count_tokens_with_anthropic_api * remove outdated file * fix ANTHROPIC_TOKEN_COUNTING_BETA_VERSION * refactor: get_token_counter * init test suite for token counter * init token counters * fix: fix pyrightI * fix Code QA issues * feat: add OAUTH handling ant * feat: Oauth handling Ant * test anthopic common utils * fix code QA * docs	2026-01-20 17:21:17 -08:00
Alexsander Hamir	15c3bc219b	[Refactor] Add CI enforcement for O(1) operations in _get_model_cost_key to prevent performance regressions (#19052 ) * Optimize _get_model_cost_key to avoid expensive scans - Remove expensive O(n) scan fallback that was causing 42.87% CPU overhead - Only scan when size mismatch detected (O(1) check) - Add warning in docstring: Only O(1) lookup operations are acceptable - Clean up comments to be more concise - Keep stale entry rebuild for pop() case (only triggers when stale entry found) This fixes the performance issue where the scan was being triggered on every failed lookup, causing severe CPU overhead during router operations. * Add code quality check to enforce O(1) operations in _get_model_cost_key - Add check_get_model_cost_key_performance.py to statically analyze _get_model_cost_key - Detects O(n) operations (loops, comprehensions, problematic function calls) - Recursively checks called functions to find nested O(n) operations - Allows conditional O(n) rebuilds in helper functions (_rebuild_model_cost_lowercase_map, _handle_stale_map_entry_rebuild, _handle_new_key_with_scan) * Integrate _get_model_cost_key performance check into CI pipeline - Add check_get_model_cost_key_performance.py to check_code_and_doc_quality job - Ensures O(1) requirement is enforced in CI to prevent performance regressions * Remove unused performance test and clean up utils.py - Remove test_get_model_info_performance.py (no longer needed) - Remove extra blank line in utils.py * Document allowed helper functions and exception process in _get_model_cost_key - Add documentation listing allowed helper functions with O(n) operations - Explain why these are acceptable (conditionally called) - Add instructions for adding new exceptions to check_get_model_cost_key_performance.py * Fix docstring detection and type checker error in performance check - Add proper docstring tracking to skip docstring content (fixes false positive for 'map' in docstring) - Add None check for docstring_quote to fix type checker error - Restore _handle_new_key_with_scan to allowed_helpers list * Remove check_get_model_cost_key_performance from CI pipeline - Temporarily remove the performance check from CI to avoid blocking builds * Restore performance check and remove memory leak tests from CI - Add back check_get_model_cost_key_performance.py to CI pipeline - Remove memory_leak_tests job that was causing port conflicts * Remove extra blank line in CI config	2026-01-13 17:08:03 -08:00
Alexsander Hamir	a1dd3ead4d	[Perf] Remove bottleneck causing high CPU usage & overhead under heavy load (#19049 )	2026-01-13 15:22:09 -08:00
Sameer Kankute	28c7659d3d	Potential fix for code scanning alert no. 3954: Clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2026-01-05 16:06:10 +05:30
Sameer Kankute	6337ea876b	fix: Add custom llm provider to get_llm_provider when sent via UI	2026-01-05 12:57:30 +05:30
Alexsander Hamir	5534038e93	Fix CI: Revert security scan changes and add GitGuardian ignore rules (#18358 )	2025-12-22 17:03:53 -08:00
Ishaan Jaffer	6112160a16	Revert "[Fix] Security - Remove example API keys with high entropy (#18255 )" This reverts commit `24edbccf5c`.	2025-12-20 20:48:11 +05:30
Alexsander Hamir	24edbccf5c	[Fix] Security - Remove example API keys with high entropy (#18255 )	2025-12-19 10:09:50 -08:00
Yuta Saito	b7b22d559e	feat: allow per-team Vault overrides when storing keys	2025-12-18 06:21:53 +09:00
Alexsander Hamir	958c190134	Fix flanky tests (#17665 ) * Fix test_delete_polling_removes_from_cache mock setup - Mock async_delete_cache to properly execute the real implementation path - Ensures init_async_client() is called and delete() is invoked on the returned client - Fixes AssertionError: Expected 'delete' to be called once. Called 0 times. * fix: resolve timeout in add_model_tab test by mocking useProviderFields hook - Mock useProviderFields hook to prevent network calls and React Query delays - Use waitFor to properly handle async operations - Test now passes reliably without 10s timeout * fix: add test timeout to prevent CI timeout failure - Add 15 second timeout to 'should display Test Connect and Add Model buttons' test - Test takes ~6 seconds locally, but CI was timing out at default 5 second limit - Ensures test has sufficient time to complete in CI environment * test: quarantine flaky test_oidc_circleci_with_azure Quarantine test that fails with 401 Unauthorized from Azure OAuth. The test is flaky and blocks CI builds. Marked with @pytest.mark.skip until Azure authentication can be fixed or migrated to our own account.	2025-12-08 12:21:26 -08:00
Chetan Choudhary	d8ac213c6a	Native Webhook Integration Sumologic (#17630 ) * Fix: Support generic_api_compatible_callbacks.json in callback initialization - Added check in _add_custom_callback_generic_api_str to load callbacks from generic_api_compatible_callbacks.json - Added SumoLogic webhook integration to generic_api_compatible_callbacks.json - Fixes bug where callbacks in JSON file were not being loaded * Added 3 unit tests for JSON callback loading	2025-12-07 23:23:39 -08:00
Alexsander Hamir	73075c7d24	fix: add retry logic for flaky Azure image generation health check test (#17595 ) - Add missing @pytest.mark.asyncio decorator - Implement retry logic with exponential backoff (3 retries) - Only retry on transient Azure internal server errors - Fail immediately on non-transient errors This fixes the flaky test_azure_img_gen_health_check which was failing due to transient Azure internal server errors that are outside our control.	2025-12-06 08:11:52 -08:00
Ishaan Jaffer	eaa7e61f57	test fixes	2025-12-05 17:12:01 -08:00
Colin Lin	0bd144103d	[stripe] simplify opus test	2025-12-05 10:56:09 -05:00
Colin Lin	3046b9f163	[stripe] opus budget thinking	2025-12-05 10:56:02 -05:00
Ishaan Jaffer	92cca5d515	test_cyberark_write_and_read_secret	2025-12-03 19:43:29 -08:00
Ishaan Jaff	57544f1662	[Feat] Adds IAM role assumption support for AWS Secret Manager (#16887 ) * add AWS fields for KeyManagementSettings * docs IAM roles * use aws iam auth on secret manager v2 * fix: load_aws_secret_manager * test_secret_manager_with_iam_role_settings	2025-11-20 12:38:48 -08:00
Cesar Garcia	5e70c78b94	fix(cost-tracking): support base_model lookup in litellm_metadata for Responses API (#16778 ) Cost tracking was failing for Responses API when using custom deployment names with base_model configuration. The issue occurred because: - Chat Completions API stores model_info in 'metadata' - Responses API stores model_info in 'litellm_metadata' - Cost calculator only checked 'metadata', missing Responses API costs Changes: - Updated _get_base_model_from_metadata() to check both metadata locations - Added comprehensive unit tests covering all scenarios - Maintains backward compatibility (metadata takes precedence) Fixes #16772	2025-11-18 19:53:18 -08:00

1 2 3

136 Commits