litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-28 05:08:21 +00:00

Author	SHA1	Message	Date
Chesars	4e6e1d8de8	merge: resolve conflicts with upstream staging (bedrock + mcp tests) Keep both sets of tests: upstream's OAuth2 token injection test and our case-insensitive tool matching tests. Use upstream's version of the bedrock output_config test (more comprehensive).	2026-03-12 13:40:16 -03:00
Chesars	feed274aa3	Reapply "feat: add model_cost aliases expansion support" This reverts commit `3d2df7e8b5`.	2026-03-12 13:36:57 -03:00
Sameer Kankute	982f3917c5	Fix test_standard_logging_payload	2026-03-12 18:35:01 +05:30
Sameer Kankute	15d873e204	Fix update deprecated model test	2026-03-12 18:34:20 +05:30
Chesars	1be6b31e2f	merge: resolve conflicts between main and litellm_oss_staging_03_11_2026	2026-03-12 09:38:31 -03:00
yuneng-jiang	82de82f1b6	Fix test_completion_cost_prompt_caching gemini parametrization gemini/gemini-2.5-flash lacks cache_creation_input_token_cost in the model cost map, causing a TypeError when the test multiplies cache_creation_input_tokens by None. Use claude-haiku-4-5 instead, which has the required prompt caching cost fields. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 17:12:15 -07:00
yuneng-jiang	c9f7075690	Replace additional deprecated models across test files - tests/local_testing/test_completion_cost.py: - claude-3-5-sonnet-20240620 -> claude-sonnet-4-6 - gemini/gemini-1.5-flash-001 -> gemini/gemini-2.5-flash - tests/test_litellm/test_utils.py: - claude-3-5-sonnet-20240620 -> claude-sonnet-4-6 (VertexAI config test, proxy tests) - gemini-1.5-pro -> gemini-2.5-pro (pre_process_non_default_params) - gemini/gemini-1.5-pro -> gemini/gemini-2.5-pro (proxy tests) - tests/litellm_utils_tests/test_utils.py: - claude-3-opus-20240229 -> claude-sonnet-4-6 (trimming, vision tests) - gemini-pro -> gemini-2.5-pro (function calling test) - gemini-pro-vision -> gemini-2.5-flash (vision test) - gemini-1.5-pro -> gemini-2.5-pro (response schema test) - gemini/gemini-1.5-flash -> gemini/gemini-2.5-flash (function calling test) - gemini-1.5-pro -> gemini-2.5-pro (vision gemini test) - gpt-4-vision-preview -> gpt-4o (vision test) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 17:03:54 -07:00
Cesar Garcia	3d2df7e8b5	Revert "feat: add model_cost aliases expansion support"	2026-03-10 22:39:19 -03:00
Sameer Kankute	30fde1de7f	fix(tests): update cache hit redaction assertion to expect choices format Made-with: Cursor	2026-03-10 12:14:24 +05:30
yuneng-jiang	c1d042c2a3	Fix flaky test_stream_chunk_builder_openai_audio_output_usage The test calls OpenAI's gpt-4o-audio-preview model which sometimes doesn't return usage data in the streaming response. Fixed by: - Adding @pytest.mark.flaky(retries=5, delay=2) for retry handling - Fixing usage_obj loop to check chunk.usage is not None - Skipping gracefully when OpenAI doesn't return usage data Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-09 17:18:00 -07:00
Ishaan Jaff	e8a7116899	fix(tests): fix repeating chunk and audio usage streaming tests (#23061 ) - Replace ModelResponse(stream=True) with ModelResponseStream in test_unit_test_custom_stream_wrapper_repeating_chunk — stream=True stores delta as a plain dict causing AttributeError in CustomStreamWrapper - Accept MidStreamFallbackError alongside InternalServerError in the repeating-chunk safety check assertion - Add @pytest.mark.flaky(retries=3) to the live OpenAI audio output usage test	2026-03-07 16:18:51 -08:00
Ishaan Jaff	a50a84c16c	fix(tests): update redaction assertion + remove flaky qwen3 streaming test (#23062 ) test_standard_logging_payload_audio: the response field in standard_logging_object is now a ModelResponse choices dict (since `d84e5e381a`), not {text: redacted-by-litellm}. Update both audio and non-audio variants to check choices[0].message.content instead. Audio is still correctly redacted - the new code creates a fresh ModelResponse with no audio field, so audio bytes never appear in the payload. test_partner_models_httpx_streaming: remove qwen3-coder-480b (us-south1) from the parametrize list - same treatment as llama-4-scout which was removed earlier. The endpoint is unavailable in CI and the test has been consistently failing.	2026-03-07 16:07:14 -08:00
Sameer Kankute	dc2d465c7e	Add support for wildcards models for files api	2026-03-04 11:48:42 +05:30
Chesars	ec16bd3509	merge: resolve conflict with upstream/main in presidio.py Take upstream's refactored PII handling with _unmask_pii_text and _process_response_for_pii helpers. Add missing StreamingChoices import.	2026-03-02 17:40:22 -03:00
Sameer Kankute	fc41f46f0f	Fix vertex ai function calls	2026-03-02 19:43:24 +05:30
Sameer Kankute	e40e913622	Fix vertex ai function calls	2026-03-02 19:42:18 +05:30
Ishaan Jaff	29e3fd5d79	[Release Fix] (#22411 ) * fix(lint): suppress PLR0915 for 3 complex methods that exceed 50-statement limit - streaming_iterator.py: _process_event (84 statements) - transformation.py: translate_messages_to_responses_input (51 statements) - transformation.py: transform_realtime_response (54 statements) Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(mypy): resolve type errors in public_endpoints, user_api_key_auth, common_utils, transformation - public_endpoints.py: fix _cached_endpoints type annotation - user_api_key_auth.py: accept Optional[str] for end_user_id parameter - common_utils.py: add NewProjectRequest/UpdateProjectRequest to Union type - transformation.py: add ChatCompletionRedactedThinkingBlock and list[Any] to content type Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(proxy-extras): bump version to 0.4.50 and sync schema - Bump litellm-proxy-extras from 0.4.49 to 0.4.50 - Sync schema.prisma with main proxy schema - Includes new LiteLLM_ClaudeCodePluginTable model - Includes new @@index([startTime, request_id]) on SpendLogs - Update version references in requirements.txt and pyproject.toml Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(router): use string id in test_add_deployment and add defensive str() in register_model - Change test to use string '100' instead of int 100 for model_info.id - Add str() conversion in register_model to prevent AttributeError on non-string keys Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(security): update minimatch to 10.2.4 to fix CVE-2026-27903 and CVE-2026-27904 - Run npm audit fix in docs/my-website - Updates minimatch from 10.2.1 to 10.2.4 (fixes HIGH severity ReDoS vulnerabilities) Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(test): update realtime guardrail test assertions to match actual guardrail behavior - test_text_message_blocked_by_guardrail_no_ai_response: allow guardrail's own block message text in response.done (previously expected empty content) - test_voice_transcript_blocked_by_guardrail: allow guardrail to send response.cancel + block message + response.create flow (previously expected no response.create) Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix: revert proxy-extras version in requirements.txt and pyproject.toml The litellm-proxy-extras 0.4.50 is not published to PyPI yet, so consumer references must stay at 0.4.49. Only the source package pyproject.toml should be bumped to 0.4.50 for the publish_proxy_extras CI job. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix: make transcript delta check optional in voice guardrail test The guardrail sends an error event (guardrail_violation) when blocking voice transcripts; it does not always produce transcript deltas. Remove the assertion requiring response.audio_transcript.delta since the error event is the primary signal that blocked content was handled. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * Add missing env keys to documentation: LITELLM_MAX_STREAMING_DURATION_SECONDS and LITELLM_USE_CHAT_COMPLETIONS_URL_FOR_ANTHROPIC_MESSAGES These two environment variables were used in code but not documented in the environment variables reference section of config_settings.md, causing the test_env_keys.py CI test to fail. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * Fix 13 mypy type errors across 6 files - in_flight_requests_middleware.py: Fix type: ignore error codes from [union-attr] to [attr-defined], add [arg-type] for Gauge *kwargs - transformation.py: Add [assignment] ignore for output_format reassignment, add fallback empty string for tool use id to fix arg-type - responses/main.py: Remove redundant type annotation on second secret_fields assignment to fix no-redef - streaming_iterator.py: Add [assignment] ignores for intermediate cache token assignments - handler.py: Add [typeddict-item] ignore for AnthropicMessagesRequest construction from dict - public_endpoints.py: Add [arg-type] ignore for _load_endpoints() return type mismatch with SupportedEndpoint model Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> fix: add auth overrides to spend tracking tests, fix realtime guardrail assertion, update UI minimatch - Add app.dependency_overrides for user_api_key_auth in 4 spend tracking tests that were returning 401 Unauthorized (error_code, error_message, error_code_and_key_alias, key_hash) - Fix realtime guardrail test to check ANY error event for guardrail_violation instead of just the first (OpenAI may send its own errors first) - Update ui/litellm-dashboard/package-lock.json to fix minimatch vulnerability Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * Fix failing MCP e2e and create_mcp_server UI tests Test 1 (test_independent_clients_no_shared_session): - Add allow_all_keys: true to MCP servers in test config. With master_key and no DB, get_allowed_mcp_servers returned empty, causing 0 tools and 403 on tool calls. allow_all_keys bypasses per-key restrictions. - Add asyncio.sleep(0.5) between client connections to allow MCP SDK TaskGroup cleanup and avoid ExceptionGroup on connection close (MCP #915). Test 2 (create_mcp_server 'auth value is provided'): - Use userEvent.setup({ delay: null }) for instant keystrokes to avoid timeout from default typing delay on CI. - Increase per-test timeout to 15000ms for CI environments. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix: stabilize proxy unit tests for parallel execution - test_response_polling_handler: add xdist_group to prevent heavy import OOM - test_db_schema_migration: use temp dir for worker isolation, sync schema.prisma index - test_custom_tokenizer_bug: use lighter tokenizer to prevent OOM in parallel Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix: add auth overrides to more spend tracking and model info tests - Fix test_ui_view_spend_logs_pagination missing auth override (401) - Fix test_view_spend_tags missing auth override (401) - Fix test_view_spend_tags_no_database missing auth override (401) - Fix test_empty_model_list.py to use app.dependency_overrides instead of patch() for FastAPI dependency injection auth Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(test): use patch.object for aiohttp transport test to work in parallel execution The @patch decorator was not intercepting the static method call in parallel xdist workers. Using patch.object on the directly-imported class is more reliable. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(security): update minimatch from 10.2.1 to 10.2.4 in Dockerfile The Docker image was explicitly pinning minimatch@10.2.1 which has HIGH severity ReDoS vulnerabilities (GHSA-7r86-cg39-jmmj, GHSA-23c5-xmqv-rm74). Update to 10.2.4 which includes fixes for both CVEs. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(ui): prevent MCP and TeamInfo test timeouts on CI - Add userEvent.setup({ delay: null }) to all tests using userEvent in both files - Add timeout: 15000 to tests with significant user interaction (typing, multiple clicks) - Fixes: create_mcp_server Bearer Token test, TeamInfo cancel button test Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix: stabilize parallel test execution and aiohttp transport test - test_aiohttp_handler: rewrite transport test to not rely on static method mock (consistently fails in parallel xdist workers) - test_proxy_cli: add xdist_group to prevent timeout during heavy imports - test_swagger_chat_completions: add xdist_group to prevent timeout Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(security): add serialize-javascript override to fix GHSA-5c6j-r48x-rmvq Add npm override for serialize-javascript>=7.0.3 in docs/my-website to fix HIGH severity RCE vulnerability via RegExp.flags. Also bump minimatch override to >=10.2.4. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * Fix flaky tests: remove broken Vertex model, add retries for Anthropic - Remove vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas from test_partner_models_httpx_streaming - consistently returns 400 BadRequest - Add @pytest.mark.flaky(retries=6, delay=10) to test_function_call_parsing for transient Anthropic API overload errors - Add @pytest.mark.flaky(retries=6, delay=10) to test_openai_stream_options_call for transient Anthropic InternalServerError Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(ci): add xdist_group(proxy_heavy) to prevent OOM in parallel proxy tests - Add pytestmark = pytest.mark.xdist_group('proxy_heavy') to test_proxy_utils.py - Change test_db_schema_migration.py from schema_migration to proxy_heavy group - Add @pytest.mark.xdist_group('proxy_heavy') to test_proxy_server.py::test_health Groups heavy proxy tests to run on same worker, avoiding worker OOM crashes. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * Fix vertex AI qwen global endpoint test to mock vertexai module import The test_vertex_ai_qwen_global_endpoint_url test was failing because the VertexAIPartnerModels.completion() method tries to 'import vertexai' before any of the mocked code runs. In environments without google-cloud-aiplatform installed, this import fails with a VertexAIError(status_code=400). Fix by: - Adding patch.dict('sys.modules', {'vertexai': MagicMock()}) to mock the vertexai module import - Adding vertex_ai_location parameter to the acompletion call for completeness Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(ci): add xdist_group to health endpoint and watsonx tests for parallel stability - test_health_liveliness_endpoint: add xdist_group('proxy_health') to prevent timeout - test_watsonx_gpt_oss tests: add xdist_group('watsonx_heavy') to prevent mock interference Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(test): pre-populate WatsonX IAM token cache to prevent parallel test interference The watsonx prompt transformation test was failing in parallel execution because litellm.module_level_client.post mock was being interfered with by other tests. Pre-populating the IAM token cache avoids the HTTP call entirely. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(test): add spend data polling with retries for e2e pass-through tests - test_vertex_with_spend.test.js: Replace 15s fixed wait with polling loop (up to 6 attempts, 10s apart) for spend data to appear in DB - Increase test timeout from 25s to 90s to accommodate polling - base_anthropic_messages_tool_search_test.py: Add flaky(retries=3) for streaming test that depends on live Anthropic API Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(ci): reduce parallel workers from 8 to 4 for proxy tests to prevent OOM - litellm_proxy_unit_testing_part2: -n 8 -> -n 4 - litellm_mapped_tests_proxy_part2: -n 8 -> -n 4, timeout 60 -> 120 - Worker crashes consistently caused by too many parallel proxy tests each loading the full FastAPI app and heavy dependency tree Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(db): add migration for SpendLogs composite index (startTime, request_id) The @@index([startTime, request_id]) was added to schema.prisma but had no corresponding migration. This caused test_aaaasschema_migration_check to fail because prisma migrate diff detected the missing index. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(db): add migration for MCP available_on_public_internet default change to true The schema.prisma changed the default for available_on_public_internet from false to true, but no migration was created. This caused the schema migration test to detect drift. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(test): increase server wait time and add retry to flaky external API tests - test_basic_python_version.py: increase server startup wait from 60s to 90s for slower CI environments (fixes installing_litellm_on_python_3_13) - test_a2a_agent.py: add flaky(retries=3, delay=5) for non-streaming test that depends on live A2A agent endpoint Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(test): add flaky retries to all intermittent external API tests for 0-fail CI Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> * fix(test): add auth overrides to file endpoint tests that return 500 The test_target_storage tests were getting 500 because the FastAPI auth dependency wasn't overridden. Added app.dependency_overrides for proper auth bypass in test environment. Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com> --------- Co-authored-by: Cursor Agent <cursoragent@cursor.com> Co-authored-by: Ishaan Jaff <ishaan-jaff@users.noreply.github.com>	2026-02-28 09:46:35 -08:00
Cesar Garcia	fc7bc9147f	Merge pull request #21629 from Chesars/fix/pydantic-serialization-warnings fix(types): remove StreamingChoices from ModelResponse, use ModelResponseStream	2026-02-27 17:48:33 -03:00
Sameer Kankute	691927f9c9	fix(embeddings): allow dimensions param passthrough via allowed_openai_params for non-text-embedding-3 OpenAI models When calling non-text-embedding-3 models routed through the openai provider (e.g. nvidia/llama-3.2-nv-embedqa-1b-v2), passing `dimensions` previously raised an UnsupportedParamsError unconditionally. This fix threads `allowed_openai_params` through the embedding call stack so that providers can opt-in to passing `dimensions` by including it in the list. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-26 09:59:37 +05:30
Julio Quinteros Pro	bb63de2f82	fix(tests): make RPM limit test sequential to avoid race condition Concurrent requests via run_in_executor + asyncio.gather caused a race condition where more requests slipped through the rate limiter than expected, leading to flaky test failures (e.g. 3 successes instead of 2 with rpm_limit=2). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 16:34:52 -03:00
Ishaan Jaffer	52294029a0	test_vertex_ai_gemini_2_5_pro_streaming	2026-02-21 16:59:22 -08:00
Ishaan Jaff	d7b22d340b	fix(tests): move test_router_azure_acompletion to llm_translation testing (#21837 )	2026-02-21 14:41:53 -08:00
Ishaan Jaff	c810f5cd63	fix(tests): replace fake France Azure endpoint in test_router_azure_acompletion (#21818 )	2026-02-21 13:26:59 -08:00
Ishaan Jaff	1ed529092f	fix(test): replace flaky test_vertex_ai_gemini_audio_ogg with mocked version (#21807 ) Previously made a real Vertex AI call with a Wikimedia URL that intermittently failed with URL_REJECTED-REJECTED_FC_TIMEOUT. Now mocks HTTPHandler.post and VertexBase._ensure_access_token so the test verifies the translation (OGG -> file_data with audio/ogg mime_type) without any real network calls. Runs in ~0.36s instead of ~60s.	2026-02-21 12:49:06 -08:00
Ishaan Jaff	a0e76f4f25	fix(tests): mock httpx in RPM limit pass-through tests (#21793 ) * fix(tests): isolate flaky files endpoint tests from global proxy state * test(secret_managers): add mocked unit test for write/read JSON secret cycle * fix(tests): mock httpx in rpm limit pass-through tests to avoid real Cohere API calls	2026-02-21 11:55:48 -08:00
Ishaan Jaff	a5e886de79	fix(tests): read CI_CD_DEFAULT_ANTHROPIC_MODEL env var instead of hardcoding model (#21781 ) * fix(tests): read CI_CD_DEFAULT_ANTHROPIC_MODEL env var in bedrock KB tests * fix(tests): read CI_CD_DEFAULT_ANTHROPIC_MODEL env var in test_router * fix(tests): read CI_CD_DEFAULT_ANTHROPIC_MODEL env var in test_router_retries * fix(tests): read CI_CD_DEFAULT_ANTHROPIC_MODEL env var in test_router_timeout	2026-02-21 10:46:49 -08:00
Harshit Jain	f3ff9bf54f	Merge pull request #21009 from BerriAI/litellm_docker-count-no-req add tests for hotpath & docker container	2026-02-21 20:57:28 +05:30
Sameer Kankute	4d6b7699cc	Fix sonnet 3.7 tests	2026-02-20 17:29:16 -08:00
Sameer Kankute	adb91d442a	Fix: test_pass_through_endpoint_bing	2026-02-20 17:28:17 -08:00
Sameer Kankute	36fd14357c	FIx: replace deprecated claude-3-7-sonnet-20250219 with claude-4-sonnet-20250514	2026-02-20 17:27:59 -08:00
yuneng-jiang	65dc7556a8	[Fix] Fix web search model info regression, deprecated prompt caching model, undocumented env keys - Revert test_anthropic_web_search_in_model_info to use claude-3-5-haiku-latest (model info test doesn't make API calls, so the -latest alias is fine here) - Replace claude-3-7-sonnet-20250219 with claude-sonnet-4-5-20250929 in test_anthropic_prompt_caching.py (10 instances) - Include pending doc updates for COMPETITOR_LLM_TEMPERATURE and MAX_COMPETITOR_NAMES env vars in config_settings.md Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-20 17:26:58 -08:00
yuneng-jiang	c27b65d09e	[Fix] Replace deprecated claude-3-7-sonnet-20250219 with claude-sonnet-4-5-20250929 in test_completion Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-20 17:26:02 -08:00
Chesars	0f20976efa	fix(types): remove StreamingChoices from ModelResponse, use ModelResponseStream ModelResponse.choices was typed as List[Union[Choices, StreamingChoices]] which caused Pydantic serialization warnings and false linting errors. Now that ModelResponseStream exists for streaming, narrow ModelResponse.choices to List[Choices] and migrate all ModelResponse(stream=True) call sites to use ModelResponseStream() instead.	2026-02-20 17:47:42 -03:00
Julio Quinteros Pro	1dc3f1e530	fix(tests): skip remaining real prisma DB tests in CI and related test suites Add @pytest.mark.skip to all test functions that use the real `prisma_client` fixture (requiring an external PostgreSQL connection) across 7 test files. Files updated: - tests/proxy_unit_tests/test_proxy_server.py (5 tests) - tests/proxy_admin_ui_tests/test_key_management.py (11 tests) - tests/proxy_admin_ui_tests/test_role_based_access.py (5 tests) - tests/proxy_admin_ui_tests/test_usage_endpoints.py (3 tests) - tests/local_testing/test_blocked_user_list.py (2 tests) - tests/local_testing/test_add_update_models.py (1 test) - tests/local_testing/test_update_spend.py (1 test) Total: 28 new skip markers added. Note: tests using mock_prisma_client (properly mocked) are unaffected. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 13:25:42 -03:00
Harshit Jain	19c8f78c2f	fix: update docker test file to right path	2026-02-19 18:44:28 +05:30
Ishaan Jaff	323aed7211	fix: CI failures - missing env key doc + streaming test (#21510 ) * docs: add DATABRICKS_API_KEY to environment settings reference * fix: streaming test usage check on Pydantic model * fix: mock litellm.proxy.proxy_server in test_skip_server_startup	2026-02-18 18:20:32 -08:00
Ishaan Jaffer	3987198fca	test_partner_models_httpx_streaming	2026-02-14 12:41:35 -08:00
Ishaan Jaffer	e373da6653	test_partner_models_httpx_streaming	2026-02-14 12:09:18 -08:00
Ishaan Jaffer	e5bbafba62	test_completion_openrouter_reasoning_effort	2026-02-14 12:04:05 -08:00
yuneng-jiang	c41459c8e3	fixing mistral model deprecation, cloud zero transform bug	2026-02-12 17:53:04 -08:00
pb	713d3022ae	fix(scheduler): remove orphan entries from queue - causing memory leak. (#20866 ) * fix(scheduler): remove timed-out requests from queue to prevent memory leak Fixes #20059 * fix(scheduler): use actual model param instead of hardcoded gpt-3.5-turbo in schedule_acompletion * trigger CLA recheck --------- Co-authored-by: Piyush Bhawsar <piyush100x@Piyushs-MacBook-Pro-3.local>	2026-02-10 22:34:52 -08:00
Alexsander Hamir	ebce0e5f8c	[Release - 02/10/2026] v1.81.10-nightly	2026-02-10 16:26:30 -08:00
Sameer Kankute	bffd7956a6	Merge pull request #19132 from BerriAI/litellm_gemini_schema_fix fix: Preserved nullable object fields by carrying schema properties	2026-02-10 15:28:18 +05:30
shin-bot-litellm	537f7af583	fix(test): update deprecated gemini embedding model (#20621 ) Replace text-embedding-004 with gemini-embedding-001. The old model was deprecated and returns 404: 'models/text-embedding-004 is not found for API version v1beta' Co-authored-by: Shin <shin@openclaw.ai>	2026-02-06 18:35:40 -08:00
Ishaan Jaff	887a907e42	[Fix] Guardrails API - Ensure OpenAI Moderations Guard works with OpenAI Embeddings (#20523 ) * init OpenAIEmbeddingsHandler * init apply_guardrail * use apply guardrails for OpenAI moderations * test_embeddings_handler_string_input * test_openai_moderation_guardrail_apply_guardrail * fix typing * test_openai_moderation_responses_api_input_field * test fixes	2026-02-05 14:40:15 -08:00
Sameer Kankute	f1df5ea9a9	Merge pull request #20252 from BerriAI/main merge main	2026-02-02 14:54:50 +05:30
shin-bot-litellm	db120c524b	fix(test): accept both AuthenticationError and InternalServerError in batch_completion test (#20186 ) The test uses an invalid API key to verify that batch_completion returns exceptions rather than raising them. However, depending on network conditions, the error may be: - AuthenticationError: API properly rejected the invalid key - InternalServerError: Connection error occurred before API could respond Both are valid outcomes for this test case. Co-authored-by: shin-bot-litellm <shin-bot-litellm@users.noreply.github.com>	2026-01-31 13:36:27 -08:00
shin-bot-litellm	fea40925cf	test: remove hosted_vllm from OpenAI client tests (#20163 ) hosted_vllm no longer uses the OpenAI client, so these tests that mock the OpenAI client are not applicable to hosted_vllm. Removes hosted_vllm from: - test_openai_compatible_custom_api_base - test_openai_compatible_custom_api_video	2026-01-31 10:10:45 -08:00
Sameer Kankute	5ac3f75996	Add disable flahg for anthropic gemini cache translation	2026-01-30 14:58:10 +05:30
Sameer Kankute	df072979e5	Merge branch 'main' into litellm_oss_staging_01_28_2026	2026-01-29 17:39:42 +05:30

1 2 3 4 5 ...

987 Commits