litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-07-05 17:06:54 +00:00

Author	SHA1	Message	Date
Sameer Kankute	9e1275b76c	Merge branch 'main' into litellm_staging_01_19_2026	2026-01-20 19:19:36 +05:30
Sameer Kankute	cd96c8cbb0	Fix:test_aaaaazure_tenant_id_auth	2026-01-20 17:39:08 +05:30
Emerson Gomes	13d887a275	Fix queue persistence to Redis (#19304 ) * Fix queue persistence to Redis * add test	2026-01-19 19:01:34 -08:00
Cesar Garcia	d30c25af21	feat(gemini): use responseJsonSchema for Gemini 2.0+ models (#19314 ) * feat(gemini): add opt-in support for responseJsonSchema Add support for Gemini's native responseJsonSchema parameter which uses standard JSON Schema format instead of OpenAPI-style responseSchema. Benefits of responseJsonSchema (Gemini 2.0+ only): - Standard JSON Schema format (lowercase types) - Supports additionalProperties for stricter validation - Better compatibility with Pydantic's model_json_schema() - No propertyOrdering required Usage: ```python response_format={ "type": "json_schema", "json_schema": {"schema": {...}}, "use_json_schema": True # opt-in } ``` This is backwards compatible - existing code continues to use responseSchema by default. Closes #16340 * docs: add documentation for use_json_schema parameter Document the new use_json_schema option for Gemini 2.0+ models in the JSON Mode documentation. * refactor(gemini): use responseJsonSchema by default for Gemini 2.0+ Remove opt-in flag `use_json_schema` and automatically detect model version: - Gemini 2.0+: uses responseJsonSchema (standard JSON Schema, supports additionalProperties) - Gemini 1.5: uses responseSchema (OpenAPI format, legacy) This follows LiteLLM's philosophy of abstracting provider differences - users write the same code regardless of model version. * test(vertex): update json_schema tests to accept both responseSchema formats Gemini 2.x+ uses responseJsonSchema while Gemini 1.x uses responseSchema. Update tests to accept both formats since litellm now auto-selects based on model version.	2026-01-19 10:45:37 -08:00
Harshit Jain	6cd4b3603f	fix(router): prevent retrying 4xx client errors (#19275 )	2026-01-19 05:18:35 -08:00
Ishaan Jaffer	5812654bdd	test_router_fallbacks_with_custom_model_costs	2026-01-17 16:34:46 -08:00
Sameer Kankute	09fb1581cb	Fix:add async_get_available_deployment_for_pass_through in code tests	2026-01-16 16:37:44 +05:30
Sameer Kankute	dca42047b9	Merge pull request #19068 from BerriAI/litellm_responses_caching_support [Fix] Add support for caching for responses API	2026-01-15 08:38:31 +05:30
Sameer Kankute	52d3c9dcfc	Add retry policy support to responses API	2026-01-14 14:55:17 +05:30
Sameer Kankute	333947e236	Add support for caching for responses API	2026-01-14 13:33:07 +05:30
Sameer Kankute	e98c2e4425	Merge pull request #19012 from BerriAI/litellm_fix_model_deployment_routing Fix: Model matching priority in configuration	2026-01-13 17:55:01 +05:30
Sameer Kankute	d656f01bc9	Merge pull request #19009 from Dima-Mediator/fix-image-tokens-spend-logging Fix image tokens spend logging for /images/generations	2026-01-13 15:03:37 +05:30
Sameer Kankute	ecb3959c3c	Merge pull request #18208 from Chesars/fix/case-insensitive-model-cost-lookup fix: case-insensitive model cost map lookup	2026-01-13 11:53:43 +05:30
Sameer Kankute	dfece51f8c	Fix: Model matching priority in configuration	2026-01-13 11:44:48 +05:30
Yuta Saito	a57f1e2e08	test: remove flaky azure oidc embedding test	2026-01-13 10:34:01 +09:00
Sameer Kankute	ab85eb8f43	Fix: gaurdrail moderation support with responses API	2026-01-12 17:27:13 +05:30
Ishaan Jaffer	5873cd4a32	test_completion_bedrock_titan_null_response	2026-01-07 15:09:44 +05:30
Ishaan Jaffer	269771a7c9	test_bedrock_httpx_streaming	2026-01-07 15:02:10 +05:30
Ishaan Jaffer	645ca64780	a121 fixes	2026-01-07 15:00:49 +05:30
Yuta Saito	1d16a8526e	feat: allow configuring project name for OpenTelemetry service name	2026-01-07 14:20:10 +09:00
Rohit Ravikant Rane	a7c39ccc12	fix(router): correct num_retries tracking in retry logic (#18712 ) * fix(router): correct num_retries tracking in retry logic - Fix off-by-one error in num_retries attribute when retries exhausted - Correct remaining_retries calculation - Add comprehensive tests for retry tracking edge cases Fixes incorrect retry count in error messages and logging * chore: trigger CI re-run * chore: trigger CI tests again	2026-01-06 23:58:41 +05:30
Yuta Saito	ca14160375	Revert "fix: model eol" This reverts commit `5aa1665d79`.	2026-01-06 16:49:30 +09:00
Yuta Saito	23713d1811	fix: anthropic claude-3-opus-20240229 EOL	2026-01-06 16:06:21 +09:00
Yuta Saito	5aa1665d79	fix: model eol	2026-01-06 15:47:37 +09:00
Sameer Kankute	c8bb92afdc	Merge pull request #18579 from BerriAI/litellm_remove_prompt_chaching_header Remove prompt caching headers as the support has been removed	2026-01-02 21:01:50 +05:30
Sameer Kankute	dd96f91254	Add thought signature for non tool call requests	2026-01-02 12:31:31 +05:30
Sameer Kankute	ca94990307	remove prompt caching headers as the support has been removed	2026-01-02 11:08:35 +05:30
Sameer Kankute	0addab4f63	Add test for removal of thought signature	2025-12-23 14:56:42 +05:30
Alexsander Hamir	5534038e93	Fix CI: Revert security scan changes and add GitGuardian ignore rules (#18358 )	2025-12-22 17:03:53 -08:00
Ishaan Jaffer	09dcb38b4b	Revert "test_update_model_cost_map_url" This reverts commit `c9d61fcdba`.	2025-12-22 12:41:30 +05:30
Ishaan Jaffer	c9d61fcdba	test_update_model_cost_map_url	2025-12-22 11:51:30 +05:30
Ishaan Jaffer	148715541c	test compl openai orgs	2025-12-20 23:41:29 +05:30
Ishaan Jaffer	b1090db927	test_openai_stream_options_call	2025-12-20 22:35:21 +05:30
Ishaan Jaffer	6112160a16	Revert "[Fix] Security - Remove example API keys with high entropy (#18255 )" This reverts commit `24edbccf5c`.	2025-12-20 20:48:11 +05:30
Ishaan Jaffer	c13b4c5514	test_aaapass_through_endpoint_pass_through_keys_langfuse	2025-12-20 14:36:59 +05:30
Alexsander Hamir	24edbccf5c	[Fix] Security - Remove example API keys with high entropy (#18255 )	2025-12-19 10:09:50 -08:00
Alexsander Hamir	0c48826cdc	Revert "[Fix] CI/CD - local_testing & mapped tests (#18222 )" (#18223 ) This reverts commit `dc7f500c47`.	2025-12-18 14:46:09 -08:00
Alexsander Hamir	dc7f500c47	[Fix] CI/CD - local_testing & mapped tests (#18222 )	2025-12-18 14:34:48 -08:00
Chesars	e08b4767e9	fix: case-insensitive model cost map lookup Users were getting "does not support parameters: ['tools']" errors when using lowercase model names (e.g., "qwen/qwen3-next-80b-a3b-thinking") because the model cost map has mixed-case keys and the lookup was case-sensitive. Added _get_model_cost_key() helper that tries exact match first (O(1)), then falls back to case-insensitive search if not found.	2025-12-18 17:25:43 -03:00
Alexsander Hamir	f353bb6dba	[Fix] CI/CD - litellm_mapped_tests_llms \| litellm_mapped_tests_core \| caching_unit_tests (#18197 )	2025-12-18 08:49:23 -08:00
Kris Xia	be2f429087	fix(proxy): extract model from vertex ai passthrough url pattern (#18097 ) extract model id from vertex ai passthrough routes that follow the pattern: /vertex_ai//models/{model_id}: the model extraction now handles vertex ai routes by regex matching the model segment from the url path, which allows proper model identification for authentication and authorization in proxy pass-through endpoints. adds comprehensive test coverage for vertex ai model extraction including: - various vertex api versions (v1, v1beta1) - different locations (us-central1, asia-southeast1) - model names with special suffixes (gemini-1.5-pro, gemini-2.0-flash) - precedence verification (request body model over url) - non-vertex route isolation	2025-12-17 11:10:14 +05:30
Sameer Kankute	dbcae4aca5	fix: Add none to encoding_format instead of omitting it	2025-12-16 13:23:15 +05:30
Cesar Garcia	df19a747a2	feat(custom_llm): add image_edit and aimage_edit support (#17999 ) * feat(custom_llm): add image_edit and aimage_edit support Add support for image_edit and aimage_edit methods in CustomLLM class, allowing users to implement custom image editing providers. Changes: - Add image_edit() and aimage_edit() methods to CustomLLM base class - Add custom provider detection in litellm.image_edit() function - Add tests for sync and async image_edit with custom handlers * docs: add image_edit to CustomLLM documentation - Add /v1/images/edits to supported routes - Add Image Edit section with example - Update Custom Handler Spec with image_edit methods	2025-12-15 14:16:36 -08:00
Alexsander Hamir	6635325629	fix: filter internal params in fallback code and fix test issues (#17941 ) - Filter skip_mcp_handler and other internal params in fallback_utils.py before calling acompletion Fixes issue where internal parameters were being passed to provider APIs causing errors - Remove deployment field from GCS bucket logger test metadata Fixes model name mismatch where deployment field was overriding the model in logging - Update Bedrock Titan test to use non-deprecated model (titan-text-express-v1) Fixes test failure due to deprecated amazon.titan-text-lite-v1 model	2025-12-13 15:05:26 -08:00
Ishaan Jaffer	b6562b9d88	test_thought_true_with_empty_text_creates_block	2025-12-06 16:00:05 -08:00
Alexsander Hamir	09dcd381d7	fix: mark test_caching_with_reasoning_content as flaky and handle InternalServerError (#17603 ) - Add @pytest.mark.flaky(retries=3, delay=1) decorator to handle intermittent Anthropic API failures - Add error handling to skip test when Anthropic API returns InternalServerError - Prevents false test failures due to external API 500 errors	2025-12-06 09:41:09 -08:00
yuneng-jiang	4a0893ca22	Merge remote-tracking branch 'origin' into litellm_ui_callback_fix	2025-12-05 11:43:35 -08:00
Yannay Hammer	2627f0d519	Fix aim security guardrail tests (#17499 )	2025-12-04 13:29:20 -08:00
Sameer Kankute	6751badf3a	fix: test_default_api_base for ragfow	2025-12-04 21:49:33 +05:30
yuneng-jiang	cc92fdf90f	Merge remote-tracking branch 'origin' into litellm_ui_callback_fix	2025-12-03 11:02:59 -08:00

1 2 3 4 5 ...

918 Commits