918 Commits

Author SHA1 Message Date
Sameer Kankute 9e1275b76c Merge branch 'main' into litellm_staging_01_19_2026 2026-01-20 19:19:36 +05:30
Sameer Kankute cd96c8cbb0 Fix:test_aaaaazure_tenant_id_auth 2026-01-20 17:39:08 +05:30
Emerson Gomes 13d887a275 Fix queue persistence to Redis (#19304)
* Fix queue persistence to Redis

* add test
2026-01-19 19:01:34 -08:00
Cesar Garcia d30c25af21 feat(gemini): use responseJsonSchema for Gemini 2.0+ models (#19314)
* feat(gemini): add opt-in support for responseJsonSchema

Add support for Gemini's native responseJsonSchema parameter which uses
standard JSON Schema format instead of OpenAPI-style responseSchema.

Benefits of responseJsonSchema (Gemini 2.0+ only):
- Standard JSON Schema format (lowercase types)
- Supports additionalProperties for stricter validation
- Better compatibility with Pydantic's model_json_schema()
- No propertyOrdering required

Usage:
```python
response_format={
    "type": "json_schema",
    "json_schema": {"schema": {...}},
    "use_json_schema": True  # opt-in
}
```

This is backwards compatible - existing code continues to use
responseSchema by default.

Closes #16340

* docs: add documentation for use_json_schema parameter

Document the new use_json_schema option for Gemini 2.0+ models
in the JSON Mode documentation.

* refactor(gemini): use responseJsonSchema by default for Gemini 2.0+

Remove opt-in flag `use_json_schema` and automatically detect model version:
- Gemini 2.0+: uses responseJsonSchema (standard JSON Schema, supports additionalProperties)
- Gemini 1.5: uses responseSchema (OpenAPI format, legacy)

This follows LiteLLM's philosophy of abstracting provider differences -
users write the same code regardless of model version.

* test(vertex): update json_schema tests to accept both responseSchema formats

Gemini 2.x+ uses responseJsonSchema while Gemini 1.x uses responseSchema.
Update tests to accept both formats since litellm now auto-selects based
on model version.
2026-01-19 10:45:37 -08:00
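
Note on d30c25af21: the version-based selection described in that commit can be sketched roughly as follows. This is an illustrative sketch only, not LiteLLM's actual code: the helper name `pick_schema_field`, the regex used to read the version out of the model name, and the fallback to the legacy field for unrecognized names are all assumptions.

```python
import re

# Hypothetical helper: choose which Gemini request field carries the schema.
# The two field names come from the commit message; everything else is assumed.
_GEMINI_VERSION_RE = re.compile(r"gemini-(\d+)(?:\.\d+)?")


def pick_schema_field(model: str) -> str:
    """Return "responseJsonSchema" for Gemini 2.0+ and "responseSchema" otherwise."""
    match = _GEMINI_VERSION_RE.search(model)
    if match is None:
        # Assumption: unknown model names fall back to the legacy field.
        return "responseSchema"
    major = int(match.group(1))
    return "responseJsonSchema" if major >= 2 else "responseSchema"
```

With this sketch, `pick_schema_field("gemini-2.0-flash")` selects the JSON Schema field and `pick_schema_field("gemini-1.5-pro")` selects the legacy one, so calling code stays the same across model versions.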
Harshit Jain 6cd4b3603f fix(router): prevent retrying 4xx client errors (#19275) 2026-01-19 05:18:35 -08:00
Ishaan Jaffer 5812654bdd test_router_fallbacks_with_custom_model_costs 2026-01-17 16:34:46 -08:00
Sameer Kankute 09fb1581cb Fix:add async_get_available_deployment_for_pass_through in code tests 2026-01-16 16:37:44 +05:30
Sameer Kankute dca42047b9 Merge pull request #19068 from BerriAI/litellm_responses_caching_support
[Fix] Add support for caching for responses API
2026-01-15 08:38:31 +05:30
Sameer Kankute 52d3c9dcfc Add retry policy support to responses API 2026-01-14 14:55:17 +05:30
Sameer Kankute 333947e236 Add support for caching for responses API 2026-01-14 13:33:07 +05:30
Sameer Kankute e98c2e4425 Merge pull request #19012 from BerriAI/litellm_fix_model_deployment_routing
Fix: Model matching priority in configuration
2026-01-13 17:55:01 +05:30
Sameer Kankute d656f01bc9 Merge pull request #19009 from Dima-Mediator/fix-image-tokens-spend-logging
Fix image tokens spend logging for /images/generations
2026-01-13 15:03:37 +05:30
Sameer Kankute ecb3959c3c Merge pull request #18208 from Chesars/fix/case-insensitive-model-cost-lookup
fix: case-insensitive model cost map lookup
2026-01-13 11:53:43 +05:30
Sameer Kankute dfece51f8c Fix: Model matching priority in configuration 2026-01-13 11:44:48 +05:30
Yuta Saito a57f1e2e08 test: remove flaky azure oidc embedding test 2026-01-13 10:34:01 +09:00
Sameer Kankute ab85eb8f43 Fix: guardrail moderation support with responses API 2026-01-12 17:27:13 +05:30
Ishaan Jaffer 5873cd4a32 test_completion_bedrock_titan_null_response 2026-01-07 15:09:44 +05:30
Ishaan Jaffer 269771a7c9 test_bedrock_httpx_streaming 2026-01-07 15:02:10 +05:30
Ishaan Jaffer 645ca64780 ai21 fixes 2026-01-07 15:00:49 +05:30
Yuta Saito 1d16a8526e feat: allow configuring project name for OpenTelemetry service name 2026-01-07 14:20:10 +09:00
Rohit Ravikant Rane a7c39ccc12 fix(router): correct num_retries tracking in retry logic (#18712)
* fix(router): correct num_retries tracking in retry logic

- Fix off-by-one error in num_retries attribute when retries exhausted
- Correct remaining_retries calculation
- Add comprehensive tests for retry tracking edge cases

Fixes incorrect retry count in error messages and logging

* chore: trigger CI re-run

* chore: trigger CI tests again
2026-01-06 23:58:41 +05:30
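
Note on a7c39ccc12: the kind of bookkeeping that commit describes (retries performed versus retries remaining) can be illustrated with a generic retry loop. This is a sketch under assumptions, not the router's implementation: `RetriesExhausted`, `call_with_retries`, and `max_retries` are hypothetical names chosen for the example.

```python
class RetriesExhausted(Exception):
    """Hypothetical error carrying the number of retries actually performed."""

    def __init__(self, num_retries: int, cause: Exception):
        super().__init__(f"failed after {num_retries} retries: {cause}")
        self.num_retries = num_retries


def call_with_retries(fn, max_retries: int):
    """Call fn once, then retry up to max_retries more times on failure."""
    retries_done = 0
    while True:
        try:
            return fn()
        except Exception as exc:
            # Remaining retries are counted before the next attempt is made.
            remaining_retries = max_retries - retries_done
            if remaining_retries <= 0:
                # Report retries performed, excluding the initial attempt.
                raise RetriesExhausted(retries_done, exc) from exc
            retries_done += 1
```

Counting the initial call separately from the retries is what keeps the reported number from being off by one: with `max_retries=2` the function is invoked three times and the error reports two retries.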
Yuta Saito ca14160375 Revert "fix: model eol"
This reverts commit 5aa1665d79.
2026-01-06 16:49:30 +09:00
Yuta Saito 23713d1811 fix: anthropic claude-3-opus-20240229 EOL 2026-01-06 16:06:21 +09:00
Yuta Saito 5aa1665d79 fix: model eol 2026-01-06 15:47:37 +09:00
Sameer Kankute c8bb92afdc Merge pull request #18579 from BerriAI/litellm_remove_prompt_chaching_header
Remove prompt caching headers as the support has been removed
2026-01-02 21:01:50 +05:30
Sameer Kankute dd96f91254 Add thought signature for non tool call requests 2026-01-02 12:31:31 +05:30
Sameer Kankute ca94990307 remove prompt caching headers as the support has been removed 2026-01-02 11:08:35 +05:30
Sameer Kankute 0addab4f63 Add test for removal of thought signature 2025-12-23 14:56:42 +05:30
Alexsander Hamir 5534038e93 Fix CI: Revert security scan changes and add GitGuardian ignore rules (#18358) 2025-12-22 17:03:53 -08:00
Ishaan Jaffer 09dcb38b4b Revert "test_update_model_cost_map_url"
This reverts commit c9d61fcdba.
2025-12-22 12:41:30 +05:30
Ishaan Jaffer c9d61fcdba test_update_model_cost_map_url 2025-12-22 11:51:30 +05:30
Ishaan Jaffer 148715541c test compl openai orgs 2025-12-20 23:41:29 +05:30
Ishaan Jaffer b1090db927 test_openai_stream_options_call 2025-12-20 22:35:21 +05:30
Ishaan Jaffer 6112160a16 Revert "[Fix] Security - Remove example API keys with high entropy (#18255)"
This reverts commit 24edbccf5c.
2025-12-20 20:48:11 +05:30
Ishaan Jaffer c13b4c5514 test_aaapass_through_endpoint_pass_through_keys_langfuse 2025-12-20 14:36:59 +05:30
Alexsander Hamir 24edbccf5c [Fix] Security - Remove example API keys with high entropy (#18255) 2025-12-19 10:09:50 -08:00
Alexsander Hamir 0c48826cdc Revert "[Fix] CI/CD - local_testing & mapped tests (#18222)" (#18223)
This reverts commit dc7f500c47.
2025-12-18 14:46:09 -08:00
Alexsander Hamir dc7f500c47 [Fix] CI/CD - local_testing & mapped tests (#18222) 2025-12-18 14:34:48 -08:00
Chesars e08b4767e9 fix: case-insensitive model cost map lookup
Users were getting "does not support parameters: ['tools']" errors when
using lowercase model names (e.g., "qwen/qwen3-next-80b-a3b-thinking")
because the model cost map has mixed-case keys and the lookup was
case-sensitive.

Added _get_model_cost_key() helper that tries exact match first (O(1)),
then falls back to case-insensitive search if not found.
2025-12-18 17:25:43 -03:00
Alexsander Hamir f353bb6dba [Fix] CI/CD - litellm_mapped_tests_llms | litellm_mapped_tests_core | caching_unit_tests (#18197) 2025-12-18 08:49:23 -08:00
Kris Xia be2f429087 fix(proxy): extract model from vertex ai passthrough url pattern (#18097)
extract model id from vertex ai passthrough routes that follow the pattern:
/vertex_ai/*/models/{model_id}:*

the model extraction now handles vertex ai routes by regex matching the model
segment from the url path, which allows proper model identification for
authentication and authorization in proxy pass-through endpoints.

adds comprehensive test coverage for vertex ai model extraction including:
- various vertex api versions (v1, v1beta1)
- different locations (us-central1, asia-southeast1)
- model names with special suffixes (gemini-1.5-pro, gemini-2.0-flash)
- precedence verification (request body model over url)
- non-vertex route isolation
2025-12-17 11:10:14 +05:30
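
Note on be2f429087: a minimal sketch of extracting the model id from a path shaped like `/vertex_ai/*/models/{model_id}:*`, with the request body taking precedence as the commit's test list states. The regex, the function name `extract_vertex_model`, and the `body_model` parameter are assumptions for illustration, not the proxy's actual code.

```python
import re
from typing import Optional

# Assumed pattern: capture the segment between "/models/" and the ":" action suffix.
_VERTEX_MODEL_RE = re.compile(r"/vertex_ai/.*/models/([^/:]+):")


def extract_vertex_model(path: str, body_model: Optional[str] = None) -> Optional[str]:
    """Return the model id for a Vertex AI pass-through request, if any."""
    # A model named in the request body wins over the URL.
    if body_model:
        return body_model
    match = _VERTEX_MODEL_RE.search(path)
    return match.group(1) if match else None
```

Paths that do not contain the `/vertex_ai/` prefix never match, so other routes are left to their existing model-detection logic.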
Sameer Kankute dbcae4aca5 fix: Add none to encoding_format instead of omitting it 2025-12-16 13:23:15 +05:30
Cesar Garcia df19a747a2 feat(custom_llm): add image_edit and aimage_edit support (#17999)
* feat(custom_llm): add image_edit and aimage_edit support

Add support for image_edit and aimage_edit methods in CustomLLM class,
allowing users to implement custom image editing providers.

Changes:
- Add image_edit() and aimage_edit() methods to CustomLLM base class
- Add custom provider detection in litellm.image_edit() function
- Add tests for sync and async image_edit with custom handlers

* docs: add image_edit to CustomLLM documentation

- Add /v1/images/edits to supported routes
- Add Image Edit section with example
- Update Custom Handler Spec with image_edit methods
2025-12-15 14:16:36 -08:00
Alexsander Hamir 6635325629 fix: filter internal params in fallback code and fix test issues (#17941)
- Filter skip_mcp_handler and other internal params in fallback_utils.py before calling acompletion
  Fixes issue where internal parameters were being passed to provider APIs causing errors
- Remove deployment field from GCS bucket logger test metadata
  Fixes model name mismatch where deployment field was overriding the model in logging
- Update Bedrock Titan test to use non-deprecated model (titan-text-express-v1)
  Fixes test failure due to deprecated amazon.titan-text-lite-v1 model
2025-12-13 15:05:26 -08:00
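
Note on 6635325629: filtering internal parameters before a fallback call can be sketched as a simple key filter. Only `skip_mcp_handler` is named in the commit; the set `INTERNAL_PARAMS` and the helper `strip_internal_params` are hypothetical, and the real list of internal parameters is likely longer.

```python
# Assumption: only the parameter named in the commit is listed here.
INTERNAL_PARAMS = frozenset({"skip_mcp_handler"})


def strip_internal_params(kwargs: dict) -> dict:
    """Return a copy of kwargs without keys meant for internal use only."""
    return {key: value for key, value in kwargs.items() if key not in INTERNAL_PARAMS}
```

Returning a new dict leaves the caller's kwargs untouched, so the internal flags remain available to code that runs before the provider call.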
Ishaan Jaffer b6562b9d88 test_thought_true_with_empty_text_creates_block 2025-12-06 16:00:05 -08:00
Alexsander Hamir 09dcd381d7 fix: mark test_caching_with_reasoning_content as flaky and handle InternalServerError (#17603)
- Add @pytest.mark.flaky(retries=3, delay=1) decorator to handle intermittent Anthropic API failures
- Add error handling to skip test when Anthropic API returns InternalServerError
- Prevents false test failures due to external API 500 errors
2025-12-06 09:41:09 -08:00
yuneng-jiang 4a0893ca22 Merge remote-tracking branch 'origin' into litellm_ui_callback_fix 2025-12-05 11:43:35 -08:00
Yannay Hammer 2627f0d519 Fix aim security guardrail tests (#17499) 2025-12-04 13:29:20 -08:00
Sameer Kankute 6751badf3a fix: test_default_api_base for ragflow 2025-12-04 21:49:33 +05:30
yuneng-jiang cc92fdf90f Merge remote-tracking branch 'origin' into litellm_ui_callback_fix 2025-12-03 11:02:59 -08:00