Commit Graph

5287 Commits

Author SHA1 Message Date
Harshit Jain 746414eb9b Fix/per service ssl override v2 (#19538)
* refactor(ssl): support per-service SSL verification overrides

* add test cases for ssl
2026-01-21 20:10:04 -08:00
Cesar Garcia 4106d24215 feat: add GMI Cloud provider support (#19376)
* feat: add GMI Cloud provider support

Add GMI Cloud as an OpenAI-compatible provider with:
- Provider configuration in providers.json
- Documentation page with usage examples
- Model pricing for 16 models (Claude, GPT, DeepSeek, Gemini, etc.)
- Sidebar entry for docs navigation

* Add gmi_cloud to provider_endpoints_support.json

Add provider entry to pass CI validation check that ensures all
providers in openai_like/providers.json are documented.

* Fix provider key: gmi_cloud -> gmi

Match the provider key with providers.json

---------

Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>
2026-01-21 15:48:15 -08:00
Sampson 09941dd1d1 add search provider for brave search api (#19433)
* add search provider for brave search api

Introduces a minimal implementation of the Brave Search API as a search provider. Additionally, this PR introduces a test file to ensure the provider works properly, and numerous other smaller changes (e.g., changes to docs to mention the new option).

* Update transformation.py
2026-01-20 19:23:29 -08:00
Kamil Jopek ce722ab763 Make grpc dependency optional (#19447)
* Make grpc optional and document gRPC OTEL setup

* Add tests for missing OTLP gRPC imports
2026-01-20 19:03:52 -08:00
Cesar Garcia 94055741d4 docs: clarify Gemini vs Vertex AI model prefix behavior (#19443)
Add documentation explaining the difference between model formats:
- `gemini/model` → Gemini API (simple API key)
- `vertex_ai/model` → Vertex AI (GCP credentials)
- `model` (no prefix) → defaults to Vertex AI

This addresses user confusion when models without prefix require
GCP authentication instead of simple API key auth.

Ref #8424
2026-01-20 15:22:52 -08:00
Alexsander Hamir 5a06868652 Fix in-flight request termination on SIGTERM when health-check runs in a separate process (#19427) 2026-01-20 12:17:06 -08:00
Krrish Dholakia f95f5563ea docs: document input/output/total tokens behaviour
Closes https://github.com/BerriAI/litellm/issues/17480
2026-01-20 10:45:47 -08:00
Ishaan Jaffer f6d6455cbc fix rc 2026-01-20 08:39:17 -08:00
Sameer Kankute dc3ee63359 fix: test_env_keys 2026-01-20 18:37:56 +05:30
Sameer Kankute ae414ed462 Revert "feat: add retry_delay, exponential_backoff, and jitter to completion(…"
This reverts commit 1678f621db.
2026-01-20 17:07:00 +05:30
Sameer Kankute deb9142117 Merge pull request #19400 from BerriAI/main
merge main iin 19/1 staging
2026-01-20 16:45:01 +05:30
Igal Boxerman d7ac8de832 docs: Migrate Pillar Security to Generic Guardrail API (#19364)
Update Pillar Security integration to use the generic_guardrail_api
instead of the dedicated pillar guardrail type. This aligns with
the Generic Guardrail API specification introduced in previous PRs.

Changes:
- Rewrite pillar_security.md with new generic_guardrail_api config
- Add Pillar Security example to generic_guardrail_api.md
- Add Pillar Security to quick_start.md guardrails examples

Related PRs: #17175, #18647, #18932, #19023
2026-01-19 20:35:56 -08:00
YutaSaito 00814d4d90 Merge pull request #19379 from BerriAI/litellm_feat_mcp_version_up
[feat] mcp version up
2026-01-20 13:09:29 +09:00
Ishaan Jaffer ea2e360cb5 docs 2026-01-19 19:49:51 -08:00
Ishaan Jaffer f865f92bec docs plugin marketplaces 2026-01-19 19:42:15 -08:00
Will Chen 2761077820 docs: fix vertex_region to vertex_location in Vertex AI pass-through docs (#19380)
The documentation incorrectly used `vertex_region` as the parameter name,
but the actual parameter expected by LiteLLM is `vertex_location` as defined
in VertexPassThroughCredentials and other type definitions.

Co-authored-by: Claude <noreply@anthropic.com>
2026-01-19 19:15:17 -08:00
Yuta Saito ec7bf0ff1a Merge remote-tracking branch 'upstream/main' into litellm_feat_mcp_version_up 2026-01-20 09:52:38 +09:00
Yuta Saito e5bc2d31d0 docs: mcp version up 2026-01-20 07:56:38 +09:00
0x1f99d 1cce718551 fix(bedrock): deduplicate tool calls in assistant history (#15178) (#19324)
* fix: Avoid attaching tool calls when a call_id already exists

* fix: Prevent MCP responses from reviving past tool calls via previous_response_id

* test: Parametrize MCP streaming test to cover OpenAI and Anthropic models

* test: Fail MCP streaming test when LiteLLM logs errors during follow-up calls

* test: Let MCP tool-execution mock accept new kwargs for streaming tests

* chore: fix lint error

* docs: Add Google Workload Identity Federation (WIF) documentation to Vertex AI (#19320)

- Added new section documenting WIF support for Vertex AI authentication
- Included SDK and Proxy configuration examples
- Added sample WIF credentials file format for AWS federation
- Mentioned LLM Credentials UI as an alternative for credential management
- Added link to Google Cloud WIF documentation

Co-authored-by: Cursor Agent <cursoragent@cursor.com>

* fix(bedrock): deduplicate tool calls in assistant history (#15178)

* fix(types): add missing Set import to factory.py

---------

Co-authored-by: Yuta Saito <uc4w6c@bma.biglobe.ne.jp>
Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
Co-authored-by: YutaSaito <36355491+uc4w6c@users.noreply.github.com>
2026-01-19 10:56:49 -08:00
Alexsander Hamir 3cdeebb5b8 fix(gcs_bucket): prevent unbounded queue growth due to slow API calls (#19297) 2026-01-19 10:47:56 -08:00
Cesar Garcia d30c25af21 feat(gemini): use responseJsonSchema for Gemini 2.0+ models (#19314)
* feat(gemini): add opt-in support for responseJsonSchema

Add support for Gemini's native responseJsonSchema parameter which uses
standard JSON Schema format instead of OpenAPI-style responseSchema.

Benefits of responseJsonSchema (Gemini 2.0+ only):
- Standard JSON Schema format (lowercase types)
- Supports additionalProperties for stricter validation
- Better compatibility with Pydantic's model_json_schema()
- No propertyOrdering required

Usage:
```python
response_format={
    "type": "json_schema",
    "json_schema": {"schema": {...}},
    "use_json_schema": True  # opt-in
}
```

This is backwards compatible - existing code continues to use
responseSchema by default.

Closes #16340

* docs: add documentation for use_json_schema parameter

Document the new use_json_schema option for Gemini 2.0+ models
in the JSON Mode documentation.

* refactor(gemini): use responseJsonSchema by default for Gemini 2.0+

Remove opt-in flag `use_json_schema` and automatically detect model version:
- Gemini 2.0+: uses responseJsonSchema (standard JSON Schema, supports additionalProperties)
- Gemini 1.5: uses responseSchema (OpenAPI format, legacy)

This follows LiteLLM's philosophy of abstracting provider differences -
users write the same code regardless of model version.

* test(vertex): update json_schema tests to accept both responseSchema formats

Gemini 2.x+ uses responseJsonSchema while Gemini 1.x uses responseSchema.
Update tests to accept both formats since litellm now auto-selects based
on model version.
2026-01-19 10:45:37 -08:00
Harshit Jain 99c4ba7adf docs: fix bad examples from sdk (#19322) 2026-01-19 10:27:25 -08:00
Harshit Jain 1678f621db feat: add retry_delay, exponential_backoff, and jitter to completion() (#19371) 2026-01-19 10:27:01 -08:00
Krish Dholakia 0862373b38 docs: add note about no limits on users/keys/teams in LiteLLM OSS (#19367)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2026-01-19 08:22:18 -08:00
Cesar Garcia 4d6a430adc docs: update UI contributing guide (#19353)
* docs: update UI contributing guide with correct commands

- Replace outdated proxy_cli.py command with poetry run litellm
- Add config.yaml example with required settings
- Clarify that UI comes pre-built in the repo
- Add two development options: Build Mode and Dev Mode (hot reload)
- Note about redirect issues in Dev Mode

* docs: add hot reload login flow and PR submission section

- Document the 3000 -> 4000 -> 3000 login flow for hot reload
- Reorder: Hot Reload as Option A, Build Mode as Option B
- Add section 4 on submitting PRs
- Add note that UI changes don't require tests

* Update login flow navigation URL in contributing.md
2026-01-19 06:18:45 -08:00
Manuel Schweigert 29adf34313 Add ChatGPT subscription support and responses bridge (#19030)
* Add ChatGPT subscription support and responses bridge

* Fix typing import for responses bridge

* Guard device code timestamp parsing

* add /v1/messages endpoint to chatgpt model
2026-01-19 05:37:45 -08:00
Sameer Kankute 0a15f1b66a Fix: stability image optional para 2026-01-19 09:05:52 +05:30
Krish Dholakia e326b397c5 docs: Add Google Workload Identity Federation (WIF) documentation to Vertex AI (#19320)
- Added new section documenting WIF support for Vertex AI authentication
- Included SDK and Proxy configuration examples
- Added sample WIF credentials file format for AWS federation
- Mentioned LLM Credentials UI as an alternative for credential management
- Added link to Google Cloud WIF documentation

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2026-01-18 18:18:01 -08:00
YutaSaito 4dc9726db4 Merge pull request #19292 from BerriAI/litellm_docs_cursor-mcp
[docs] add cursor with mcp
2026-01-18 14:59:08 +09:00
Ishaan Jaffer 2fda7a2534 docs fix 2026-01-17 18:33:19 -08:00
Ishaan Jaffer 2e840b1216 fix Deleted Keys Audit Table on UI 2026-01-17 18:12:18 -08:00
Alexsander Hamir 419423cf43 docs: add performance improvement section (#19300) 2026-01-17 18:06:14 -08:00
Ishaan Jaffer 4f2bdd33f4 docs claude code 2026-01-17 18:05:16 -08:00
Ishaan Jaffer fcb5cfe9d9 add img to doc 2026-01-17 18:05:16 -08:00
yuneng-jiang 62b082b8b1 Merge pull request #19299 from BerriAI/1810_docs_yj
[Docs] Deleted Key and Team Table
2026-01-17 17:51:03 -08:00
yuneng-jiang 8629ca819d deleted keys docs 2026-01-17 17:49:12 -08:00
Ishaan Jaffer 46de47d542 doc fix 2026-01-17 17:35:52 -08:00
Ishaan Jaffer 7eecf81cdc docs fix 2026-01-17 17:29:49 -08:00
Ishaan Jaffer 534fa9f4c0 docs fix 2026-01-17 17:26:58 -08:00
Ishaan Jaffer 26497b415b docs fix 2026-01-17 17:21:31 -08:00
Ishaan Jaffer c6998823c0 docs fix 2026-01-17 17:17:34 -08:00
Ishaan Jaffer c158f83cff docs fix 2026-01-17 17:17:13 -08:00
Ishaan Jaffer 4610d1d43c docs fix 2026-01-17 17:16:22 -08:00
Ishaan Jaffer 7d24bbed42 qa fixes 2026-01-17 17:14:51 -08:00
Ishaan Jaffer e15526a60e fix 2026-01-17 17:13:22 -08:00
Ishaan Jaffer c30b17aa9b docs fix 2026-01-17 17:03:48 -08:00
Ishaan Jaff 1417b002a3 [Feat] Claude Code x LiteLLM WebSearch - QA Fixes to work with Claude Code (#19294)
* fix websearch_interception_converted_stream

* test_websearch_interception_no_tool_call_streaming

* FakeAnthropicMessagesStreamIterator

* LITELLM_WEB_SEARCH_TOOL_NAME

* fixes tools def for litellm web search

* fixes FakeAnthropicMessagesStreamIterator

* test_litellm_standard_websearch_tool

* use new hook for modfying before any transfroms from litellm

* init WebSearchInterceptionLogger + ARCHITECTURE

* fix config.yaml

* init doc for claude code web search

* docs fix

* doc fix

* fix mypy linting
2026-01-17 16:30:31 -08:00
YutaSaito d28bf983eb Merge pull request #19272 from Harshit28j/feature/panw-custom-violation-msg
feat(panw_prisma_airs): add custom violation message support
2026-01-18 06:55:39 +09:00
Yuta Saito 18839a3600 docs: add cursor with mcp 2026-01-18 06:51:09 +09:00
yuneng-jiang 953e2736d4 Merge pull request #19291 from BerriAI/deleted_keys_docs_2
[Docs] Deleted Keys and Teams Docs
2026-01-17 13:20:48 -08:00