litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-26 21:07:19 +00:00

Author	SHA1	Message	Date
Harshit Jain	746414eb9b	Fix/per service ssl override v2 (#19538) * refactor(ssl): support per-service SSL verification overrides * add test cases for ssl	2026-01-21 20:10:04 -08:00
Cesar Garcia	4106d24215	feat: add GMI Cloud provider support (#19376) * feat: add GMI Cloud provider support Add GMI Cloud as an OpenAI-compatible provider with: - Provider configuration in providers.json - Documentation page with usage examples - Model pricing for 16 models (Claude, GPT, DeepSeek, Gemini, etc.) - Sidebar entry for docs navigation * Add gmi_cloud to provider_endpoints_support.json Add provider entry to pass CI validation check that ensures all providers in openai_like/providers.json are documented. * Fix provider key: gmi_cloud -> gmi Match the provider key with providers.json --------- Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>	2026-01-21 15:48:15 -08:00
Sampson	09941dd1d1	add search provider for brave search api (#19433) * add search provider for brave search api Introduces a minimal implementation of the Brave Search API as a search provider. Additionally, this PR introduces a test file to ensure the provider works properly, and numerous other smaller changes (e.g., changes to docs to mention the new option). * Update transformation.py	2026-01-20 19:23:29 -08:00
Kamil Jopek	ce722ab763	Make `grpc` dependency optional (#19447) * Make grpc optional and document gRPC OTEL setup * Add tests for missing OTLP gRPC imports	2026-01-20 19:03:52 -08:00
Cesar Garcia	94055741d4	docs: clarify Gemini vs Vertex AI model prefix behavior (#19443) Add documentation explaining the difference between model formats: - `gemini/model` → Gemini API (simple API key) - `vertex_ai/model` → Vertex AI (GCP credentials) - `model` (no prefix) → defaults to Vertex AI This addresses user confusion when models without prefix require GCP authentication instead of simple API key auth. Ref #8424	2026-01-20 15:22:52 -08:00
Alexsander Hamir	5a06868652	Fix in-flight request termination on SIGTERM when health-check runs in a separate process (#19427)	2026-01-20 12:17:06 -08:00
Krrish Dholakia	f95f5563ea	docs: document input/output/total tokens behaviour Closes https://github.com/BerriAI/litellm/issues/17480	2026-01-20 10:45:47 -08:00
Ishaan Jaffer	f6d6455cbc	fix rc	2026-01-20 08:39:17 -08:00
Sameer Kankute	dc3ee63359	fix: test_env_keys	2026-01-20 18:37:56 +05:30
Sameer Kankute	ae414ed462	Revert "feat: add retry_delay, exponential_backoff, and jitter to completion(…" This reverts commit `1678f621db`.	2026-01-20 17:07:00 +05:30
Sameer Kankute	deb9142117	Merge pull request #19400 from BerriAI/main merge main in 19/1 staging	2026-01-20 16:45:01 +05:30
Igal Boxerman	d7ac8de832	docs: Migrate Pillar Security to Generic Guardrail API (#19364) Update Pillar Security integration to use the generic_guardrail_api instead of the dedicated pillar guardrail type. This aligns with the Generic Guardrail API specification introduced in previous PRs. Changes: - Rewrite pillar_security.md with new generic_guardrail_api config - Add Pillar Security example to generic_guardrail_api.md - Add Pillar Security to quick_start.md guardrails examples Related PRs: #17175, #18647, #18932, #19023	2026-01-19 20:35:56 -08:00
YutaSaito	00814d4d90	Merge pull request #19379 from BerriAI/litellm_feat_mcp_version_up [feat] mcp version up	2026-01-20 13:09:29 +09:00
Ishaan Jaffer	ea2e360cb5	docs	2026-01-19 19:49:51 -08:00
Ishaan Jaffer	f865f92bec	docs plugin marketplaces	2026-01-19 19:42:15 -08:00
Will Chen	2761077820	docs: fix vertex_region to vertex_location in Vertex AI pass-through docs (#19380) The documentation incorrectly used `vertex_region` as the parameter name, but the actual parameter expected by LiteLLM is `vertex_location` as defined in VertexPassThroughCredentials and other type definitions. Co-authored-by: Claude <noreply@anthropic.com>	2026-01-19 19:15:17 -08:00
Yuta Saito	ec7bf0ff1a	Merge remote-tracking branch 'upstream/main' into litellm_feat_mcp_version_up	2026-01-20 09:52:38 +09:00
Yuta Saito	e5bc2d31d0	docs: mcp version up	2026-01-20 07:56:38 +09:00
0x1f99d	1cce718551	fix(bedrock): deduplicate tool calls in assistant history (#15178) (#19324) * fix: Avoid attaching tool calls when a call_id already exists * fix: Prevent MCP responses from reviving past tool calls via previous_response_id * test: Parametrize MCP streaming test to cover OpenAI and Anthropic models * test: Fail MCP streaming test when LiteLLM logs errors during follow-up calls * test: Let MCP tool-execution mock accept new kwargs for streaming tests * chore: fix lint error * docs: Add Google Workload Identity Federation (WIF) documentation to Vertex AI (#19320) - Added new section documenting WIF support for Vertex AI authentication - Included SDK and Proxy configuration examples - Added sample WIF credentials file format for AWS federation - Mentioned LLM Credentials UI as an alternative for credential management - Added link to Google Cloud WIF documentation Co-authored-by: Cursor Agent <cursoragent@cursor.com> * fix(bedrock): deduplicate tool calls in assistant history (#15178) * fix(types): add missing Set import to factory.py --------- Co-authored-by: Yuta Saito <uc4w6c@bma.biglobe.ne.jp> Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> Co-authored-by: Cursor Agent <cursoragent@cursor.com> Co-authored-by: YutaSaito <36355491+uc4w6c@users.noreply.github.com>	2026-01-19 10:56:49 -08:00
Alexsander Hamir	3cdeebb5b8	fix(gcs_bucket): prevent unbounded queue growth due to slow API calls (#19297)	2026-01-19 10:47:56 -08:00
Cesar Garcia	d30c25af21	feat(gemini): use responseJsonSchema for Gemini 2.0+ models (#19314) * feat(gemini): add opt-in support for responseJsonSchema Add support for Gemini's native responseJsonSchema parameter which uses standard JSON Schema format instead of OpenAPI-style responseSchema. Benefits of responseJsonSchema (Gemini 2.0+ only): - Standard JSON Schema format (lowercase types) - Supports additionalProperties for stricter validation - Better compatibility with Pydantic's model_json_schema() - No propertyOrdering required Usage: ```python response_format={ "type": "json_schema", "json_schema": {"schema": {...}}, "use_json_schema": True # opt-in } ``` This is backwards compatible - existing code continues to use responseSchema by default. Closes #16340 * docs: add documentation for use_json_schema parameter Document the new use_json_schema option for Gemini 2.0+ models in the JSON Mode documentation. * refactor(gemini): use responseJsonSchema by default for Gemini 2.0+ Remove opt-in flag `use_json_schema` and automatically detect model version: - Gemini 2.0+: uses responseJsonSchema (standard JSON Schema, supports additionalProperties) - Gemini 1.5: uses responseSchema (OpenAPI format, legacy) This follows LiteLLM's philosophy of abstracting provider differences - users write the same code regardless of model version. * test(vertex): update json_schema tests to accept both responseSchema formats Gemini 2.x+ uses responseJsonSchema while Gemini 1.x uses responseSchema. Update tests to accept both formats since litellm now auto-selects based on model version.	2026-01-19 10:45:37 -08:00
Harshit Jain	99c4ba7adf	docs: fix bad examples from sdk (#19322)	2026-01-19 10:27:25 -08:00
Harshit Jain	1678f621db	feat: add retry_delay, exponential_backoff, and jitter to completion() (#19371)	2026-01-19 10:27:01 -08:00
Krish Dholakia	0862373b38	docs: add note about no limits on users/keys/teams in LiteLLM OSS (#19367) Co-authored-by: Cursor Agent <cursoragent@cursor.com>	2026-01-19 08:22:18 -08:00
Cesar Garcia	4d6a430adc	docs: update UI contributing guide (#19353) * docs: update UI contributing guide with correct commands - Replace outdated proxy_cli.py command with poetry run litellm - Add config.yaml example with required settings - Clarify that UI comes pre-built in the repo - Add two development options: Build Mode and Dev Mode (hot reload) - Note about redirect issues in Dev Mode * docs: add hot reload login flow and PR submission section - Document the 3000 -> 4000 -> 3000 login flow for hot reload - Reorder: Hot Reload as Option A, Build Mode as Option B - Add section 4 on submitting PRs - Add note that UI changes don't require tests * Update login flow navigation URL in contributing.md	2026-01-19 06:18:45 -08:00
Manuel Schweigert	29adf34313	Add ChatGPT subscription support and responses bridge (#19030) * Add ChatGPT subscription support and responses bridge * Fix typing import for responses bridge * Guard device code timestamp parsing * add /v1/messages endpoint to chatgpt model	2026-01-19 05:37:45 -08:00
Sameer Kankute	0a15f1b66a	Fix: stability image optional para	2026-01-19 09:05:52 +05:30
Krish Dholakia	e326b397c5	docs: Add Google Workload Identity Federation (WIF) documentation to Vertex AI (#19320) - Added new section documenting WIF support for Vertex AI authentication - Included SDK and Proxy configuration examples - Added sample WIF credentials file format for AWS federation - Mentioned LLM Credentials UI as an alternative for credential management - Added link to Google Cloud WIF documentation Co-authored-by: Cursor Agent <cursoragent@cursor.com>	2026-01-18 18:18:01 -08:00
YutaSaito	4dc9726db4	Merge pull request #19292 from BerriAI/litellm_docs_cursor-mcp [docs] add cursor with mcp	2026-01-18 14:59:08 +09:00
Ishaan Jaffer	2fda7a2534	docs fix	2026-01-17 18:33:19 -08:00
Ishaan Jaffer	2e840b1216	fix Deleted Keys Audit Table on UI	2026-01-17 18:12:18 -08:00
Alexsander Hamir	419423cf43	docs: add performance improvement section (#19300)	2026-01-17 18:06:14 -08:00
Ishaan Jaffer	4f2bdd33f4	docs claude code	2026-01-17 18:05:16 -08:00
Ishaan Jaffer	fcb5cfe9d9	add img to doc	2026-01-17 18:05:16 -08:00
yuneng-jiang	62b082b8b1	Merge pull request #19299 from BerriAI/1810_docs_yj [Docs] Deleted Key and Team Table	2026-01-17 17:51:03 -08:00
yuneng-jiang	8629ca819d	deleted keys docs	2026-01-17 17:49:12 -08:00
Ishaan Jaffer	46de47d542	doc fix	2026-01-17 17:35:52 -08:00
Ishaan Jaffer	7eecf81cdc	docs fix	2026-01-17 17:29:49 -08:00
Ishaan Jaffer	534fa9f4c0	docs fix	2026-01-17 17:26:58 -08:00
Ishaan Jaffer	26497b415b	docs fix	2026-01-17 17:21:31 -08:00
Ishaan Jaffer	c6998823c0	docs fix	2026-01-17 17:17:34 -08:00
Ishaan Jaffer	c158f83cff	docs fix	2026-01-17 17:17:13 -08:00
Ishaan Jaffer	4610d1d43c	docs fix	2026-01-17 17:16:22 -08:00
Ishaan Jaffer	7d24bbed42	qa fixes	2026-01-17 17:14:51 -08:00
Ishaan Jaffer	e15526a60e	fix	2026-01-17 17:13:22 -08:00
Ishaan Jaffer	c30b17aa9b	docs fix	2026-01-17 17:03:48 -08:00
Ishaan Jaff	1417b002a3	[Feat] Claude Code x LiteLLM WebSearch - QA Fixes to work with Claude Code (#19294) * fix websearch_interception_converted_stream * test_websearch_interception_no_tool_call_streaming * FakeAnthropicMessagesStreamIterator * LITELLM_WEB_SEARCH_TOOL_NAME * fixes tools def for litellm web search * fixes FakeAnthropicMessagesStreamIterator * test_litellm_standard_websearch_tool * use new hook for modfying before any transfroms from litellm * init WebSearchInterceptionLogger + ARCHITECTURE * fix config.yaml * init doc for claude code web search * docs fix * doc fix * fix mypy linting	2026-01-17 16:30:31 -08:00
YutaSaito	d28bf983eb	Merge pull request #19272 from Harshit28j/feature/panw-custom-violation-msg feat(panw_prisma_airs): add custom violation message support	2026-01-18 06:55:39 +09:00
Yuta Saito	18839a3600	docs: add cursor with mcp	2026-01-18 06:51:09 +09:00
yuneng-jiang	953e2736d4	Merge pull request #19291 from BerriAI/deleted_keys_docs_2 [Docs] Deleted Keys and Teams Docs	2026-01-17 13:20:48 -08:00


5287 Commits