litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-07-05 01:07:09 +00:00

Author	SHA1	Message	Date
Krrish Dholakia	dececb84c1	docs(image_edits.md): clarify proxy version support for image edits	2025-05-29 07:50:41 -07:00
Krrish Dholakia	077b69eafa	fix(key_management_endpoints.py): allow master key rotations for OSS users Closes https://github.com/BerriAI/litellm/issues/11210	2025-05-28 22:56:25 -07:00
Krish Dholakia	8fb2779c9e	build(model_prices_and_context_window.json): add supports parallel function calling to all gemini models (#11225 )	2025-05-28 22:32:02 -07:00
Krish Dholakia	1995c7aad5	fix(utils.py): support non default params for audio transcription (#11212 ) * fix(utils.py): support non default params for audio transcription allows passing provider specific params straight through on transcription calls * fix(gpt_transformation.py): fix o_series model routing call _transform_request on async event * refactor: refactor tests * test(test_azure_chat_o_series_transformation.py): add unit test for azure o series error * test: update test * test: update json * fix: fix mutiple keyword error	2025-05-28 22:24:02 -07:00
Krish Dholakia	ba39f9e360	Helicone base url support + fix for embedding cache hits on str input (#11211 ) * fix(helicone.py): add helicone api base support Fixes https://github.com/BerriAI/litellm/issues/10825 * test: add unit test for cache hit response on embedding calls * fix(caching_handler.py): fix handling cache hit on embedding when input is string Fixes LIT-197 * docs(helicone_integration.md): document new helicone api base param	2025-05-28 22:02:55 -07:00
Krrish Dholakia	75f591a6e2	docs(cli.md): add new skip_server_startup flag	2025-05-28 21:47:08 -07:00
Ishaan Jaff	2e5f635803	bump: version 1.71.2 → 1.71.3	2025-05-28 21:39:14 -07:00
Ishaan Jaff	2372492237	test: fix test_user_email_metrics	2025-05-28 21:30:47 -07:00
Ishaan Jaff	226c3b74ba	build: fix ui build	2025-05-28 21:26:18 -07:00
Ishaan Jaff	711c931c71	test: fix test_key_generation_with_object_permission	2025-05-28 21:24:35 -07:00
Ishaan Jaff	c5a82b03f4	Revert "(build) fix ui build" This reverts commit `73a82d24d8`.	2025-05-28 21:21:10 -07:00
Ishaan Jaff	7ac05087c3	Revert "build(deps): bump next from 14.2.26 to 15.2.4 in /ui/litellm-dashboard (#11216 )" This reverts commit `461e1a4751`.	2025-05-28 21:20:56 -07:00
Ishaan Jaff	f0618fd297	Revert "(ui) fix build" This reverts commit `90dbf4cf0b`.	2025-05-28 21:20:35 -07:00
Ishaan Jaff	90dbf4cf0b	(ui) fix build	2025-05-28 21:18:09 -07:00
Ishaan Jaff	edf85f2394	fix: code QA check	2025-05-28 21:16:17 -07:00
Ishaan Jaff	73a82d24d8	(build) fix ui build	2025-05-28 21:05:33 -07:00
Ishaan Jaff	745d9acdc3	fix: code QA check	2025-05-28 21:03:33 -07:00
Ishaan Jaff	6b527a77aa	ci/cd fix build ui	2025-05-28 21:00:39 -07:00
Ishaan Jaff	68a2466f3d	fix debug langfuse clients started (#11221 )	2025-05-28 19:16:52 -07:00
Ishaan Jaff	9e75409518	bump litellm proxy extras v1.71.2.dev1	2025-05-28 18:58:33 -07:00
Ishaan Jaff	5a32fb89b0	build: add vector stores to object permissions	2025-05-28 18:55:05 -07:00
Ishaan Jaff	df76b8c7b9	ui new build	2025-05-28 18:53:37 -07:00
Ishaan Jaff	b35806d0f0	ui new build	2025-05-28 18:53:01 -07:00
Ishaan Jaff	096e03bbc6	scim fix - key (#11220 )	2025-05-28 18:47:51 -07:00
Stephen Solka	3960e08dd7	Fix: Remove duplicate object_permission field in LiteLLM_TeamTable (#11219 ) This fixes a mypy error where 'object_permission' was already defined on line 1267. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-authored-by: Stephen Solka <stephen@standd.io> Co-authored-by: Claude <noreply@anthropic.com>	2025-05-28 18:40:45 -07:00
Ishaan Jaff	4e6c4beef8	[Feat] Permission management vector stores on LiteLLM Key, Team, Orgs (#11213 ) * fix: init commit for object permissions * fix: init commit for object permissions * fix: add vector_store_id to permissions * fix vector store selector * feat:add vector store permission mgmt * feat: ui add allowed vector stores dropdown * feat: add new vector store object permissions * testing: key mgmt * fix: stor vector store permissions on team * ui select vector store for teams * ui add vector store settings for orgs * feat: allow setting org vector store permissions * test: adding team permissions for vector stores	2025-05-28 16:58:53 -07:00
Vinnie-Singleton-NN	178a614d4a	Add sentry sample rate (#10283 ) * Add SENTRY_API_SAMPLE_RATE configuration option for Sentry SDK * removed print line * Update Sentry documentation with sample rate information --------- Co-authored-by: Vinnie <vinnie@Vinnies-MacBook-Pro.local>	2025-05-28 16:44:10 -07:00
Regis David Souza Mesquita	56c32ef503	Update mistral-medium prices and context sizes (#10729 ) * Update mistral-medium prices and context sizes While testing the Mistral model, I noticed a discrepancy in the pricing shown on the logs screen. After reviewing the code, I confirmed that the pricing values were incorrect. This PR corrects the input and output token pricing for the latest Mistral model and adds the newly released mistral-medium-2505 version. * Adds tool calling flag to mistral-medium * Adds mistral-medium price updates to the main model price file * Update model_prices_and_context_window_backup.json sets mistral medium alias to the old values as it probably points to the old version. * Update model_prices_and_context_window.json * Update model_prices_and_context_window_backup.json * Update model_prices_and_context_window.json	2025-05-28 16:42:28 -07:00
tanjiro	14cdf61537	Improve display of model tags on Keys, Teams and Org page (#11022 ) * added accordian for models tag * remove margin top * match badge styles same as on teams page * add badge accordian for organizations * badges fixed on keys page * fixed for singular and plural * fix merge conflict	2025-05-28 16:41:26 -07:00
dependabot[bot]	461e1a4751	build(deps): bump next from 14.2.26 to 15.2.4 in /ui/litellm-dashboard (#11216 ) Bumps [next](https://github.com/vercel/next.js) from 14.2.26 to 15.2.4. - [Release notes](https://github.com/vercel/next.js/releases) - [Changelog](https://github.com/vercel/next.js/blob/canary/release.js) - [Commits](https://github.com/vercel/next.js/compare/v14.2.26...v15.2.4) --- updated-dependencies: - dependency-name: next dependency-version: 15.2.4 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-05-28 16:39:20 -07:00
tanjiro	d8c7dd643d	Removed teamInfo call and used existing teamsList call (#11051 ) * removed teamInfo call and used existing teams list call * made perTeamInfo typesafe	2025-05-28 16:39:07 -07:00
tanjiro	86409976a9	put all proxy models if no models are selected. (#11187 )	2025-05-28 16:31:31 -07:00
Jacob Hagstedt P Suorra	7d344c9aee	Add price for gpt4o mini audio preview without date at the end of the name, just like for the non mini model (#11207 )	2025-05-28 16:30:05 -07:00
Ishaan Jaff	9a6d5c119e	feat: Allow Adding MCP Servers Through LiteLLM UI (#11208 ) * feat: MCP Servers with CRUD operations (#10699) * feat: mcp CRUD operations with authn/authz * feat: mcp server UI * mcp server page with overview, mcp tools, and settings page * Adding MCP Server flow * prisma generate before test * UI callbacks add/remove with api server refetch * test fix: poetry run prisma * feat: mcp server db and config connection * fix: MCPTool filter on description when not present * feat: mcp on UI and integrated with list tools * feat: Update mcp server endpoint * tests: Unit and integration tests for mcp management endpoints * fix: docs and ensuring global_mcp_manage up to date * ui: remove the mcp tools view * fix: ruff lint * fix: unit -> integration test area * fix(ui): remove left nav menu of previous tools --------- Co-authored-by: wagnerjt <wagnerjt@github.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> * fix: sync DB MCP tools with in memory * fix: sync DB MCP tools with in memory * fix: stop using prisma.models * fix: code qa check * fix: import MCP * fix: code QA checks * fix: code QA checks * fixes - only list tools for the specific MCP server * fix: only list MCP tools for selected server * fix linting error --------- Co-authored-by: Tyler Wagner <wagnerjt@users.noreply.github.com> Co-authored-by: wagnerjt <wagnerjt@github.com>	2025-05-28 16:29:27 -07:00
Krrish Dholakia	d8415f0c39	bump: version 1.71.1 → 1.71.2 v1.71.2-nightly	2025-05-28 11:34:16 -07:00
Krrish Dholakia	cc780ecf5a	fix(vertex_llm_base.py): handle default vertex location	2025-05-28 11:33:39 -07:00
Adnan Alkattan	18f7316b08	Add missing request_kwargs to function call (#11202 )	2025-05-28 10:55:41 -07:00
Ishaan Jaff	1301cfb06b	add well known MCP servers (#11209 )	2025-05-28 10:46:26 -07:00
Niko Izsak	85525c5c6b	Azure OIDC provider improvements + OIDC audience bug fix (#10054 ) * added support for custom scope in get_azure_ad_token_provider * if AZURE_FEDERATED_TOKEN_FILE not set, use azure_token_provider to retrive token with the oidc audiances as scope * fix bug where oidc audience that contains "/" won't be extract correctly * added tests for get_secret with oidc * moved tests to litellm tests folder * tes file naming aligned with source code * renamed test_main because it caused issue in the test in github workflow * updated docs * moved docs to the end of file * fix aws region in example config * renamed test file * added support for custom scope in get_azure_ad_token_provider * if AZURE_FEDERATED_TOKEN_FILE not set, use azure_token_provider to retrive token with the oidc audiances as scope * fix bug where oidc audience that contains "/" won't be extract correctly * added tests for get_secret with oidc * moved tests to litellm tests folder * tes file naming aligned with source code * renamed test_main because it caused issue in the test in github workflow * updated docs * moved docs to the end of file * fix aws region in example config * renamed test file * fix merge conflict resolution error * added support for custom scope in get_azure_ad_token_provider * if AZURE_FEDERATED_TOKEN_FILE not set, use azure_token_provider to retrive token with the oidc audiances as scope * fix bug where oidc audience that contains "/" won't be extract correctly * added tests for get_secret with oidc * moved tests to litellm tests folder * tes file naming aligned with source code * renamed test_main because it caused issue in the test in github workflow * updated docs * moved docs to the end of file * fix aws region in example config * renamed test file * added tests for get_secret with oidc * moved tests to litellm tests folder * tes file naming aligned with source code * renamed test_main because it caused issue in the test in github workflow * updated docs * moved docs to the end of file * renamed test file * fix merge conflict resolution error	2025-05-28 09:33:13 -07:00
Krish Dholakia	05e0a6d8d5	Return anthropic thinking blocks on streaming + VertexAI Minor Fixes & Improvements (Thinking, Global regions, Parallel tool calling) (#11194 ) * fix(anthropic/chat/handler.py): Fixes https://github.com/BerriAI/litellm/issues/10328 Adopts changes from https://github.com/BerriAI/litellm/pull/10329 * fix(vertex_and_google_ai_studio.py): don't set 'include thoughts' if thinking budget = 0 VertexAI raises errors * fix(vertex_llm_base.py): new function for deciding the api base, handles 'global' api base Fixes https://github.com/BerriAI/litellm/issues/11190 * fix(vertex_ai/partner_models): fix instrumentation for custom api base check * refactor(vertex_ai/partner): refactor function to keep below 50 LOC * fix(vertex_ai/gemini): remove parallel tool calls error for >1 tool - just ignore (prevent call from failing) * fix: fix linting error	2025-05-27 23:07:13 -07:00
RMeans	d181138ad5	Add Pangea provider to Guardrails hook (#10775 ) * added Pangea as a guardrail vendor * Adding output recipe, cleaning up some imports * Add Pangea guardrails tests * Add docs and sidebar * Move to use async_precall_hook instead of moderation hook * Update to "new" format (accept mode) Add response for redaction, support transforming request / response based off the original type of call (/v1/completions, /v1/chat/completions) * Fix tests * Fix unused imports * Fix .md --------- Co-authored-by: michael weinberger <michael.lee.weinberger@gmail.com>	2025-05-27 22:01:28 -07:00
Krish Dholakia	7072466775	VertexAI - `codeExecution` tool support + anyOf handling (#11195 ) * fix(vertex_and_google_ai_studio_gemini.py): handle both camel case and underscores in the tool for vertex ai code execution support vertex ai code execution * docs(vertex.md): add code execution example to vertex ai * fix(vertex_ai/common_utils.py): when anyof in field, just select anyof - don't include other k,v pairs - vertex throws error Fixes https://github.com/BerriAI/litellm/issues/11164 * fix(common_utils.py): add title field inside anyof - to retain some description Addresses https://github.com/BerriAI/litellm/issues/11164#issuecomment-2914728385	2025-05-27 21:23:14 -07:00
Krrish Dholakia	40a4e97d8e	test: update test	2025-05-27 21:15:22 -07:00
Ishaan Jaff	1a17755c60	test: fix test_ensure_initialize_azure_sdk_client_always_used	2025-05-27 19:02:11 -07:00
Ishaan Jaff	6c36dc269b	test: fix test_vertexai_model_garden_model_completion	2025-05-27 18:51:50 -07:00
Ishaan Jaff	a76d991952	[Fix]: Add cost tracking for image edits endpoint [OpenAI, Azure] (#11186 ) * fix: add cost tracking for image edits * fix: add cost tracking for azure image edits * fix: fix linting error * fix: fixes for background param * fix: image edit test fixes * fix: openai image edit cost tracking	2025-05-27 17:52:15 -07:00
Krrish Dholakia	95de41d10c	build: publish new litellm-proxy-extras file	2025-05-27 17:44:23 -07:00
Ishaan Jaff	0590b1eb3a	[Fix] Prometheus Metrics - Do not track end_user by default + expose flag to enable tracking end_user on prometheus (#11192 ) * fix: testing for disabling end user on metrics * fix: fixes for test_prometheus_factory * Delete litellm/model_prices_and_context_window_backup.json * fix: issues with merge conflicts * fix: test_get_end_user_id_for_cost_tracking_prometheus_only * Update tests/test_litellm/integrations/test_prometheus.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-05-27 17:06:58 -07:00
Krish Dholakia	4c82dd9b27	Ollama Chat - parse tool calls on streaming (#11171 ) * fix(user_api_key_auth.py): fix else block Fixes https://github.com/BerriAI/litellm/issues/11170 * refactor(ollama/chat): refactor to base config pattern easier to maintain fixes * fix(ollama/chat): support tool call parsing on streaming Closes https://github.com/BerriAI/litellm/issues/11104 * test: update import location * fix: cleanup unused import * fix: fix ruff check error * test: update import * test: update test on ci * ci: cleanup * fix: fix chekc * fix: fix api key check order * test: fix import * ci: fix script * test: fix imports * fix: fix tests	2025-05-27 16:14:49 -07:00
Krish Dholakia	64096aeb1e	Codestral - return litellm latency overhead on `/v1/completions` + Add '__contains__' support for ChatCompletionDeltaToolCall (#10879 ) * feat(codestral/completion): return litellm latency overhead for codestral enables easier debugging of latency issues * fix(types/utils.py): support _response_ms on hidden params model dump Fixes issue where 'x-litellm-overhead-duration-ms' wasn't being returned on text completion calls * fix(types/utils.py): add '__contains__' support for chatcompletiondeltatool call Fixes https://github.com/BerriAI/litellm/issues/7099 * fix: fix linting error * fix: fix linting error	2025-05-27 16:13:44 -07:00

22264 Commits