Commit Graph

22264 Commits

Author SHA1 Message Date
Krrish Dholakia dececb84c1 docs(image_edits.md): clarify proxy version support for image edits 2025-05-29 07:50:41 -07:00
Krrish Dholakia 077b69eafa fix(key_management_endpoints.py): allow master key rotations for OSS users
Closes https://github.com/BerriAI/litellm/issues/11210
2025-05-28 22:56:25 -07:00
Krish Dholakia 8fb2779c9e build(model_prices_and_context_window.json): add supports parallel function calling to all gemini models (#11225) 2025-05-28 22:32:02 -07:00
Krish Dholakia 1995c7aad5 fix(utils.py): support non default params for audio transcription (#11212)
* fix(utils.py): support non default params for audio transcription

allows passing provider specific params straight through on transcription calls

* fix(gpt_transformation.py): fix o_series model routing

call _transform_request on async event

* refactor: refactor tests

* test(test_azure_chat_o_series_transformation.py): add unit test for azure o series error

* test: update test

* test: update json

* fix: fix mutiple keyword error
2025-05-28 22:24:02 -07:00
Krish Dholakia ba39f9e360 Helicone base url support + fix for embedding cache hits on str input (#11211)
* fix(helicone.py): add helicone api base support

Fixes https://github.com/BerriAI/litellm/issues/10825

* test: add unit test for cache hit response on embedding calls

* fix(caching_handler.py): fix handling cache hit on embedding when input is string

Fixes LIT-197

* docs(helicone_integration.md): document new helicone api base param
2025-05-28 22:02:55 -07:00
Krrish Dholakia 75f591a6e2 docs(cli.md): add new skip_server_startup flag 2025-05-28 21:47:08 -07:00
Ishaan Jaff 2e5f635803 bump: version 1.71.2 → 1.71.3 2025-05-28 21:39:14 -07:00
Ishaan Jaff 2372492237 test: fix test_user_email_metrics 2025-05-28 21:30:47 -07:00
Ishaan Jaff 226c3b74ba build: fix ui build 2025-05-28 21:26:18 -07:00
Ishaan Jaff 711c931c71 test: fix test_key_generation_with_object_permission 2025-05-28 21:24:35 -07:00
Ishaan Jaff c5a82b03f4 Revert "(build) fix ui build"
This reverts commit 73a82d24d8.
2025-05-28 21:21:10 -07:00
Ishaan Jaff 7ac05087c3 Revert "build(deps): bump next from 14.2.26 to 15.2.4 in /ui/litellm-dashboard (#11216)"
This reverts commit 461e1a4751.
2025-05-28 21:20:56 -07:00
Ishaan Jaff f0618fd297 Revert "(ui) fix build"
This reverts commit 90dbf4cf0b.
2025-05-28 21:20:35 -07:00
Ishaan Jaff 90dbf4cf0b (ui) fix build 2025-05-28 21:18:09 -07:00
Ishaan Jaff edf85f2394 fix: code QA check 2025-05-28 21:16:17 -07:00
Ishaan Jaff 73a82d24d8 (build) fix ui build 2025-05-28 21:05:33 -07:00
Ishaan Jaff 745d9acdc3 fix: code QA check 2025-05-28 21:03:33 -07:00
Ishaan Jaff 6b527a77aa ci/cd fix build ui 2025-05-28 21:00:39 -07:00
Ishaan Jaff 68a2466f3d fix debug langfuse clients started (#11221) 2025-05-28 19:16:52 -07:00
Ishaan Jaff 9e75409518 bump litellm proxy extras v1.71.2.dev1 2025-05-28 18:58:33 -07:00
Ishaan Jaff 5a32fb89b0 build: add vector stores to object permissions 2025-05-28 18:55:05 -07:00
Ishaan Jaff df76b8c7b9 ui new build 2025-05-28 18:53:37 -07:00
Ishaan Jaff b35806d0f0 ui new build 2025-05-28 18:53:01 -07:00
Ishaan Jaff 096e03bbc6 scim fix - key (#11220) 2025-05-28 18:47:51 -07:00
Stephen Solka 3960e08dd7 Fix: Remove duplicate object_permission field in LiteLLM_TeamTable (#11219)
This fixes a mypy error where 'object_permission' was already defined on line 1267.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-authored-by: Stephen Solka <stephen@standd.io>
Co-authored-by: Claude <noreply@anthropic.com>
2025-05-28 18:40:45 -07:00
Ishaan Jaff 4e6c4beef8 [Feat] Permission management vector stores on LiteLLM Key, Team, Orgs (#11213)
* fix: init commit for object permissions

* fix: init commit for object permissions

* fix: add vector_store_id to permissions

* fix vector store selector

* feat:add vector store permission mgmt

* feat: ui add allowed vector stores dropdown

* feat: add new vector store object permissions

* testing: key mgmt

* fix: stor vector store permissions on team

* ui select vector store for teams

* ui add vector store settings for orgs

* feat: allow setting org vector store permissions

* test: adding team permissions for vector stores
2025-05-28 16:58:53 -07:00
Vinnie-Singleton-NN 178a614d4a Add sentry sample rate (#10283)
* Add SENTRY_API_SAMPLE_RATE configuration option for Sentry SDK

* removed print line

* Update Sentry documentation with sample rate information

---------

Co-authored-by: Vinnie <vinnie@Vinnies-MacBook-Pro.local>
2025-05-28 16:44:10 -07:00
Regis David Souza Mesquita 56c32ef503 Update mistral-medium prices and context sizes (#10729)
* Update mistral-medium prices and context sizes

While testing the Mistral model, I noticed a discrepancy in the pricing shown on the logs screen. After reviewing the code, I confirmed that the pricing values were incorrect.

This PR corrects the input and output token pricing for the latest Mistral model and adds the newly released mistral-medium-2505 version.

* Adds tool calling flag to mistral-medium

* Adds mistral-medium price updates to the main model price file

* Update model_prices_and_context_window_backup.json

sets mistral medium alias to the old values as it probably points to the old version.

* Update model_prices_and_context_window.json

* Update model_prices_and_context_window_backup.json

* Update model_prices_and_context_window.json
2025-05-28 16:42:28 -07:00
tanjiro 14cdf61537 Improve display of model tags on Keys, Teams and Org page (#11022)
* added accordian for models tag

* remove margin top

* match badge styles same as on teams page

* add badge accordian for organizations

* badges fixed on keys page

* fixed for singular and plural

* fix merge conflict
2025-05-28 16:41:26 -07:00
dependabot[bot] 461e1a4751 build(deps): bump next from 14.2.26 to 15.2.4 in /ui/litellm-dashboard (#11216)
Bumps [next](https://github.com/vercel/next.js) from 14.2.26 to 15.2.4.
- [Release notes](https://github.com/vercel/next.js/releases)
- [Changelog](https://github.com/vercel/next.js/blob/canary/release.js)
- [Commits](https://github.com/vercel/next.js/compare/v14.2.26...v15.2.4)

---
updated-dependencies:
- dependency-name: next
  dependency-version: 15.2.4
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-05-28 16:39:20 -07:00
tanjiro d8c7dd643d Removed teamInfo call and used existing teamsList call (#11051)
* removed teamInfo call and used existing teams list call

* made perTeamInfo typesafe
2025-05-28 16:39:07 -07:00
tanjiro 86409976a9 put all proxy models if no models are selected. (#11187) 2025-05-28 16:31:31 -07:00
Jacob Hagstedt P Suorra 7d344c9aee Add price for gpt4o mini audio preview without date at the end of the name, just like for the non mini model (#11207) 2025-05-28 16:30:05 -07:00
Ishaan Jaff 9a6d5c119e feat: Allow Adding MCP Servers Through LiteLLM UI (#11208)
* feat: MCP Servers with CRUD operations (#10699)

* feat: mcp CRUD operations with authn/authz

* feat: mcp server UI

* mcp server page with overview, mcp tools, and settings page

* Adding MCP Server flow

* prisma generate before test

* UI callbacks add/remove with api server refetch

* test fix: poetry run prisma

* feat: mcp server db and config connection

* fix: MCPTool filter on description when not present

* feat: mcp on UI and integrated with list tools

* feat: Update mcp server endpoint

* tests: Unit and integration tests for mcp management endpoints

* fix: docs and ensuring global_mcp_manage up to date

* ui: remove the mcp tools view

* fix: ruff lint

* fix: unit -> integration test area

* fix(ui): remove left nav menu of previous tools

---------

Co-authored-by: wagnerjt <wagnerjt@github.com>
Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>

* fix: sync DB MCP tools with in memory

* fix: sync DB MCP tools with in memory

* fix: stop using prisma.models

* fix: code qa check

* fix: import MCP

* fix: code QA checks

* fix: code QA checks

* fixes - only list tools for the specific MCP server

* fix: only list MCP tools for selected server

* fix linting error

---------

Co-authored-by: Tyler Wagner <wagnerjt@users.noreply.github.com>
Co-authored-by: wagnerjt <wagnerjt@github.com>
2025-05-28 16:29:27 -07:00
Krrish Dholakia d8415f0c39 bump: version 1.71.1 → 1.71.2 v1.71.2-nightly 2025-05-28 11:34:16 -07:00
Krrish Dholakia cc780ecf5a fix(vertex_llm_base.py): handle default vertex location 2025-05-28 11:33:39 -07:00
Adnan Alkattan 18f7316b08 Add missing request_kwargs to function call (#11202) 2025-05-28 10:55:41 -07:00
Ishaan Jaff 1301cfb06b add well known MCP servers (#11209) 2025-05-28 10:46:26 -07:00
Niko Izsak 85525c5c6b Azure OIDC provider improvements + OIDC audience bug fix (#10054)
* added support for custom scope in get_azure_ad_token_provider

* if AZURE_FEDERATED_TOKEN_FILE not set, use azure_token_provider to retrive token with the oidc audiances as scope

* fix bug where oidc audience that contains "/" won't be extract correctly

* added tests for get_secret with oidc

* moved tests to litellm tests folder

* tes file naming aligned with source code

* renamed test_main because it caused issue in the test in github workflow

* updated docs

* moved docs to the end of file

* fix aws region in example config

* renamed test file

* added support for custom scope in get_azure_ad_token_provider

* if AZURE_FEDERATED_TOKEN_FILE not set, use azure_token_provider to retrive token with the oidc audiances as scope

* fix bug where oidc audience that contains "/" won't be extract correctly

* added tests for get_secret with oidc

* moved tests to litellm tests folder

* tes file naming aligned with source code

* renamed test_main because it caused issue in the test in github workflow

* updated docs

* moved docs to the end of file

* fix aws region in example config

* renamed test file

* fix merge conflict resolution error

* added support for custom scope in get_azure_ad_token_provider

* if AZURE_FEDERATED_TOKEN_FILE not set, use azure_token_provider to retrive token with the oidc audiances as scope

* fix bug where oidc audience that contains "/" won't be extract correctly

* added tests for get_secret with oidc

* moved tests to litellm tests folder

* tes file naming aligned with source code

* renamed test_main because it caused issue in the test in github workflow

* updated docs

* moved docs to the end of file

* fix aws region in example config

* renamed test file

* added tests for get_secret with oidc

* moved tests to litellm tests folder

* tes file naming aligned with source code

* renamed test_main because it caused issue in the test in github workflow

* updated docs

* moved docs to the end of file

* renamed test file

* fix merge conflict resolution error
2025-05-28 09:33:13 -07:00
Krish Dholakia 05e0a6d8d5 Return anthropic thinking blocks on streaming + VertexAI Minor Fixes & Improvements (Thinking, Global regions, Parallel tool calling) (#11194)
* fix(anthropic/chat/handler.py): Fixes https://github.com/BerriAI/litellm/issues/10328

Adopts changes from https://github.com/BerriAI/litellm/pull/10329

* fix(vertex_and_google_ai_studio.py): don't set 'include thoughts' if thinking budget = 0

VertexAI raises errors

* fix(vertex_llm_base.py): new function for deciding the api base, handles 'global' api base

Fixes https://github.com/BerriAI/litellm/issues/11190

* fix(vertex_ai/partner_models): fix instrumentation for custom api base check

* refactor(vertex_ai/partner): refactor function to keep below 50 LOC

* fix(vertex_ai/gemini): remove parallel tool calls error for >1 tool - just ignore (prevent call from failing)

* fix: fix linting error
2025-05-27 23:07:13 -07:00
RMeans d181138ad5 Add Pangea provider to Guardrails hook (#10775)
* added Pangea as a guardrail vendor

* Adding output recipe, cleaning up some imports

* Add Pangea guardrails tests

* Add docs and sidebar

* Move to use async_precall_hook instead of moderation hook

* Update to "new" format (accept mode)

Add response for redaction, support transforming request / response
based off the original type of call (/v1/completions,
/v1/chat/completions)

* Fix tests

* Fix unused imports

* Fix .md

---------

Co-authored-by: michael weinberger <michael.lee.weinberger@gmail.com>
2025-05-27 22:01:28 -07:00
Krish Dholakia 7072466775 VertexAI - codeExecution tool support + anyOf handling (#11195)
* fix(vertex_and_google_ai_studio_gemini.py): handle both camel case and underscores in the tool for vertex ai code execution

support vertex ai code execution

* docs(vertex.md): add code execution example to vertex ai

* fix(vertex_ai/common_utils.py): when anyof in field, just select anyof - don't include other k,v pairs - vertex throws error

Fixes https://github.com/BerriAI/litellm/issues/11164

* fix(common_utils.py): add title field inside anyof - to retain some description

Addresses https://github.com/BerriAI/litellm/issues/11164#issuecomment-2914728385
2025-05-27 21:23:14 -07:00
Krrish Dholakia 40a4e97d8e test: update test 2025-05-27 21:15:22 -07:00
Ishaan Jaff 1a17755c60 test: fix test_ensure_initialize_azure_sdk_client_always_used 2025-05-27 19:02:11 -07:00
Ishaan Jaff 6c36dc269b test: fix test_vertexai_model_garden_model_completion 2025-05-27 18:51:50 -07:00
Ishaan Jaff a76d991952 [Fix]: Add cost tracking for image edits endpoint [OpenAI, Azure] (#11186)
* fix: add cost tracking for image edits

* fix: add cost tracking for azure image edits

* fix: fix linting error

* fix: fixes for background param

* fix: image edit test fixes

* fix: openai image edit cost tracking
2025-05-27 17:52:15 -07:00
Krrish Dholakia 95de41d10c build: publish new litellm-proxy-extras file 2025-05-27 17:44:23 -07:00
Ishaan Jaff 0590b1eb3a [Fix] Prometheus Metrics - Do not track end_user by default + expose flag to enable tracking end_user on prometheus (#11192)
* fix: testing for disabling end user on metrics

* fix: fixes for test_prometheus_factory

* Delete litellm/model_prices_and_context_window_backup.json

* fix: issues with merge conflicts

* fix: test_get_end_user_id_for_cost_tracking_prometheus_only

* Update tests/test_litellm/integrations/test_prometheus.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-05-27 17:06:58 -07:00
Krish Dholakia 4c82dd9b27 Ollama Chat - parse tool calls on streaming (#11171)
* fix(user_api_key_auth.py): fix else block

Fixes https://github.com/BerriAI/litellm/issues/11170

* refactor(ollama/chat): refactor to base config pattern

easier to maintain fixes

* fix(ollama/chat): support tool call parsing on streaming

Closes https://github.com/BerriAI/litellm/issues/11104

* test: update import location

* fix: cleanup unused import

* fix: fix ruff check error

* test: update import

* test: update test on ci

* ci: cleanup

* fix: fix chekc

* fix: fix api key check order

* test: fix import

* ci: fix script

* test: fix imports

* fix: fix tests
2025-05-27 16:14:49 -07:00
Krish Dholakia 64096aeb1e Codestral - return litellm latency overhead on /v1/completions + Add '__contains__' support for ChatCompletionDeltaToolCall (#10879)
* feat(codestral/completion): return litellm latency overhead for codestral

enables easier debugging of latency issues

* fix(types/utils.py): support _response_ms on hidden params model dump

Fixes issue where 'x-litellm-overhead-duration-ms' wasn't being returned on text c
ompletion calls

* fix(types/utils.py): add '__contains__' support for chatcompletiondeltatool call

Fixes https://github.com/BerriAI/litellm/issues/7099

* fix: fix linting error

* fix: fix linting error
2025-05-27 16:13:44 -07:00