3611 Commits

Author SHA1 Message Date
Krrish Dholakia dececb84c1 docs(image_edits.md): clarify proxy version support for image edits 2025-05-29 07:50:41 -07:00
Krish Dholakia ba39f9e360 Helicone base url support + fix for embedding cache hits on str input (#11211)
* fix(helicone.py): add helicone api base support

Fixes https://github.com/BerriAI/litellm/issues/10825

* test: add unit test for cache hit response on embedding calls

* fix(caching_handler.py): fix handling cache hit on embedding when input is string

Fixes LIT-197

* docs(helicone_integration.md): document new helicone api base param
2025-05-28 22:02:55 -07:00
Krrish Dholakia 75f591a6e2 docs(cli.md): add new skip_server_startup flag 2025-05-28 21:47:08 -07:00
Vinnie-Singleton-NN 178a614d4a Add sentry sample rate (#10283)
* Add SENTRY_API_SAMPLE_RATE configuration option for Sentry SDK

* removed print line

* Update Sentry documentation with sample rate information

---------

Co-authored-by: Vinnie <vinnie@Vinnies-MacBook-Pro.local>
2025-05-28 16:44:10 -07:00
Ishaan Jaff 9a6d5c119e feat: Allow Adding MCP Servers Through LiteLLM UI (#11208)
* feat: MCP Servers with CRUD operations (#10699)

* feat: mcp CRUD operations with authn/authz

* feat: mcp server UI

* mcp server page with overview, mcp tools, and settings page

* Adding MCP Server flow

* prisma generate before test

* UI callbacks add/remove with api server refetch

* test fix: poetry run prisma

* feat: mcp server db and config connection

* fix: MCPTool filter on description when not present

* feat: mcp on UI and integrated with list tools

* feat: Update mcp server endpoint

* tests: Unit and integration tests for mcp management endpoints

* fix: docs and ensuring global_mcp_manage up to date

* ui: remove the mcp tools view

* fix: ruff lint

* fix: unit -> integration test area

* fix(ui): remove left nav menu of previous tools

---------

Co-authored-by: wagnerjt <wagnerjt@github.com>
Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>

* fix: sync DB MCP tools with in memory

* fix: sync DB MCP tools with in memory

* fix: stop using prisma.models

* fix: code qa check

* fix: import MCP

* fix: code QA checks

* fix: code QA checks

* fixes - only list tools for the specific MCP server

* fix: only list MCP tools for selected server

* fix linting error

---------

Co-authored-by: Tyler Wagner <wagnerjt@users.noreply.github.com>
Co-authored-by: wagnerjt <wagnerjt@github.com>
2025-05-28 16:29:27 -07:00
Niko Izsak 85525c5c6b Azure OIDC provider improvements + OIDC audience bug fix (#10054)
* added support for custom scope in get_azure_ad_token_provider

* if AZURE_FEDERATED_TOKEN_FILE not set, use azure_token_provider to retrieve token with the oidc audiences as scope

* fix bug where oidc audience that contains "/" won't be extracted correctly

* added tests for get_secret with oidc

* moved tests to litellm tests folder

* test file naming aligned with source code

* renamed test_main because it caused issue in the test in github workflow

* updated docs

* moved docs to the end of file

* fix aws region in example config

* renamed test file

* added support for custom scope in get_azure_ad_token_provider

* if AZURE_FEDERATED_TOKEN_FILE not set, use azure_token_provider to retrieve token with the oidc audiences as scope

* fix bug where oidc audience that contains "/" won't be extracted correctly

* added tests for get_secret with oidc

* moved tests to litellm tests folder

* test file naming aligned with source code

* renamed test_main because it caused issue in the test in github workflow

* updated docs

* moved docs to the end of file

* fix aws region in example config

* renamed test file

* fix merge conflict resolution error

* added support for custom scope in get_azure_ad_token_provider

* if AZURE_FEDERATED_TOKEN_FILE not set, use azure_token_provider to retrieve token with the oidc audiences as scope

* fix bug where oidc audience that contains "/" won't be extracted correctly

* added tests for get_secret with oidc

* moved tests to litellm tests folder

* test file naming aligned with source code

* renamed test_main because it caused issue in the test in github workflow

* updated docs

* moved docs to the end of file

* fix aws region in example config

* renamed test file

* added tests for get_secret with oidc

* moved tests to litellm tests folder

* test file naming aligned with source code

* renamed test_main because it caused issue in the test in github workflow

* updated docs

* moved docs to the end of file

* renamed test file

* fix merge conflict resolution error
2025-05-28 09:33:13 -07:00
RMeans d181138ad5 Add Pangea provider to Guardrails hook (#10775)
* added Pangea as a guardrail vendor

* Adding output recipe, cleaning up some imports

* Add Pangea guardrails tests

* Add docs and sidebar

* Move to use async_precall_hook instead of moderation hook

* Update to "new" format (accept mode)

Add response for redaction, support transforming request / response
based off the original type of call (/v1/completions,
/v1/chat/completions)

* Fix tests

* Fix unused imports

* Fix .md

---------

Co-authored-by: michael weinberger <michael.lee.weinberger@gmail.com>
2025-05-27 22:01:28 -07:00
Krish Dholakia 7072466775 VertexAI - codeExecution tool support + anyOf handling (#11195)
* fix(vertex_and_google_ai_studio_gemini.py): handle both camel case and underscores in the tool for vertex ai code execution

support vertex ai code execution

* docs(vertex.md): add code execution example to vertex ai

* fix(vertex_ai/common_utils.py): when anyof in field, just select anyof - don't include other k,v pairs - vertex throws error

Fixes https://github.com/BerriAI/litellm/issues/11164

* fix(common_utils.py): add title field inside anyof - to retain some description

Addresses https://github.com/BerriAI/litellm/issues/11164#issuecomment-2914728385
2025-05-27 21:23:14 -07:00
Akim Tsvigun acaa80294c Integration with Nebius AI Studio added (#11143)
* integration with Nebius AI Studio added

* Merged with main

* Reviewer's comments resolved

* spelling error fixed

* accidental change reverted
2025-05-27 11:05:22 -07:00
k10 9a35c41462 added logprobs (#11161) 2025-05-26 11:21:47 -07:00
Ishaan Jaff 77d2615185 docs release notes 2025-05-24 21:24:44 -07:00
Ishaan Jaff 66c44bb9f3 docs notes 2025-05-24 21:00:01 -07:00
Ishaan Jaff 39c85c22de docs document all config settings 2025-05-24 20:48:09 -07:00
Krrish Dholakia bfbec06214 docs(index.md): cleanup 2025-05-24 19:53:59 -07:00
Ishaan Jaff 95b59cbf4d docs image edits 2025-05-24 19:46:10 -07:00
Ishaan Jaff 7fae80b4c6 docs image edits 2025-05-24 19:44:54 -07:00
Ishaan Jaff cadd86af95 docs image edit 2025-05-24 19:44:20 -07:00
Ishaan Jaff 1637413677 docs image edits 2025-05-24 19:43:48 -07:00
Krrish Dholakia cb58fc7f6d docs: doc improvements 2025-05-24 19:06:00 -07:00
Krrish Dholakia 061a2a730e docs(managed_finetuning.md): cleanup 2025-05-24 18:21:06 -07:00
Ishaan Jaff 7775e8471c docs benchmarks 2025-05-24 17:43:38 -07:00
Ishaan Jaff 39feb742cd docs aiohttp benchmarks 2025-05-24 17:41:10 -07:00
Ishaan Jaff 85bd3cfca1 fixes 2025-05-24 17:31:57 -07:00
Ishaan Jaff c73ac784d6 docs latest release 2025-05-24 17:31:19 -07:00
Ishaan Jaff c761c0ca2f docs release notes 2025-05-24 17:23:31 -07:00
Ishaan Jaff e957fbf38c docs release notes 2025-05-24 17:17:56 -07:00
Ishaan Jaff db96c8ca53 docs release notes 2025-05-24 17:17:44 -07:00
Ishaan Jaff e7ede28189 notes update 2025-05-24 17:17:30 -07:00
Ishaan Jaff c3f74b47f8 v1.71.1-stable - notes (#11133)
* draft notes

* fixes anthropic

* fixes release notes
2025-05-24 16:56:25 -07:00
Krrish Dholakia c725cbbde6 docs: add managed finetuning docs 2025-05-24 16:14:35 -07:00
Krrish Dholakia 76107bfb6b docs(litellm_managed_files.md): add tutorial on managing file permissions 2025-05-24 14:11:15 -07:00
David Lojudice Sobrinho 93e2c82642 fix(session): correctly place litellm_session_id at root level instead of metadata (#11088)
- Update documentation examples to show correct implementation
- Resolves #11065
2025-05-24 09:12:11 -07:00
Krish Dholakia a44bc9f613 Remove + Check for unsafe enterprise/ folder imports (#11107)
* refactor(litellm/): wrap enterprise package usage in safe try-excepts

prevents missing package error for OSS users

Fixes https://github.com/BerriAI/litellm/issues/10795

* ci(config.yml): add script to check for unsafe imports to ci/cd

Closes https://github.com/BerriAI/litellm/issues/10795

* fix: add missing docs

* test: test_openai_image_edit_litellm_sdk

* use n 4 for mapped tests (#11109)

* Fix/background health check (#10887)

* fix: improve health check logic by deep copying model list on each iteration

* test: add async test for background health check reflecting model list changes

* fix: validate health check interval before executing background health check

* fix: specify type for health check results dictionary

* fix(user_api_key_auth.py): handle user custom auth set with no custom settings

* bump: version 0.1.21 → 0.2.0

* ci(config.yml): run enterprise and litellm tests separately

* fix: fix linting error

* docs: add missing docs

* [Feat] Add content policy violation error mapping for image edits (#11113)

* feat: add image edit mapping for content policy violations

* test fix

* Expose `/list` and `/info` endpoints for Audit Log events (#11102)

* feat(audit_logging_endpoints.py): expose list endpoint to show all audit logs

make it easier for user to retrieve individual endpoints

* feat(enterprise/): add audit logging endpoint

* feat(audit_logging_endpoints.py): expose new GET `/audit/{id}` endpoint

make it easier to retrieve and view individual audit logs

* feat(key_management_event_hooks.py): correctly show the key of the user who initiated the change

* fix(key_management_event_hooks.py): add key rotations as an audit log event

* test(test_audit_logging_endpoints.py): add simple unit testing for audit log endpoint

* fix: testing fixes

* fix: fix ruff check

* [Feat] Use aiohttp transport by default - 97% lower median latency (#11097)

* fix: add flag for disabling use_aiohttp_transport

* feat: add _create_async_transport

* feat: fixes for transport

* add httpx-aiohttp

* feat: fixes for transport

* refactor: fixes for transport

* build: fix deps

* fixes: test fixes

* fix: ensure aiohttp does not auto set content type

* test: test fixes

* feat: add LiteLLMAiohttpTransport

* fix: fixes for responses API handling

* test: fixes for responses API handling

* test: fixes for responses API handling

* feat: fixes for transport

* fix: base embedding handler

* test: test_async_http_handler_force_ipv4

* test: fix failing deepeval test

* fix: add YARL for bedrock urls

* fix: issues with transport

* fix: comment out linting issues

* test fix

* test: XAI is unstable

* test: fixes for using respx

* test: XAI fixes

* test: XAI fixes

* test: infinity testing fixes

* docs(config_settings.md): document param

* test: test_openai_image_edit_litellm_sdk

* test: remove deprecated test

* bump respx==0.22.0

* test: test_xai_message_name_filtering

* test: fix anthropic test after bumping httpx

* use n 4 for mapped tests (#11109)

* fix: use 1 session per event loop

* test: test_client_session_helper

* fix: linting error

* fix: resolving GET requests on httpx 0.28.1

* test fixes proxy unit tests

* fix: add ssl verify settings

* fix: proxy unit tests

* fix: refactor

* tests: basic unit tests for aiohttp transports

* tests: fixes xai

---------

Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>

* test: cleanup redundant test

* fix: fix import

* fix: fix linting errors

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
Co-authored-by: JuHyun Bae <jhyun0408@nate.com>
2025-05-24 08:59:02 -07:00
Ishaan Jaff 86cdb8382b [Feat] Use aiohttp transport by default - 97% lower median latency (#11097)
* fix: add flag for disabling use_aiohttp_transport

* feat: add _create_async_transport

* feat: fixes for transport

* add httpx-aiohttp

* feat: fixes for transport

* refactor: fixes for transport

* build: fix deps

* fixes: test fixes

* fix: ensure aiohttp does not auto set content type

* test: test fixes

* feat: add LiteLLMAiohttpTransport

* fix: fixes for responses API handling

* test: fixes for responses API handling

* test: fixes for responses API handling

* feat: fixes for transport

* fix: base embedding handler

* test: test_async_http_handler_force_ipv4

* test: fix failing deepeval test

* fix: add YARL for bedrock urls

* fix: issues with transport

* fix: comment out linting issues

* test fix

* test: XAI is unstable

* test: fixes for using respx

* test: XAI fixes

* test: XAI fixes

* test: infinity testing fixes

* docs(config_settings.md): document param

* test: test_openai_image_edit_litellm_sdk

* test: remove deprecated test

* bump respx==0.22.0

* test: test_xai_message_name_filtering

* test: fix anthropic test after bumping httpx

* use n 4 for mapped tests (#11109)

* fix: use 1 session per event loop

* test: test_client_session_helper

* fix: linting error

* fix: resolving GET requests on httpx 0.28.1

* test fixes proxy unit tests

* fix: add ssl verify settings

* fix: proxy unit tests

* fix: refactor

* tests: basic unit tests for aiohttp transports

* tests: fixes xai

---------

Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>
2025-05-23 22:55:35 -07:00
Krrish Dholakia af930f22a3 docs: add missing docs 2025-05-23 21:55:30 -07:00
Mayank 8da898c55e [feature] ConfidentAI logging enabled for proxy and sdk (#10649)
* async success implemented

* fail async event

* sync events added

* docs added

* docs added

* test added

* style

* test

* .

* lock file generated due to tenacity change

* mypy errors

* resolved comments

* resolved comments

* resolved comments

* resolved comments

* style

* style

* resolved comments
2025-05-23 11:40:48 -07:00
Ishaan Jaff c8a0088970 [Fix] Reliability Fix - Removing code that was creating threads on errors (#11066)
* fix: only init langfuse if active

* fix: only init langfuse if active

* fix: add initialized_langfuse_clients count

* fix: add MAX_LANGFUSE_INITIALIZED_CLIENTS

* fix: use safe init langfuse

* test: init langfuse clients

* test: test_langfuse_not_initialized_returns_none_early

* docs MAX_LANGFUSE_INITIALIZED_CLIENTS

* fix: use correct langfuse callback

* fix: code qa
2025-05-22 18:04:15 -07:00
Ishaan Jaff 89daa1dbad docs add claude-4 models 2025-05-22 13:41:09 -07:00
Ishaan Jaff 2c90ca0189 docs fix example 2025-05-22 09:33:22 -07:00
Ishaan Jaff d95c3a16f3 docs fix ad hoc recognizer 2025-05-22 08:55:07 -07:00
Ishaan Jaff dd4a65b83a Feat: add MCP to Responses API and bump openai python sdk (#11029)
* feat: add MCP to responses API

* feat: bump openai version to 1.75.0

* docs MCP + responses API

* fixes: type checking

* fixes: type checking

* build: use latest openai 1.81.0

* fix: linting error

* fix: linting error

* fix: test

* fix: linting errors

* fix: test

* fix: test

* fix: linting

* Revert "fix: linting"

This reverts commit ebb19ff8cb1f8fcc3e224390e351676daccb33de.

* fix: linting
2025-05-22 07:24:10 -07:00
fzowl 73727abd84 Updating the available VoyageAI models in the docs (#11003)
* Refresh VoyageAI models and prices and context

* Refresh VoyageAI models and prices and context

* Refresh VoyageAI models and prices and context

* Updating the available VoyageAI models in the docs

* Updating the available VoyageAI models in the docs
2025-05-21 14:04:43 -07:00
Søren Mathiasen aa11ea3a9f support vertex_ai global endpoints for chat (#10658) 2025-05-21 07:58:36 -07:00
Ishaan Jaff 3a6802fef1 [Feat] - Add Support for Showing Passthrough endpoint Error Logs on LiteLLM UI (#10990)
* fix: add error logging for passthrough endpoints

* feat: add error logging for passthrough endpoints

* fix: post_call_failure_hook track errors on pt

* fix: use constant for MAXIMUM_TRACEBACK_LINES_TO_LOG

* docs MAXIMUM_TRACEBACK_LINES_TO_LOG

* test: ensure failure callback triggered

* fix: move _init_kwargs_for_pass_through_endpoint
2025-05-20 18:29:39 -07:00
Krrish Dholakia a676f69cab docs: cleanup 2025-05-19 22:10:04 -07:00
Jugal D. Bhatt c6260fe31a add cla to docs (#10963)
* add cla to docs

* cla docs clarity
2025-05-19 21:23:22 -07:00
Cole McIntosh 4381ddc963 Add new documentation files for LiteLLM (#10961)
- Created `llms-full.txt` with comprehensive details on LiteLLM features, usage, and supported models.
- Added `llms.txt` for quick access to key links and resources related to LiteLLM, including guides, release notes, and integration documentation.
2025-05-19 15:17:44 -07:00
Daniel Staiger be672b2b18 Update github.md (#10946)
Clarified the use of models from GitHub. (GitHub uses the model name <company>/<model-Name>, while litellm wants github/<model-Name>.)

Updated the example to a model that is actually supported / available on github right now
2025-05-19 13:21:50 -07:00
Jorge Piedrahita Ortiz fd40bc6ddd update sambanova models and parameters (#10900)
* add sambanova to completion input params table

* update sambanova supported args

* update sambanova supported models

* minor changes

* fix sambanova model list

* update sambanova models

* update sambanova models

* update sambanova docs

* minor change sambanova url

* update type to match OpenAIGPTConfig

* minor change
2025-05-18 20:45:20 -07:00
Earl St Sauver 6ac3c6ba35 Enable structured JSON schema outputs in LM Studio (for validated responses) (#10929)
- docs/my-website/docs/providers/lm_studio.md: add Structured Output section with JSON schema and Pydantic examples
- litellm/llms/lm_studio/chat/transformation.py: extend map_openai_params to handle `response_format` mappings (`json_schema`, `json_object`) and move them to optional_params
- litellm/utils.py: include `LM_STUDIO` in `supports_response_schema` list
- tests/litellm/llms/lm_studio/test_lm_studio_chat_transformation.py: add tests for Pydantic model and dict-based JSON schema handling

Co-authored-by: Earl St Sauver <estasuver@gmail.com>
2025-05-18 20:43:49 -07:00