Commit Graph

4726 Commits

Author SHA1 Message Date
Sameer Kankute 6fc7397dde Add Vertex AI Image Edit Support (#16828)
* Add vertex ai image edit support

* Fix lint errors
2025-11-19 18:39:28 -08:00
Krrish Dholakia 08246bf908 fix: fix broken doc link 2025-11-19 08:35:38 -08:00
Rob Geada afc9a763cb Fix IBM Guardrails optional params, add extra_headers field (#16771)
Signed-off-by: Rob Geada <rob@geada.net>
2025-11-18 19:55:40 -08:00
Ishaan Jaff 35ab0e109c [Docs] SSO - Manage User Roles via Azure App Roles (#16796)
* add img 2

* add app roles

* docs
2025-11-18 16:00:36 -08:00
yuneng-jiang f9ec353b80 [Feature] UI - Allow setting base_url in API reference docs (#16674)
* Allow setting base_url in API reference docs

* Add logic to change base url for test key page
2025-11-18 11:27:28 -08:00
Sameer Kankute acf206bec6 Add Day 0 gemini-3-pro-preview support (#16719)
* Add thinking signature support for gemini

* Add docs related to thinking signature

* remove double base64 import

* fix mypy errors

* fix litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py mypy

* Add new gemini 3 model and features

* Add docs related to gemini 3

* Update gemini 3 pricing

* fix llm translation tests

* fix mapped tests
2025-11-18 09:44:45 -08:00
Andrey 4dbbec233c Snowflake provider support: added embeddings, PAT, account_id (#15727)
* added oauth mcp to docs

* added azure ai/grok-4 model family

* Revert "added oauth mcp to docs"

This reverts commit 950b7cef44f14b2db1429f6fbd32548a7c95d325.

* fix: arize ui integration

* need to remove a file

This reverts commit d6c877b73ac763464f204b77135f3786342373b7.

* snowflake support PAT, account_id and embeddings

* format

* test embeddings

* format

* complete test

* fix: add arize from ui

* updated clarifai functions to openai compatible (#15615)

* fix: npm build errors

* update tests

* SnowflakeBaseConfig moved to utils

* rename pat_key => api_ke

* key_type=PAT => 'pat/key'

* fix if api_key is None

* doc update

* doc update

---------

Co-authored-by: mubashir1osmani <mubashir.osmani777@gmail.com>
Co-authored-by: Mubashir Osmani <ilikewafflesomcuh@gmail.com>
Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>
Co-authored-by: mogith-pn <143642606+mogith-pn@users.noreply.github.com>
2025-11-17 20:27:46 -08:00
BenjaminLevy 070db16f87 this yaml actually works (#16757) 2025-11-17 20:09:44 -08:00
John Lathouwers d93e13be36 OCI Provider - Add oci_endpoint_id Parameter for OCI Dedicated Endpoints #16723 2025-11-17 19:59:34 -08:00
Cesar Garcia 88b71863ae docs(openai): Add gpt-5.1-codex and gpt-5.1-codex-mini models to documentation (#16735)
- Add gpt-5.1, gpt-5.1-codex, and gpt-5.1-codex-mini to OpenAI Chat Completion Models table
- Add gpt-5.1-codex and gpt-5.1-codex-mini to reasoning_effort supported models table
- Clarify that GPT-5-Codex models (gpt-5-codex, gpt-5.1-codex, gpt-5.1-codex-mini) do NOT support verbosity parameter
- Update verbosity section to exclude Codex variants from supported models list

Note: Model configurations already exist in model_prices_and_context_window.json
2025-11-17 19:50:58 -08:00
Ishaan Jaff bdb1e16dcf [Feat] AI Gateway Auth - Allow using JWTs for signing in with Proxy CLI (#16756)
* fix auth

* get_cli_jwt_auth_token

* fix linting

* test fixes

* docs

* test fixes

* fix refactor
2025-11-17 19:47:29 -08:00
Cesar Garcia 5ae99e2761 docs: respect system color scheme preference in Docusaurus (#16748)
Configure Docusaurus to automatically detect and respect the user's
system/browser color scheme preference (prefers-color-scheme).

This improves UX by matching the documentation theme to the user's
OS-level dark/light mode setting, while still allowing manual override
via the theme switcher.

Changes:
- Add colorMode configuration to docusaurus.config.js
- Enable respectPrefersColorScheme option
2025-11-17 19:46:56 -08:00
Krrish Dholakia f2314bab34 docs: improve dev docs 2025-11-17 19:46:39 -08:00
Sameer Kankute b9bc903536 remove deprecated embedding model (#16724) 2025-11-17 18:46:20 -08:00
Sameer Kankute 7cebc151b0 Add managed files support for responses API (#16733)
* Fix responses api with managed files

* fix litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py mypy

* fix litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py mypy

* fix mypy errors
2025-11-17 18:41:26 -08:00
Ishaan Jaff 83ea037bc9 [Feat] AI Gateway - Allow admins to disable, dynamic callback controls (#16750)
* add flag to allow_dynamic_callback_disabling

* fix EnterpriseCallbackControls

* test controls

* add docs on dynamic logging
2025-11-17 18:29:07 -08:00
Krrish Dholakia fdb5df1f27 docs(managed_batches.md): update docs 2025-11-17 18:00:08 -08:00
Krrish Dholakia e73ac28178 docs(ui_logs.md): link to spec showing what gets logged 2025-11-17 17:56:24 -08:00
Krrish Dholakia 1cdd4065a6 docs: fix docs 2025-11-17 17:54:26 -08:00
YutaSaito 6444bf366c [Docs] mcp oauth flow details (#16742)
* docs: mcp oauth flow details

* fix: Exception (AsyncHTTPHandler.__init__() got an unexpected keyword argument 'follow_redirects')
2025-11-17 17:45:03 -08:00
Ishaan Jaff 08e115ecff [Feat] Add SSML Support for Azure Text-to-Speech (AVA) (#16747)
* detect SSML in input

* transform_text_to_speech_request

* test_litellm_speech_with_ssml_passthrough

* add Passing Raw SSML

* fix is_ssml_input
2025-11-17 15:41:22 -08:00
Krrish Dholakia 88f5110e11 docs: improve docs 2025-11-17 09:54:13 -08:00
Krrish Dholakia 3c6f81e6ce docs: simplify docs 2025-11-17 09:39:32 -08:00
Alexsander Hamir 0292b84dc4 docs: add missing details to benchmark comparison (#16690) 2025-11-15 20:12:32 -08:00
Krrish Dholakia b79e92a21b docs(index.md): document new agent hub 2025-11-15 18:49:51 -08:00
Krish Dholakia 39917f2ff7 Agents - AI Hub, make agents discoverable on model hub page for internal discovery (#16678)
* feat: initial commit adding agent hub to ui

* feat: add viewable agent hub

* feat: working support for making both config + db agents public via new 'public_agent_groups' list

* fix: agents.py

fix types

* feat: working PATCH endpoint for UI changes

* feat: add new agents panel with working crud

* refactor: refactor to show created_at on be/fe

* style: align new page with the agents table

* style: more style alignment logic

* feat: return if agent is public or not in /v1/agents

* feat: initial commit adding ui flow for making agents discoverable

* feat: new batch make public endpoint

* feat(public_model_hub.tsx): show public agents on public model hub table page

* fix(public_model_hub.tsx): add code examples for using the agent in a2a

* fix: fix indicating if agent has already been made public

* docs: document expected spec for agents is A2A

* docs: add agent hub docs

* docs: document making agents discoverable

* docs: add demo video to docs

* fix: fix ui linting errors

* fix: update tests
2025-11-15 18:25:16 -08:00
Ishaan Jaffer 6b9fbc5a36 fix docs 2025-11-15 16:58:47 -08:00
Ishaan Jaffer f9b5820ba8 fix build 2025-11-15 16:53:12 -08:00
Ishaan Jaff 25cb873467 [UI] expose backend endpoint for callbacks settings (#16698)
* init json for

* add SQS fields

* fix params

* return callbacks configs

* test_get_callback_configs

* test_get_callback_configs

* vercel build fix
2025-11-15 16:44:17 -08:00
Alexsander Hamir 60691b771c add perf update to release notes (#16697) 2025-11-15 16:36:49 -08:00
Ishaan Jaffer c7807f4841 fix pkg 2025-11-15 16:26:56 -08:00
Ishaan Jaffer d9fd7138f9 docs fix 2025-11-15 16:17:04 -08:00
Ishaan Jaffer 2a006cabac docs fix 2025-11-15 16:11:31 -08:00
Ishaan Jaffer 03ceb75d4e sec fix 2025-11-15 15:36:05 -08:00
Ishaan Jaff e325b1833d Docs - v1.80.0 (#16694)
* fix pkg on docs

* refactor 1

* fix refactor

* docs

* fix

* fixes
2025-11-15 14:42:43 -08:00
Sameer Kankute a2e3b942dc Vector store files Stable Release (#16643)
* Add support for vector store files endpoints (#16490)

* Add base code for vector store integration

* fix azure related tests and linting error

* fix mypy errors

* Add vector store files documentation

* fix mapped tests

* Add bytedance and ideogram support in fal ai (#16636)

* Add fal ai flux pro v1.1 support (#16578)

* Add fal ai flux pro v1.1 support

* Add tests and docs

---------

Co-authored-by: Ishaan Jaffer <ishaanjaffer0324@gmail.com>
2025-11-15 13:00:33 -08:00
Sameer Kankute 59d17cc468 Litellm anthropic contex management param support (#16528)
* Add support for anthropic context_management param

* Add support for anthropic context_management param

* Add context management in response

* fix review changes

---------

Co-authored-by: Ishaan Jaffer <ishaanjaffer0324@gmail.com>
2025-11-15 13:00:09 -08:00
Krish Dholakia 586f126395 Prometheus - make OSS (#16689)
* refactor: remove enterprise license from prometheus metrics

* refactor: complete refactoring imports

* refactor: complete prometheus refactoring

* bump: version 1.80.0 → 1.80.1

* bump: version 1.80.1 → 1.80.2

* build: update litellm lock file

* docs(enterprise.md): cleanup enterprise docs

* docs: more docs cleanup

* docs: doc cleanup

* fix: fix imports

* fix: fix linting error
2025-11-15 11:06:34 -08:00
Cesar Garcia cc72037cec feat(openai): Add verbosity parameter support for GPT-5 family models (#16660)
OpenAI's GPT-5 model family supports a verbosity parameter to control
the length and detail of responses. This parameter accepts three values:
'low', 'medium', or 'high'.

Changes:
- Added verbosity parameter to completion() and acompletion() signatures
- Added verbosity to DEFAULT_CHAT_COMPLETION_PARAM_VALUES in constants.py
- Added verbosity to get_optional_params() in utils.py
- Added verbosity to GPT-5 supported params list
- Updated OpenAI docs with verbosity usage examples
- Added comprehensive test for verbosity parameter

Supported models: gpt-5, gpt-5.1, gpt-5-mini, gpt-5-nano, gpt-5-codex, gpt-5-pro
2025-11-14 19:38:27 -08:00
Ishaan Jaffer efb80ffab7 fix pkg lock 2025-11-14 18:43:41 -08:00
Ishaan Jaffer a1286fb609 security fix 2025-11-14 18:43:41 -08:00
YutaSaito f487f4e3a9 feat: add dynamic OAuth2 metadata discovery for MCP servers (#16676)
* feat: add dynamic OAuth2 metadata discovery for MCP servers

* fix: lint error
2025-11-14 18:14:43 -08:00
Ishaan Jaffer 7faff1a7c0 pkg lock 2025-11-14 17:43:43 -08:00
Ishaan Jaffer 936bed056b security fix 2025-11-14 17:43:00 -08:00
Ishaan Jaff 8a43fbe8f7 Revert "[Feat] VertexAI - Add BGE Embeddings support (#16033)" (#16677)
This reverts commit 7133488282.
2025-11-14 17:41:06 -08:00
Ishaan Jaff bffc36794f docs fix spend tracking (#16675) 2025-11-14 17:22:21 -08:00
Ishaan Jaff efa4ec9294 [Docs] Add docs on APIs for model access management (#16673)
* docs access groups

* docs ui for access groups

* fix code snippets

* docs model access
2025-11-14 17:07:54 -08:00
Ishaan Jaff 2bd6d0d82b [Feat] Bedrock Batches - Add support for custom KMS encryption keys in Bedrock Batch operations (#16662)
* add s3_encryption_key_id

* add s3EncryptionKeyId to BedrockS3OutputDataConfig

* use s3EncryptionKeyId in bedrock output

* docs s3_encryption_key_id

* test_bedrock_batch_with_encryption_key_in_post_request
2025-11-14 16:00:43 -08:00
fzowl b1922e19f8 Voyageai pricing and doc update (#16641)
* Refresh VoyageAI models and prices and context

* Refresh VoyageAI models and prices and context

* Refresh VoyageAI models and prices and context

* Updating the available VoyageAI models in the docs

* Updating the available VoyageAI models in the docs

* Updating the model prices and the docs
2025-11-14 14:09:11 -08:00
Cesar Garcia 65061bafc7 feat(openai): Add support for reasoning_effort='none' in GPT-5.1 (#16658)
* feat(openai): Add support for reasoning_effort='none' in GPT-5.1

OpenAI's GPT-5.1 introduced a new reasoning effort parameter 'none'
which replaces the previous 'minimal' setting for faster, lower-latency
responses. This is now the default setting for GPT-5.1.

Changes:
- Updated REASONING_EFFORT type to include 'none' value
- Added GPT-5.1, GPT-5-mini, and GPT-5-nano to documentation
- Updated docs to reflect 'none' as GPT-5.1's default reasoning effort
- Added test to verify reasoning_effort='none' passes through correctly

Fixes #16633

* feat(responses): Add support for reasoning_effort='none' in Responses API transformation
2025-11-14 13:41:49 -08:00