Derek Duenas
bbaf0af907
Grayswan guardrail passthrough on flagged ( #16891 )
...
* attempt to implement the passthrough feature
* Formatting and small change
* Fix formatting
* Format test file
---------
Co-authored-by: Xiaohan Fu <xiaohan@grayswan.ai >
2025-11-21 20:01:35 -08:00
Mubashir Osmani
db58f6aeb1
fix: arize phoenix logging ( #16301 )
...
* arize phx
* fix arize integration
* traces to specific project name
* fix
* look for http endpoint
2025-11-21 18:46:18 -08:00
yuneng-jiang
1ebe1fea37
Docs for Model Compare UI and Org Usage ( #16928 )
...
* Docs for Model Compare UI and Org Usage
* Fix typo in img path and add Model Compare to sidebars.js
* Updated to remove from 1.80 writeup
2025-11-21 16:45:55 -08:00
Ishaan Jaff
8e318dd06c
[Feat] New LLM Provider - Docker Model Runner ( #16948 )
...
* add DOCKER_MODEL_RUNNER
* add DockerModelRunnerChatConfig Transorm
* add docker_model_runner
* add docker_model_runner
* docs docker model runner
* add DockerModelRunnerChatConfig
* add docker_model_runner to providers
* test_completion_hits_correct_url_and_body
* fix sidebar
* TestDockerModelRunnerIntegration
* test_completion_with_custom_engine_and_host
* docs docker model runner
* docs fix
2025-11-21 16:09:32 -08:00
Suresh Kumar
5b4a848391
fix anthropic pass-through endpoint ( #16883 )
2025-11-21 16:00:05 -08:00
Cesar Garcia
1c65800f4a
Feat: add support for Grok 4.1 Fast models ( #16936 )
...
* feat: Add support for Grok 4.1 Fast models
Add new xAI Grok 4.1 Fast models optimized for high-performance agentic tool calling:
- xai/grok-4-1-fast (alias for grok-4-1-fast-reasoning)
- xai/grok-4-1-fast-reasoning (with reasoning capabilities)
- xai/grok-4-1-fast-reasoning-latest
- xai/grok-4-1-fast-non-reasoning (without reasoning for faster responses)
- xai/grok-4-1-fast-non-reasoning-latest
Features:
- Context window: 2,000,000 tokens
- Pricing: $0.20/1M input, $0.50/1M output tokens
- Cached tokens: $0.05/1M tokens
- Supports: Function calling, Structured outputs, Vision, Audio input, Web search, Reasoning
Fixes #16927
* docs: Add comprehensive Grok models documentation
- Add 'Supported Models' section highlighting new Grok 4.1 Fast models
- Include comparison guide for reasoning vs non-reasoning models
- Add complete model family table (Grok 4.1, 4, 3, Code, 2)
- Add features legend explaining capabilities
- Remove pricing details (link to xAI docs instead for current rates)
- Improve documentation clarity and consistency
Related to #16927
* docs: Minor corrections to xai.md
2025-11-21 15:57:55 -08:00
YutaSaito
7c4ef090c1
docs: fix mcp url format ( #16940 )
...
* docs: fix mcp url format
* fix: update Cursor MCP example to use url instead of server_url
2025-11-21 15:43:26 -08:00
colinlin-stripe
f9d8eeaf8e
[stripe] gemini 3 thought signatures in tool call id ( #16895 )
...
* though signature tool call id
* [stripe] refactor and tests
* [stripe] remove md and move to factory
* [stripe] remove redudant test
* [stripe] ran black formatting
* [stripe] add thought signature docs
* [stripe] remove unused import
2025-11-21 13:44:53 -08:00
YutaSaito
041ac054b6
feat: allow custom violation message for tool-permission guardrail ( #16916 )
2025-11-21 08:52:01 -08:00
Krrish Dholakia
e7751f0c12
docs: fix docs bug
2025-11-20 16:05:39 -08:00
Ishaan Jaff
57544f1662
[Feat] Adds IAM role assumption support for AWS Secret Manager ( #16887 )
...
* add AWS fields for KeyManagementSettings
* docs IAM roles
* use aws iam auth on secret manager v2
* fix: load_aws_secret_manager
* test_secret_manager_with_iam_role_settings
2025-11-20 12:38:48 -08:00
yuneng-jiang
9120a02474
Change favicon ( #16837 )
2025-11-19 20:38:22 -08:00
Krrish Dholakia
28cadaa123
docs: fix tags
2025-11-19 20:26:48 -08:00
Krrish Dholakia
0389f2d064
docs: cleanup
2025-11-19 20:26:48 -08:00
Krrish Dholakia
87be419559
docs(index.md): cleanup
2025-11-19 20:23:22 -08:00
Krrish Dholakia
778425f02f
docs: add initial blog post for Gemini 3 on LiteLLM
2025-11-19 20:22:24 -08:00
Krrish Dholakia
208027dc71
docs(ui.md): reorder ui page
2025-11-19 19:17:54 -08:00
Sameer Kankute
6fc7397dde
Add Vertex AI Image Edit Support ( #16828 )
...
* Add vertex ai image edit support
* Fix lint errors
2025-11-19 18:39:28 -08:00
Krrish Dholakia
08246bf908
fix: fix broken doc link
2025-11-19 08:35:38 -08:00
Rob Geada
afc9a763cb
Fix IBM Guardrails optional params, add extra_headers field ( #16771 )
...
Signed-off-by: Rob Geada <rob@geada.net >
2025-11-18 19:55:40 -08:00
Ishaan Jaff
35ab0e109c
[Docs] SSO - Manage User Roles via Azure App Roles ( #16796 )
...
* add img 2
* add app roles
* docs
2025-11-18 16:00:36 -08:00
yuneng-jiang
f9ec353b80
[Feature] UI - Allow setting base_url in API reference docs ( #16674 )
...
* Allow setting base_url in API reference docs
* Add logic to change base url for test key page
2025-11-18 11:27:28 -08:00
Sameer Kankute
acf206bec6
Add Day 0 gemini-3-pro-preview support ( #16719 )
...
* Add thinking signature support for gemini
* Add docs related to thinking signature
* remove double base64 import
* fix mypy errors
* fix litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py mypy
* Add new gemini 3 model and features
* Add docs related to gemini 3
* Update gemini 3 pricing
* fix llm translation tests
* fix mapped tests
2025-11-18 09:44:45 -08:00
Andrey
4dbbec233c
Snowflake provider support: added embeddings, PAT, account_id ( #15727 )
...
* added oauth mcp to docs
* added azure ai/grok-4 model family
* Revert "added oauth mcp to docs"
This reverts commit 950b7cef44f14b2db1429f6fbd32548a7c95d325.
* fix: arize ui integration
* need to remove a file
This reverts commit d6c877b73ac763464f204b77135f3786342373b7.
* snowflake support PAT, account_id and embeddings
* format
* test embeddings
* format
* complete test
* fix: add arize from ui
* updated clarifai functions to openai compatible (#15615 )
* fix: npm build errors
* update tests
* SnowflakeBaseConfig moved to utils
* rename pat_key => api_ke
* key_type=PAT => 'pat/key'
* fix if api_key is None
* doc update
* doc update
---------
Co-authored-by: mubashir1osmani <mubashir.osmani777@gmail.com >
Co-authored-by: Mubashir Osmani <ilikewafflesomcuh@gmail.com >
Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com >
Co-authored-by: mogith-pn <143642606+mogith-pn@users.noreply.github.com >
2025-11-17 20:27:46 -08:00
BenjaminLevy
070db16f87
this yaml actually works ( #16757 )
2025-11-17 20:09:44 -08:00
John Lathouwers
d93e13be36
OCI Provider - Add oci_endpoint_id Parameter for OCI Dedicated Endpoints #16723
2025-11-17 19:59:34 -08:00
Cesar Garcia
88b71863ae
docs(openai): Add gpt-5.1-codex and gpt-5.1-codex-mini models to documentation ( #16735 )
...
- Add gpt-5.1, gpt-5.1-codex, and gpt-5.1-codex-mini to OpenAI Chat Completion Models table
- Add gpt-5.1-codex and gpt-5.1-codex-mini to reasoning_effort supported models table
- Clarify that GPT-5-Codex models (gpt-5-codex, gpt-5.1-codex, gpt-5.1-codex-mini) do NOT support verbosity parameter
- Update verbosity section to exclude Codex variants from supported models list
Note: Model configurations already exist in model_prices_and_context_window.json
2025-11-17 19:50:58 -08:00
Ishaan Jaff
bdb1e16dcf
[Feat] AI Gateway Auth - Allow using JWTs for signing in with Proxy CLI ( #16756 )
...
* fix auth
* get_cli_jwt_auth_token
* fix linting
* test fixes
* docs
* test fixes
* fix refactor
2025-11-17 19:47:29 -08:00
Cesar Garcia
5ae99e2761
docs: respect system color scheme preference in Docusaurus ( #16748 )
...
Configure Docusaurus to automatically detect and respect the user's
system/browser color scheme preference (prefers-color-scheme).
This improves UX by matching the documentation theme to the user's
OS-level dark/light mode setting, while still allowing manual override
via the theme switcher.
Changes:
- Add colorMode configuration to docusaurus.config.js
- Enable respectPrefersColorScheme option
2025-11-17 19:46:56 -08:00
Krrish Dholakia
f2314bab34
docs: improve dev docs
2025-11-17 19:46:39 -08:00
Sameer Kankute
b9bc903536
remove deprecated embedding model ( #16724 )
2025-11-17 18:46:20 -08:00
Sameer Kankute
7cebc151b0
Add managed files support for responses API ( #16733 )
...
* Fix responses api with managed files
* fix litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py mypy
* fix litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py mypy
* fix mypy errors
2025-11-17 18:41:26 -08:00
Ishaan Jaff
83ea037bc9
[Feat] AI Gateway - Allow admins to disable, dynamic callback controls ( #16750 )
...
* add flag to allow_dynamic_callback_disabling
* fix EnterpriseCallbackControls
* test controls
* add docs on dynamic logging
2025-11-17 18:29:07 -08:00
Krrish Dholakia
fdb5df1f27
docs(managed_batches.md): update docs
2025-11-17 18:00:08 -08:00
Krrish Dholakia
e73ac28178
docs(ui_logs.md): link to spec showing what gets logged
2025-11-17 17:56:24 -08:00
Krrish Dholakia
1cdd4065a6
docs: fix docs
2025-11-17 17:54:26 -08:00
YutaSaito
6444bf366c
[Docs] mcp oauth flow details ( #16742 )
...
* docs: mcp oauth flow details
* fix: Exception (AsyncHTTPHandler.__init__() got an unexpected keyword argument 'follow_redirects')
2025-11-17 17:45:03 -08:00
Ishaan Jaff
08e115ecff
[Feat] Add SSML Support for Azure Text-to-Speech (AVA) ( #16747 )
...
* detect SSML in input
* transform_text_to_speech_request
* test_litellm_speech_with_ssml_passthrough
* add Passing Raw SSML
* fix is_ssml_input
2025-11-17 15:41:22 -08:00
Krrish Dholakia
88f5110e11
docs: improve docs
2025-11-17 09:54:13 -08:00
Krrish Dholakia
3c6f81e6ce
docs: simplify docs
2025-11-17 09:39:32 -08:00
Alexsander Hamir
0292b84dc4
docs: add missing details to benchmark comparison ( #16690 )
2025-11-15 20:12:32 -08:00
Krrish Dholakia
b79e92a21b
docs(index.md): document new agent hub
2025-11-15 18:49:51 -08:00
Krish Dholakia
39917f2ff7
Agents - AI Hub, make agents discoverable on model hub page for internal discovery ( #16678 )
...
* feat: initial commit adding agent hub to ui
* feat: add viewable agent hub
* feat: working support for making both config + db agents public via new 'public_agent_groups' list
* fix: agents.py
fix types
* feat: working PATCH endpoint for UI changes
* feat: add new agents panel with working crud
* refactor: refactor to show created_at on be/fe
* style: align new page with the agents table
* style: more style alignment logic
* feat: return if agent is public or not in /v1/agents
* feat: initial commit adding ui flow for making agents discoverable
* feat: new batch make public endpoint
* feat(public_model_hub.tsx): show public agents on public model hub table page
* fix(public_model_hub.tsx): add code examples for using the agent in a2a
* fix: fix indicating if agent has already been made public
* docs: document expected spec for agents is A2A
* docs: add agent hub docs
* docs: document making agents discoverable
* docs: add demo video to docs
* fix: fix ui linting errors
* fix: update tests
2025-11-15 18:25:16 -08:00
Ishaan Jaffer
6b9fbc5a36
fix docs
2025-11-15 16:58:47 -08:00
Ishaan Jaffer
f9b5820ba8
fix build
2025-11-15 16:53:12 -08:00
Ishaan Jaff
25cb873467
[UI] expose backend endpoint for callbacks settings ( #16698 )
...
* init json for
* add SQS fields
* fix params
* return callbacks configs
* test_get_callback_configs
* test_get_callback_configs
* vercel build fix
2025-11-15 16:44:17 -08:00
Alexsander Hamir
60691b771c
add perf update to release notes ( #16697 )
2025-11-15 16:36:49 -08:00
Ishaan Jaffer
c7807f4841
fix pkg
2025-11-15 16:26:56 -08:00
Ishaan Jaffer
d9fd7138f9
docs fix
2025-11-15 16:17:04 -08:00
Ishaan Jaffer
2a006cabac
docs fix
2025-11-15 16:11:31 -08:00