Commit Graph

5401 Commits

Author SHA1 Message Date
Sameer Kankute 7fa4d090ec Add doc for chat completion web search 2026-02-09 13:51:26 +05:30
michelligabriele 8e1201ca3d docs: improve Okta SSO setup guide with step-by-step instructions (#20353)
* docs: improve Okta SSO setup guide with step-by-step instructions

- Add clear steps for creating OIDC app in Okta Admin Console
- Document critical Authorization Server Access Policy configuration
- Add troubleshooting table for common errors
- Link to official Okta documentation for detailed UI instructions
- Fix incorrect OAuth endpoint examples (was using Auth0 URLs)

* docs: add Okta console screenshots for SSO setup guide

- Add screenshot showing Sign-in redirect URI configuration
- Add screenshot showing Client Credentials location
- Add screenshot showing Authorization Server Access Policy setup

* docs: replace Okta screenshots with annotated Scribe versions

- Replace screenshots with Scribe-annotated versions highlighting key areas
- Add step-by-step screenshots for Authorization Server Access Policy setup
- Remove old unannotated screenshots
2026-02-04 18:39:51 -08:00
Shivam Rawat 795ba8efe3 Merge pull request #20443 from BerriAI/docs_langfuse_multi_project_cred_support
langfuse doc update
2026-02-04 15:55:04 -08:00
shivam 555e1da9b4 langfuse doc update 2026-02-04 15:53:24 -08:00
Sameer Kankute b2feedc469 Merge pull request #20318 from BerriAI/litellm_oss_staging_02_03_2026
feat(guardrails): implement team-based isolation guardrails mgmnt (#1…
2026-02-04 17:49:30 +05:30
Sameer Kankute bfc0c4b4ab Merge pull request #20396 from BerriAI/litellm_copilot_kit_sdk
[DOCS] Add copilotkit sdk doc as supported agents sdk
2026-02-04 13:41:08 +05:30
Sameer Kankute cdd4569064 Apply suggestion from @greptile-apps[bot]
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
2026-02-04 11:56:17 +05:30
Sameer Kankute 465bbf3514 Add copilotkit sdk doc as supported agents sdk 2026-02-04 11:53:49 +05:30
Sameer Kankute 8109413a54 Add copilotkit sdk doc as supported agents sdk 2026-02-04 11:52:17 +05:30
Ishaan Jaffer 7b8e3c7736 docs fix 2026-02-03 20:11:48 -08:00
Ishaan Jaff da4cf4942f [Feat] Add xAI /realtime API Support - works with LiveKitSDK (#20381)
* init: _realtime_health_check + routing

* refactor: OpenAIRealtime

* refactor: XAI_API_BASE

* feat: XAIRealtime

* init feat: XAIRealtime

* OpenAIRealtime

* TestXAIRealtime

* test fixes

* test OAI

* TEST xAI, OAI

* clean realtime jobs

* refactor

* test XAI

* docs xAI

* fix xAI

* fix lint errors

* test_async_realtime_url_contains_model

* test fix

* document test changes

* _realtime_health_check

* docs xai realtime

* fix handlers

* add additional_headers

* fix
2026-02-03 19:58:28 -08:00
Krish Dholakia 7056d9984e Custom Code Guardrails UI Playground (#20377)
* feat(guardrails/): allow custom code execution for guardrails

first step in allowing teams to submit custom code for guardrails

* feat: custom_code_guardrail.md

support passing custom code for guardrails

* feat: initial commit adding ui for custom code guardrails

allows users to write guardrails based on custom code

* feat: expose new test custom code guardrail endpoint

allows ui testing playground to sanity check if guardrail is working as expected

* fix: fix linting errors

* fix: fix max recursion check

* fix: fix linting error
2026-02-03 19:57:24 -08:00
Sameer Kankute 25fa1ad4e7 Merge pull request #20386 from naaa760/fix/extra-head-chat-comp-brid
fix(proxy): forward extra headers in chat
2026-02-04 09:11:43 +05:30
Ishaan Jaff d267c69086 [Feat] Use A2A registered agents with /chat/completions (#20362)
* test_a2a_registry_integration

* fix: render agents on model dropdown on UI

* init append_agents_to_model_group

* route_a2a_agent_request

* is_a2a_agent_model

* route_a2a_agent_request

* fix: error handling

* docs A2A usage

* docs fix

* feat: working A2a streaming

* fix transform
2026-02-03 15:25:38 -08:00
Xiaohan Fu 2b25d03046 Fix fail-open for grayswan and pass metadata to cygnal api endpoint (#19837)
* fix fail-open for grayswan; pass metadata to cygnal api endpoint; update docs

* pass litellm_metadata to cygnal in payload

* switch error msg to const, and clean exception handling.

* update pyproject.toml as requested

* Revert "update pyproject.toml as requested"

This reverts commit 4eece154d056ba33689a5584c86c8fc352bb7cdd.
2026-02-03 14:41:31 -08:00
Sameer Kankute b7f0d05dfd Merge pull request #20337 from BerriAI/main
update 02 staging PR
2026-02-03 17:08:01 +05:30
Sameer Kankute 017b78de40 Fix code quality tests 2026-02-03 15:01:17 +05:30
Cesar Garcia a904c3f40d fix(github_copilot): preserve system prompts and auto-inject headers (#20113)
- Remove system-to-assistant message conversion (API now supports system prompts)
- Auto-inject required Copilot headers in chat completions (same as /responses)
- Deprecate disable_copilot_system_to_assistant flag
- Update docs to remove manual extra_headers requirement

Fixes #19873
2026-02-02 22:05:44 -08:00
Ishaan Jaffer 7ae980410b docs fix 2026-02-02 19:50:22 -08:00
Ishaan Jaffer 5aa8725c63 docs Tracing Tools 2026-02-02 19:48:00 -08:00
Sameer Kankute 333419b4d2 Add documentation correctly for nova sonic 2026-02-03 09:03:27 +05:30
Ishaan Jaff 0ef506a54a Litellm docs mcp filtering semantic (#20316)
* init: SemanticMCPToolFilter

* init: SemanticToolFilterHook

* test_e2e_semantic_filter

* mock tests: test_semantic_filter_basic_filtering

* Update litellm/proxy/_experimental/mcp_server/semantic_tool_filter.py

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

* refactor folder/file organization

* docs fix

* fix filter

* fix: filter_tools

* fix linting tool filter

* initialize_from_config

* fix: _expand_mcp_tools

* _initialize_semantic_tool_filter

* working: async_post_call_response_headers_hook

* clean up semantic tool filter

* add _initialize_semantic_tool_filter

* build_router_from_mcp_registry

* _get_tools_by_names

* fix config

* async_post_call_response_headers_hook

* docs mcp filter

* docs fix

---------

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
2026-02-02 18:29:07 -08:00
Alexsander Hamir 1b9631d260 Add blog post: Achieving Sub-Millisecond Proxy Overhead (#20309) 2026-02-02 17:46:36 -08:00
shin-bot-litellm 0614ff9fda docs: add Prisma migration troubleshooting guide (#20300)
* docs: add Prisma migration troubleshooting guide

Add troubleshooting documentation for common Prisma migration errors
encountered when upgrading/downgrading LiteLLM proxy versions.

Covers:
- 'relation does not exist' errors after version rollback
- Blocked migrations from previous failures
- Migration state mismatch after version rollback
- General tips for prisma migrate resolve, db push, and migrate deploy

* docs: simplify prisma migration troubleshooting - focus on delete + restart
2026-02-02 14:39:18 -08:00
Ishaan Jaff 73691fb373 Model request tags documentation (#20290)
* Add request tags documentation for spend tracking

- Add new concise doc explaining how to tag model requests
- Include Python SDK and cURL examples
- Show where tags appear in spend logs
- Add common use cases table (AWS accounts, teams, projects)
- Include how to set default tags on API keys
- Add to Spend Tracking section in sidebar

Co-authored-by: ishaan <ishaan@berri.ai>

* Simplify request tags doc for AI Gateway usage

- Focus on config.yaml setup with default_key_generate_params
- Show both request body and header methods for sending tags
- Remove SDK examples, keep concise cURL examples
- Streamline for quick reference

Co-authored-by: ishaan <ishaan@berri.ai>

* Update request tags doc to show model-level config

- Set tags directly on model deployments in litellm_params
- Requests just specify model, tags applied automatically
- Use clear naming: AWS_IAM_PROD, AWS_IAM_DEV

Co-authored-by: ishaan <ishaan@berri.ai>

---------

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
Co-authored-by: ishaan <ishaan@berri.ai>
2026-02-02 11:32:00 -08:00
shin-bot-litellm 0a1b98895b docs: Add FAQ for setting up and verifying LITELLM_LICENSE (#20284)
* docs: add FAQ for setting up and verifying LITELLM_LICENSE

Added two new FAQ entries to the Enterprise docs page:
- How to set up your Enterprise License (LITELLM_LICENSE) via .env, Docker, or docker-compose
- How to verify the license is active by checking for 'Enterprise Edition' in the Swagger UI

* docs: trim license FAQ to essential steps only
2026-02-02 11:03:45 -08:00
Sameer Kankute 7329fa8e7a fix: litellm_oss_staging_01_31_2026_3 failing tests 2026-02-02 18:50:58 +05:30
Harshit Jain 6d86808eaf feat: enforce model-level TPM/RPM limits (enforce_model_rate_limits) … (#19230)
* feat: enforce model-level TPM/RPM limits (enforce_model_rate_limits) flag

* fix lint errors
2026-02-02 18:18:46 +05:30
Sameer Kankute 27b4052231 Merge pull request #20244 from BerriAI/litellm_nova_sonic_imp_2_feb
[Feat] Add support for nova sonic speech-to-speech model
2026-02-02 18:04:02 +05:30
Sameer Kankute 19f6640309 Merge pull request #20249 from BerriAI/litellm_add_0_budget_model_bypass2
[Feat] Add support for 0 cost models
2026-02-02 17:45:05 +05:30
Sameer Kankute fff35ee032 Merge pull request #20257 from BerriAI/litellm_slack_alert_fix_jfeb_02
Fix: Slack alert issue
2026-02-02 17:43:05 +05:30
Sameer Kankute 72482c0cb5 Fix: Slack alert issue 2026-02-02 16:49:10 +05:30
Sameer Kankute c6f178eeae Update Vertex AI Text to Speech doc to show use of audio 2026-02-02 16:15:48 +05:30
Sameer Kankute 14c2932387 Add docs on Zero-Cost Models 2026-02-02 13:29:37 +05:30
Sameer Kankute ea6c31a02a Add documentation on nova sonic 2026-02-02 12:19:04 +05:30
shin-bot-litellm df387c39f6 docs: Update v1.81.6 release notes - focus on Logs v2 with Tool Call Tracing (#20225)
- Updated title to highlight Logs v2 feature
- Simplified Key Highlights to focus on Logs v2 / tool call tracing
- Rewrote Logs v2 description with improved language style
- Removed Claude Agents SDK and RAG API from key highlights section
- TODO: Add image (logs_v2_tool_tracing.png)

Co-authored-by: shin-bot-litellm <shin-bot-litellm@users.noreply.github.com>
2026-01-31 19:29:06 -08:00
shin-bot-litellm 93a0631ea3 docs: fix dead links in v1.81.6 release notes (#20218)
- Fix /docs/search/index -> /docs/search (404 error)
- Fix /cookbook/ -> GitHub cookbook URL (404 error)

Co-authored-by: shin-bot-litellm <shin-bot-litellm@users.noreply.github.com>
2026-01-31 17:46:56 -08:00
Ishaan Jaffer 8a57ee5efb docs fix 2026-01-31 17:34:25 -08:00
Ishaan Jaffer 01b96f1272 fixes 2026-01-31 17:15:53 -08:00
yuneng-jiang 0bf471b071 Merge pull request #20198 from BerriAI/spend_logs_docs
[Doc] Fixing Image
2026-01-31 15:28:50 -08:00
yuneng-jiang 93dfac78ed doc fix 2026-01-31 15:27:21 -08:00
yuneng-jiang 543e85a5b4 Merge pull request #20197 from BerriAI/spend_logs_docs
[Docs] UI Spend Logs Settings Docs
2026-01-31 15:20:31 -08:00
yuneng-jiang af015fe4f0 UI spend logs setting docs 2026-01-31 15:16:59 -08:00
ryan-crabbe 82383cde74 docs/blog index page (#20188)
* docs: add card-based blog index page for mobile navigation

Fixes #20100 - the blog landing page showed post content directly
instead of an index, with no way to navigate between posts on mobile.

- Swizzle BlogListPage with card-based grid layout
- Featured latest post spans full width with badge
- Responsive 2-column grid with orphan handling
- Pagination, SEO metadata, accessibility (aria-label, dateTime, heading hierarchy)
- Add description frontmatter to existing blog posts

* docs: add deterministic fallback colors for unknown blog tags

* docs: rename blog heading to The LiteLLM Blog
2026-01-31 15:11:45 -08:00
Cesar Garcia 3fa7ab1012 docs(embeddings): add supported input formats section (#20073)
Document valid input formats for /v1/embeddings endpoint per OpenAI spec.
Clarifies that array of string arrays is not a valid format.
2026-01-31 10:45:56 -08:00
shin-bot-litellm 1c757dee14 litellm_docs: add missing environment variable documentation (#20138) 2026-01-31 07:33:24 -08:00
Krrish Dholakia 0edd50fe3c docs: cleanup docs 2026-01-30 19:39:17 -08:00
Ishaan Jaff 5345a763c2 [Feat] v2 - Logs view with side panel and improved UX (#20091)
* init: azure_ai/azure-model-router

* show additional_costs in CostBreakdown

* UI show cost breakdown fields

* feat: dedicated cost calc for azure ai

* test_azure_ai_model_router

* docs azure model router

* test azure model router

* fix transform

* Add transform file

* fix:feat: route to config

* v0 - decent-looking view

* refactored code

* fix ui

* fixes ui

* complete v2 viewer

* address feedback

* address feedback
2026-01-30 18:34:13 -08:00
Ishaan Jaff 6897d5f59e [Feat] Add async_post_call_response_headers_hook to CustomLogger (#20083)
* Add async_post_call_response_headers_hook to CustomLogger (#20070)

Allow CustomLogger callbacks to inject custom HTTP response headers
into streaming, non-streaming, and failure responses via a new
async_post_call_response_headers_hook method.

* async_post_call_response_headers_hook

---------

Co-authored-by: michelligabriele <gabriele.michelli@icloud.com>
2026-01-30 12:44:44 -08:00
Ishaan Jaff 3910161a02 Realtime API benchmarks (#20074)
* Add /realtime API benchmarks to Benchmarks documentation

- Added new section showing performance improvements for /realtime endpoint
- Included before/after metrics showing 182× faster p99 latency
- Added test setup specifications and key optimizations
- Referenced from v1.80.5-stable release notes

Co-authored-by: ishaan <ishaan@berri.ai>

* Update /realtime benchmarks to show current performance only

- Removed before/after comparison, showing only current metrics
- Clarified that benchmarks are e2e latency against fake realtime endpoint
- Simplified table format for better readability

Co-authored-by: ishaan <ishaan@berri.ai>

---------

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
Co-authored-by: ishaan <ishaan@berri.ai>
2026-01-30 11:55:29 -08:00