Commit Graph

4678 Commits

Author SHA1 Message Date
fzowl b1922e19f8 Voyageai pricing and doc update (#16641)
* Refresh VoyageAI models and prices and context

* Refresh VoyageAI models and prices and context

* Refresh VoyageAI models and prices and context

* Updating the available VoyageAI models in the docs

* Updating the available VoyageAI models in the docs

* Updating the model prices and the docs
2025-11-14 14:09:11 -08:00
Cesar Garcia 65061bafc7 feat(openai): Add support for reasoning_effort='none' in GPT-5.1 (#16658)
* feat(openai): Add support for reasoning_effort='none' in GPT-5.1

OpenAI's GPT-5.1 introduced a new reasoning effort parameter 'none'
which replaces the previous 'minimal' setting for faster, lower-latency
responses. This is now the default setting for GPT-5.1.

Changes:
- Updated REASONING_EFFORT type to include 'none' value
- Added GPT-5.1, GPT-5-mini, and GPT-5-nano to documentation
- Updated docs to reflect 'none' as GPT-5.1's default reasoning effort
- Added test to verify reasoning_effort='none' passes through correctly

Fixes #16633

* feat(responses): Add support for reasoning_effort='none' in Responses API transformation
2025-11-14 13:41:49 -08:00
Sameer Kankute 13993d6ea3 Add fal-ai/flux/schnell support (#16580) 2025-11-13 22:31:31 -08:00
Krrish Dholakia 266744a5bd docs: add contribution guide for new guardrails 2025-11-13 22:29:42 -08:00
Ishaan Jaff 124ba463f8 [Feat] RunwayML - Add support for /audio/speech eleven_multilingual_v2 endpoint (#16604)
* init RunwayMLTextToSpeechConfig

* add RunwayMLTextToSpeechConfig

* add  RunwayMLTextToSpeechConfig

* test_runwayml_tts_async

* runway ml speech

* fix voices

* fix test

* docs runway lm

* add runwayml here

* fix RunwayMLTextToSpeechConfig

* test_openai_voice_mapping_to_runwayml
2025-11-13 14:32:09 -08:00
Ishaan Jaff 911a009869 [Docs] LiteLLM Quick start - show how model resolution works (#16602)
* docs nderstanding Model Configuration

* docs fix
2025-11-13 13:28:01 -08:00
Ishaan Jaff 7133488282 [Feat] VertexAI - Add BGE Embeddings support (#16033)
* Support for Custom Vertex AI Models via PSC Endpoint with api_base (#15953)

* Support for Custom Vertex AI Models via PSC Endpoint with api_base

* Add docs related psc

* remove not needed files

* remove print statemnt

* fix mypy errors

* add TextEmbeddingBGEInput

* add VertexBGEConfig

* add BGE handling

* test_vertex_ai_bge_embedding_with_custom_api_base

* fix request transform vertex BGE

* test_vertex_ai_bge_embedding_with_custom_api_base

* tes BGE

* test_is_bge_model_detection

* docs cleanup

* handling BGE URL

* fix VertexBGEConfig

* test_vertex_ai_bge_with_endpoint_id_pattern

* docs vertex BGE

* docs

* docs fix

* fix VertexAIModelRoute

* from ..common_utils import VertexAIError, get_vertex_base_model_name
add

* fix VertexAIGemmaModels

* fix get_vertex_base_model_name

* test_vertex_ai_bge_psc_endpoint_url_construction

---------

Co-authored-by: Sameer Kankute <sameer@berri.ai>
2025-11-13 12:41:00 -08:00
Cesar Garcia 491f57a349 feat: Add support for reasoning_effort="none" for Gemini models (#16548)
Implements support for reasoning_effort="none" parameter for Gemini models,
providing significant cost savings (up to 96% cheaper) by disabling thinking
budget while maintaining response quality.

Changes:
- Added "supports_reasoning": true to gemini-2.0-flash-thinking-exp-01-21 in model config
- Implemented mapping for reasoning_effort="none" to thinkingConfig {thinkingBudget: 0, includeThoughts: false}
- Added unit test to verify the mapping works correctly

Performance impact:
- Without reasoning_effort: ~313 tokens
- With reasoning_effort="none": ~12 tokens (96% cheaper)

Closes #16420

Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>
2025-11-12 19:41:07 -08:00
Cesar Garcia c017f665e0 docs(openai): Document reasoning_effort summary field options (#16549)
Related to PR #16210 which fixed automatic summary field addition

Changes:
- Document reasoning_effort string vs dict formats
- Add summary field options (auto, detailed, concise)
- Add table of supported reasoning_effort values by GPT-5 model
- Clarify model-specific support and limitations
- Note that summary field requires org verification

The previous implementation automatically added summary field causing
400 errors for unverified orgs. Now users can opt-in by passing
reasoning_effort as dict with explicit summary field.
2025-11-12 19:40:11 -08:00
Sameer Kankute 018bd2e039 Add Gemini image edit support (#16430)
* Add gemini image edit support

* fix lint errors

* fix lint errors

* fix lint errors

* Add docs
2025-11-12 18:48:27 -08:00
Ishaan Jaffer 78c169a524 docs fix 2025-11-12 18:30:01 -08:00
Sameer Kankute 394da34a0b Add all gemini image models support in image generation (#16526) 2025-11-12 18:26:44 -08:00
Ishaan Jaff b30439257b [Feat] Add RunwayML Img Gen API support (#16557)
* TestRunwaymlImageGeneration

* fix RUNWAYML

* rename

* fix rename

* get_runwayml_image_generation_config

* get_runwayml_image_generation_config

* TestRunwaymlImageGeneration

* add RUNWAYML_POLLING_TIMEOUT

* fix rnwayml transform img gen

* runwayml_image_cost_calculator

* runwayml_image_cost_calculator

* docs runwayml

* fix runwayML polling

* test_get_first_default_fallback
2025-11-12 18:20:14 -08:00
Benjamin Chrobot 1393900c22 [Docs] Fix code block indentation for fallbacks page (#16542) 2025-11-12 18:08:52 -08:00
Krrish Dholakia a05ad46394 feat: add litellm cloud self-serve to docs 2025-11-12 17:13:14 -08:00
Krrish Dholakia ae3178d5d4 docs(deploy.md): document how to disable pulling live model prices (faster startup time) on docker deployment 2025-11-12 13:49:44 -08:00
Krrish Dholakia 020a66c01f docs(model_access_guide.md): explain how model access works on litellm 2025-11-12 13:44:38 -08:00
Krrish Dholakia c6d2714c52 docs(model_access_guide.md): document how model access works on litellm 2025-11-12 13:43:13 -08:00
Cesar Garcia 20350fa094 docs: update broken Slack invite links to support page (#16546)
Replace broken Slack links (litellmossslack.slack.com and expired invite URLs)
with the correct support page URL (https://www.litellm.ai/support) across all
documentation files.

Files updated:
- CONTRIBUTING.md
- docs/my-website/docs/contact.md
- docs/my-website/docs/proxy/docker_quick_start.md
- docs/my-website/docs/troubleshoot.md
- docs/my-website/src/pages/contact.md
2025-11-12 12:41:55 -08:00
Anthony Monaco a1748ad550 Documentation Code Example corrections (#16502)
* Update quick_start.md

changed -D to -d

* Update users.md

Changed in a number of locations:

 "budget_duration": 10s,

to 

 "budget_duration": "10s",

* Update users.md

Changed all 10s to 30s to keep in line with the example
2025-11-11 19:06:44 -08:00
Ishaan Jaff 50b5cf5215 [Feat] New Provider - Add RunwayML Provider for video generations (#16505)
* add RUNWAYML

* init folders

* add RunwayMLVideoConfig

* add RUNWAYML_DEFAULT_API_VERSION

* add RunwayMLVideoConfig

* fix getting status

* add async_transform_video_content_response

* add runwayml transform_video_content_response

* fix config.yaml

* add runwayml docs

* add runwayml to videos

* docs runwayml video gen

* add new models to model cost map

* TestRunwayMLVideoTransformation

* fix linting errors
2025-11-11 18:48:23 -08:00
Pedro Azevedo 663f2d7e7f docs: remove enterprise restriction from guardrails list endpoint (#15333)
- Remove enterprise-only label from 'View Available Guardrails' section
- The /guardrails/list endpoint appears to be available in OSS version
- Makes documentation more accurate for OSS users
2025-11-11 18:45:26 -08:00
jwang-gif 443bada425 Add Zscaler AI Guard hook (#15691)
* Add Zscaler AI Guard hook

Co-authored-by: Angela Tao <atao@zscaler.com>

* Fix lint error, update document

* Fix lint error, update document

* update document

* fix mypy type error

* fix mypy issue

* fix test

* fix test

* improve document

* remove unuseful code

* use litellm httphandler

* update test cases

* revover guardrail_initializers.py and guardrail_registry.py

* remove unuse import

* app apply_guardrail

* remove functions repleased by apply_guardrail, update test and doc

* remove functions repleased by apply_guardrail, update test and doc

---------

Co-authored-by: Angela Tao <atao@zscaler.com>
2025-11-11 15:34:27 -08:00
Ishaan Jaffer be05324645 docs fix MAX_LANGFUSE_INITIALIZED_CLIENTS 2025-11-11 11:26:33 -08:00
Ishaan Jaff 5c9f50d584 [AI Gateway] - End User Budgets - Allow pointing max_end_user budget to an id, so the default ID applies to all end users (#16456)
* add _apply_budget_limits_to_end_user_params

* add _apply_budget_limits_to_end_user_params

* add _apply_budget_limits_to_end_user_params

* test_default_budget_applied_to_end_user_without_budget

* docs fix

* fix config
2025-11-11 08:20:13 -08:00
Sameer Kankute aaa8cba00b Add docs for tracking callback failure (#16474) 2025-11-10 19:15:14 -08:00
Sameer Kankute bf363cdf10 Add sdk focused examples (#16441) 2025-11-10 18:39:34 -08:00
Sameer Kankute be3c09e6d5 Add GET list of providers endpoint (#16432) 2025-11-10 18:38:09 -08:00
‮Artem a97676b8ae Add softgen to projects that are using litellm (#16423)
* Add Softgen project documentation

* Add Softgen project to sidebars

* Update Softgen
2025-11-10 11:09:29 -08:00
Sameer Kankute 013c195ab7 Fix container api link (#16440) 2025-11-10 11:09:10 -08:00
Krrish Dholakia 50f35fca47 docs: reorder docs 2025-11-10 07:33:22 -08:00
Ishaan Jaffer c088483da9 docs playground 2025-11-08 18:15:52 -08:00
Ishaan Jaffer f2832ba2f8 docs fix 2025-11-08 17:25:01 -08:00
Ishaan Jaffer b803af4aae docs fix 2025-11-08 17:05:20 -08:00
Ishaan Jaffer 36d181e398 docs fix + linting fix 2025-11-08 17:02:00 -08:00
Ishaan Jaffer 42c1e4740a docs built in guard 2025-11-08 16:59:00 -08:00
Ishaan Jaffer a298a9d492 docs fix 2025-11-08 16:52:26 -08:00
Ishaan Jaffer 7fb46e0b29 docs fix 2025-11-08 16:50:51 -08:00
Ishaan Jaffer 4647b41d7f docs fix 2025-11-08 16:50:06 -08:00
Ishaan Jaff 08cfc2bf7e [Docs] Litellm 1 79 2 rc (#16415)
* stash - v1

* docs fix

* docs fix

* docs

* folder fix

* docs fix
2025-11-08 16:47:15 -08:00
Alexsander Hamir 277b564385 add: performance improvements to release notes (#16401)
* add: performance improvements to release notes

* fix: mention which endpoint is getting faster

* fix: clarify that reported latency is end-to-end
2025-11-08 16:42:58 -08:00
Ishaan Jaff 20180ff93b [Docs] litellm content filter guard (#16413)
* add gif 1

* add guard 2

* add guard 3

* guard 5

* docs fix

* docs fix
2025-11-08 16:30:13 -08:00
Sameer Kankute 86d73c918c Adds support for returning Azure Content Policy error information when exceptions from Azure OpenAI occur (#16231)
* add provider_specific_fields to ContentPolicyViolationError

* use provider_specific_fields in ProxyException

* update openai_exception_handler

* fix use exception checker for content policy violation azure

* add AzureOpenAIExceptionMapping

* test_azure_with_content_safety_error

* Accessing Provider-Specific Error Details

* TestExceptionCheckers

* unit test got provider_specific_fields=

* add clear types for error dict

* fix test_azure_with_content_safety_error

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
2025-11-08 16:04:36 -08:00
Sameer Kankute e037d9315d Add Vertex and Gemini Videos API with Cost Tracking + UI support (#16323)
* Use video id for videos api

* remove mock code

* Potential fix for code scanning alert no. 3630: Clear-text logging of sensitive information

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

* remove print statements

* Update video prefix for 'video_'

* Add veo with openai videos unified specs

* Add videos testing to UI

* remove mock code

* Remove not need ui changes:

* Fix mypy errors related to gemini

* fix test_transform_video_create_request

* Add vertex ai veo config

* Add vertex ai veo config

* Add cost tracking for gemini and add optional param passing

* fix bugs related to vertex ai veo

* Add Gemini Veo Video Generation in Openai Videos Unified Spec (#16229)

* Add veo with openai videos unified specs

* Add videos testing to UI

* remove mock code

* Remove not need ui changes:

* Fix mypy errors related to gemini

* fix test_transform_video_create_request

* Add contant video duration for gemini and vertex

* Fix litellm_mapped_tests tests

* fix azure videos issue

* Added doc for videos vertex ai

* fix seconds param error

* fix lint errors

* test_transform_video_create_response_cost_tracking_no_duration

---------

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
Co-authored-by: Ishaan Jaffer <ishaanjaffer0324@gmail.com>
2025-11-08 16:03:51 -08:00
Alan Ponnachan 30873688f2 docs: Add documentation for Anthropic memory tool (#16388) 2025-11-08 12:38:57 -08:00
Ishaan Jaffer bae1857787 fix doc 2025-11-08 10:31:19 -08:00
Cesar Garcia d65a29b88d docs: fix image generation response format from 'image' to 'images' (#16378)
Update documentation to reflect actual API response format:
- Change singular 'image' field to plural 'images' array
- Add complete ImageURLListItem structure with index and type fields
- Update all code examples to use message.images instead of message.image
- Fix streaming examples to access images[0]["image_url"]["url"]

The documentation was incorrectly showing 'image' (singular object)
but the actual implementation returns 'images' (array of ImageURLListItem).

Related to issue #16227
2025-11-07 19:06:03 -08:00
Krrish Dholakia 532ebf43d0 docs(moderation.md): fix moderation quick start docs 2025-11-07 16:25:08 -08:00
Sameer Kankute faae0ff0dc Fix Azure DALL-E-3 health check content policy violation by using safe default prompt (#16329)
* Add custom health check prompt support

* Add constant for health check prompt

* Add constant for health check prompt
2025-11-07 15:30:56 -08:00
Krrish Dholakia 9059905d25 docs(openai/videos.md): document proxy usage on openai docs for video gen 2025-11-07 15:27:18 -08:00