litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-18 17:28:19 +00:00

Author	SHA1	Message	Date
fzowl	b1922e19f8	Voyageai pricing and doc update (#16641 ) * Refresh VoyageAI models and prices and context * Refresh VoyageAI models and prices and context * Refresh VoyageAI models and prices and context * Updating the available VoyageAI models in the docs * Updating the available VoyageAI models in the docs * Updating the model prices and the docs	2025-11-14 14:09:11 -08:00
Cesar Garcia	65061bafc7	feat(openai): Add support for reasoning_effort='none' in GPT-5.1 (#16658 ) * feat(openai): Add support for reasoning_effort='none' in GPT-5.1 OpenAI's GPT-5.1 introduced a new reasoning effort parameter 'none' which replaces the previous 'minimal' setting for faster, lower-latency responses. This is now the default setting for GPT-5.1. Changes: - Updated REASONING_EFFORT type to include 'none' value - Added GPT-5.1, GPT-5-mini, and GPT-5-nano to documentation - Updated docs to reflect 'none' as GPT-5.1's default reasoning effort - Added test to verify reasoning_effort='none' passes through correctly Fixes #16633 * feat(responses): Add support for reasoning_effort='none' in Responses API transformation	2025-11-14 13:41:49 -08:00
Sameer Kankute	13993d6ea3	Add fal-ai/flux/schnell support (#16580 )	2025-11-13 22:31:31 -08:00
Krrish Dholakia	266744a5bd	docs: add contribution guide for new guardrails	2025-11-13 22:29:42 -08:00
Ishaan Jaff	124ba463f8	[Feat] RunwayML - Add support for /audio/speech `eleven_multilingual_v2` endpoint (#16604 ) * init RunwayMLTextToSpeechConfig * add RunwayMLTextToSpeechConfig * add RunwayMLTextToSpeechConfig * test_runwayml_tts_async * runway ml speech * fix voices * fix test * docs runway lm * add runwayml here * fix RunwayMLTextToSpeechConfig * test_openai_voice_mapping_to_runwayml	2025-11-13 14:32:09 -08:00
Ishaan Jaff	911a009869	[Docs] LiteLLM Quick start - show how model resolution works (#16602 ) * docs nderstanding Model Configuration * docs fix	2025-11-13 13:28:01 -08:00
Ishaan Jaff	7133488282	[Feat] VertexAI - Add BGE Embeddings support (#16033 ) * Support for Custom Vertex AI Models via PSC Endpoint with api_base (#15953) * Support for Custom Vertex AI Models via PSC Endpoint with api_base * Add docs related psc * remove not needed files * remove print statemnt * fix mypy errors * add TextEmbeddingBGEInput * add VertexBGEConfig * add BGE handling * test_vertex_ai_bge_embedding_with_custom_api_base * fix request transform vertex BGE * test_vertex_ai_bge_embedding_with_custom_api_base * tes BGE * test_is_bge_model_detection * docs cleanup * handling BGE URL * fix VertexBGEConfig * test_vertex_ai_bge_with_endpoint_id_pattern * docs vertex BGE * docs * docs fix * fix VertexAIModelRoute * from ..common_utils import VertexAIError, get_vertex_base_model_name add * fix VertexAIGemmaModels * fix get_vertex_base_model_name * test_vertex_ai_bge_psc_endpoint_url_construction --------- Co-authored-by: Sameer Kankute <sameer@berri.ai>	2025-11-13 12:41:00 -08:00
Cesar Garcia	491f57a349	feat: Add support for reasoning_effort="none" for Gemini models (#16548 ) Implements support for reasoning_effort="none" parameter for Gemini models, providing significant cost savings (up to 96% cheaper) by disabling thinking budget while maintaining response quality. Changes: - Added "supports_reasoning": true to gemini-2.0-flash-thinking-exp-01-21 in model config - Implemented mapping for reasoning_effort="none" to thinkingConfig {thinkingBudget: 0, includeThoughts: false} - Added unit test to verify the mapping works correctly Performance impact: - Without reasoning_effort: ~313 tokens - With reasoning_effort="none": ~12 tokens (96% cheaper) Closes #16420 Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>	2025-11-12 19:41:07 -08:00
Cesar Garcia	c017f665e0	docs(openai): Document reasoning_effort summary field options (#16549 ) Related to PR #16210 which fixed automatic summary field addition Changes: - Document reasoning_effort string vs dict formats - Add summary field options (auto, detailed, concise) - Add table of supported reasoning_effort values by GPT-5 model - Clarify model-specific support and limitations - Note that summary field requires org verification The previous implementation automatically added summary field causing 400 errors for unverified orgs. Now users can opt-in by passing reasoning_effort as dict with explicit summary field.	2025-11-12 19:40:11 -08:00
Sameer Kankute	018bd2e039	Add Gemini image edit support (#16430 ) * Add gemini image edit support * fix lint errors * fix lint errors * fix lint errors * Add docs	2025-11-12 18:48:27 -08:00
Ishaan Jaffer	78c169a524	docs fix	2025-11-12 18:30:01 -08:00
Sameer Kankute	394da34a0b	Add all gemini image models support in image generation (#16526 )	2025-11-12 18:26:44 -08:00
Ishaan Jaff	b30439257b	[Feat] Add RunwayML Img Gen API support (#16557 ) * TestRunwaymlImageGeneration * fix RUNWAYML * rename * fix rename * get_runwayml_image_generation_config * get_runwayml_image_generation_config * TestRunwaymlImageGeneration * add RUNWAYML_POLLING_TIMEOUT * fix rnwayml transform img gen * runwayml_image_cost_calculator * runwayml_image_cost_calculator * docs runwayml * fix runwayML polling * test_get_first_default_fallback	2025-11-12 18:20:14 -08:00
Benjamin Chrobot	1393900c22	[Docs] Fix code block indentation for fallbacks page (#16542 )	2025-11-12 18:08:52 -08:00
Krrish Dholakia	a05ad46394	feat: add litellm cloud self-serve to docs	2025-11-12 17:13:14 -08:00
Krrish Dholakia	ae3178d5d4	docs(deploy.md): document how to disable pulling live model prices (faster startup time) on docker deployment	2025-11-12 13:49:44 -08:00
Krrish Dholakia	020a66c01f	docs(model_access_guide.md): explain how model access works on litellm	2025-11-12 13:44:38 -08:00
Krrish Dholakia	c6d2714c52	docs(model_access_guide.md): document how model access works on litellm	2025-11-12 13:43:13 -08:00
Cesar Garcia	20350fa094	docs: update broken Slack invite links to support page (#16546 ) Replace broken Slack links (litellmossslack.slack.com and expired invite URLs) with the correct support page URL (https://www.litellm.ai/support) across all documentation files. Files updated: - CONTRIBUTING.md - docs/my-website/docs/contact.md - docs/my-website/docs/proxy/docker_quick_start.md - docs/my-website/docs/troubleshoot.md - docs/my-website/src/pages/contact.md	2025-11-12 12:41:55 -08:00
Anthony Monaco	a1748ad550	Documentation Code Example corrections (#16502 ) * Update quick_start.md changed -D to -d * Update users.md Changed in a number of locations: "budget_duration": 10s, to "budget_duration": "10s", * Update users.md Changed all 10s to 30s to keep in line with the example	2025-11-11 19:06:44 -08:00
Ishaan Jaff	50b5cf5215	[Feat] New Provider - Add RunwayML Provider for video generations (#16505 ) * add RUNWAYML * init folders * add RunwayMLVideoConfig * add RUNWAYML_DEFAULT_API_VERSION * add RunwayMLVideoConfig * fix getting status * add async_transform_video_content_response * add runwayml transform_video_content_response * fix config.yaml * add runwayml docs * add runwayml to videos * docs runwayml video gen * add new models to model cost map * TestRunwayMLVideoTransformation * fix linting errors	2025-11-11 18:48:23 -08:00
Pedro Azevedo	663f2d7e7f	docs: remove enterprise restriction from guardrails list endpoint (#15333 ) - Remove enterprise-only label from 'View Available Guardrails' section - The /guardrails/list endpoint appears to be available in OSS version - Makes documentation more accurate for OSS users	2025-11-11 18:45:26 -08:00
jwang-gif	443bada425	Add Zscaler AI Guard hook (#15691 ) * Add Zscaler AI Guard hook Co-authored-by: Angela Tao <atao@zscaler.com> * Fix lint error, update document * Fix lint error, update document * update document * fix mypy type error * fix mypy issue * fix test * fix test * improve document * remove unuseful code * use litellm httphandler * update test cases * revover guardrail_initializers.py and guardrail_registry.py * remove unuse import * app apply_guardrail * remove functions repleased by apply_guardrail, update test and doc * remove functions repleased by apply_guardrail, update test and doc --------- Co-authored-by: Angela Tao <atao@zscaler.com>	2025-11-11 15:34:27 -08:00
Ishaan Jaffer	be05324645	docs fix MAX_LANGFUSE_INITIALIZED_CLIENTS	2025-11-11 11:26:33 -08:00
Ishaan Jaff	5c9f50d584	[AI Gateway] - End User Budgets - Allow pointing max_end_user budget to an id, so the default ID applies to all end users (#16456 ) * add _apply_budget_limits_to_end_user_params * add _apply_budget_limits_to_end_user_params * add _apply_budget_limits_to_end_user_params * test_default_budget_applied_to_end_user_without_budget * docs fix * fix config	2025-11-11 08:20:13 -08:00
Sameer Kankute	aaa8cba00b	Add docs for tracking callback failure (#16474 )	2025-11-10 19:15:14 -08:00
Sameer Kankute	bf363cdf10	Add sdk focused examples (#16441 )	2025-11-10 18:39:34 -08:00
Sameer Kankute	be3c09e6d5	Add GET list of providers endpoint (#16432 )	2025-11-10 18:38:09 -08:00
‮Artem	a97676b8ae	Add softgen to projects that are using litellm (#16423 ) * Add Softgen project documentation * Add Softgen project to sidebars * Update Softgen	2025-11-10 11:09:29 -08:00
Sameer Kankute	013c195ab7	Fix container api link (#16440 )	2025-11-10 11:09:10 -08:00
Krrish Dholakia	50f35fca47	docs: reorder docs	2025-11-10 07:33:22 -08:00
Ishaan Jaffer	c088483da9	docs playground	2025-11-08 18:15:52 -08:00
Ishaan Jaffer	f2832ba2f8	docs fix	2025-11-08 17:25:01 -08:00
Ishaan Jaffer	b803af4aae	docs fix	2025-11-08 17:05:20 -08:00
Ishaan Jaffer	36d181e398	docs fix + linting fix	2025-11-08 17:02:00 -08:00
Ishaan Jaffer	42c1e4740a	docs built in guard	2025-11-08 16:59:00 -08:00
Ishaan Jaffer	a298a9d492	docs fix	2025-11-08 16:52:26 -08:00
Ishaan Jaffer	7fb46e0b29	docs fix	2025-11-08 16:50:51 -08:00
Ishaan Jaffer	4647b41d7f	docs fix	2025-11-08 16:50:06 -08:00
Ishaan Jaff	08cfc2bf7e	[Docs] Litellm 1 79 2 rc (#16415 ) * stash - v1 * docs fix * docs fix * docs * folder fix * docs fix	2025-11-08 16:47:15 -08:00
Alexsander Hamir	277b564385	add: performance improvements to release notes (#16401 ) * add: performance improvements to release notes * fix: mention which endpoint is getting faster * fix: clarify that reported latency is end-to-end	2025-11-08 16:42:58 -08:00
Ishaan Jaff	20180ff93b	[Docs] litellm content filter guard (#16413 ) * add gif 1 * add guard 2 * add guard 3 * guard 5 * docs fix * docs fix	2025-11-08 16:30:13 -08:00
Sameer Kankute	86d73c918c	Adds support for returning Azure Content Policy error information when exceptions from Azure OpenAI occur (#16231 ) * add provider_specific_fields to ContentPolicyViolationError * use provider_specific_fields in ProxyException * update openai_exception_handler * fix use exception checker for content policy violation azure * add AzureOpenAIExceptionMapping * test_azure_with_content_safety_error * Accessing Provider-Specific Error Details * TestExceptionCheckers * unit test got provider_specific_fields= * add clear types for error dict * fix test_azure_with_content_safety_error --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2025-11-08 16:04:36 -08:00
Sameer Kankute	e037d9315d	Add Vertex and Gemini Videos API with Cost Tracking + UI support (#16323 ) * Use video id for videos api * remove mock code * Potential fix for code scanning alert no. 3630: Clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * remove print statements * Update video prefix for 'video_' * Add veo with openai videos unified specs * Add videos testing to UI * remove mock code * Remove not need ui changes: * Fix mypy errors related to gemini * fix test_transform_video_create_request * Add vertex ai veo config * Add vertex ai veo config * Add cost tracking for gemini and add optional param passing * fix bugs related to vertex ai veo * Add Gemini Veo Video Generation in Openai Videos Unified Spec (#16229) * Add veo with openai videos unified specs * Add videos testing to UI * remove mock code * Remove not need ui changes: * Fix mypy errors related to gemini * fix test_transform_video_create_request * Add contant video duration for gemini and vertex * Fix litellm_mapped_tests tests * fix azure videos issue * Added doc for videos vertex ai * fix seconds param error * fix lint errors * test_transform_video_create_response_cost_tracking_no_duration --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Co-authored-by: Ishaan Jaffer <ishaanjaffer0324@gmail.com>	2025-11-08 16:03:51 -08:00
Alan Ponnachan	30873688f2	docs: Add documentation for Anthropic memory tool (#16388 )	2025-11-08 12:38:57 -08:00
Ishaan Jaffer	bae1857787	fix doc	2025-11-08 10:31:19 -08:00
Cesar Garcia	d65a29b88d	docs: fix image generation response format from 'image' to 'images' (#16378 ) Update documentation to reflect actual API response format: - Change singular 'image' field to plural 'images' array - Add complete ImageURLListItem structure with index and type fields - Update all code examples to use message.images instead of message.image - Fix streaming examples to access images[0]["image_url"]["url"] The documentation was incorrectly showing 'image' (singular object) but the actual implementation returns 'images' (array of ImageURLListItem). Related to issue #16227	2025-11-07 19:06:03 -08:00
Krrish Dholakia	532ebf43d0	docs(moderation.md): fix moderation quick start docs	2025-11-07 16:25:08 -08:00
Sameer Kankute	faae0ff0dc	Fix Azure DALL-E-3 health check content policy violation by using safe default prompt (#16329 ) * Add custom health check prompt support * Add constant for health check prompt * Add constant for health check prompt	2025-11-07 15:30:56 -08:00
Krrish Dholakia	9059905d25	docs(openai/videos.md): document proxy usage on openai docs for video gen	2025-11-07 15:27:18 -08:00

1 2 3 4 5 ...

4678 Commits