Commit Graph

4075 Commits

Author SHA1 Message Date
Krrish Dholakia 66acf6dcdb docs(index.md): initial commit with release notes 2025-08-23 15:45:06 -07:00
Krish Dholakia 242c06e691 Merge pull request #13784 from mubashir1osmani/docs-deploy 2025-08-23 15:40:34 -07:00
Mubashir Osmani 1a8d3cc2d5 Added FAQ under deployment docs (#13912)
* updated faq questions

* updated faq

* added faq under deployment
2025-08-23 13:22:56 -07:00
Ishaan Jaff 9750374081 [Feat] New LLM API - AI/ML API for Image Gen (#13893)
* add LlmProviders.AIML

* add AIMLChatConfig

* add aiml

* fix AimlImageGenerationConfig

* add AimlImageGenerationConfig

* add cost_calculator

* fixes for AI ML API

* add known AIML Flux image models

* add AIML Cost Calc

* add AI/ML API

* add aiml_models
2025-08-23 13:12:44 -07:00
Krish Dholakia 9df6f4ef08 Merge branch 'main' into litellm_dev_08_20_2025_p1 2025-08-23 12:09:53 -07:00
mubashir1osmani d664860aa1 added openai models to config 2025-08-23 10:02:49 -04:00
Krish Dholakia 867b48a24f Merge pull request #13866 from jinskjoy/Mistral-File-ID
[Enhancement] Add support for Mistral model file handling and update documentation
2025-08-22 23:54:17 -07:00
Krish Dholakia 28ea6cd35f Merge pull request #13899 from TeddyAmkie/docs/team-rate-limits-update
docs: clarify prerequisites and env var for team rate limits
2025-08-22 23:50:24 -07:00
Ishaan Jaff f9e9f8c25e docs AWS_REGION 2025-08-22 17:48:37 -07:00
Ishaan Jaff 56c16fc6f6 [Performance] Reduce Significant CPU overhead from litellm_logging.py (#13895)
* fix: litellm.configured_cold_storage_logger

* fix Session Management - Non-OpenAI Models docs

* ruff fix

* test fix
2025-08-22 16:17:57 -07:00
Teddy Amkie 08d1c96ed1 docs: clarify prerequisites and env var for team rate limits 2025-08-22 18:39:31 +00:00
Sameer Kankute 694a9f0f4c [Documentation] Litellm rerank deepinfra endpoint (#13845)
* Add documentation for deepinfra rerank endpoint

* Add the missed import

* fix supported param
2025-08-22 09:51:42 -07:00
Krish Dholakia 7c2c1c11d9 Merge pull request #13783 from philipkiely-baseten/main
Update Baseten LiteLLM integration
2025-08-21 18:34:36 -07:00
jinskjoy d8a33e137d [Enhancement] Add support for Mistral model file handling and update documentation 2025-08-21 20:35:07 +02:00
Krrish Dholakia 5c1b27af99 docs(quick_start.md): add back to navbar
useful for showing setup via cli
2025-08-20 15:09:19 -07:00
Krrish Dholakia fb5b2d8d47 docs: remove old doc
prevent showing up on searches
2025-08-20 15:01:32 -07:00
Ishaan Jaff 55dcaded72 [Feat] Add VertexAI qwen API Service (#13828)
* add support for vertex AI QWEN API

* streaming QWEN API support

* test_partner_models_httpx

* test_partner_models_httpx_streaming

* add cost tracking for vertex_ai/qwen/qwen3-235b-a22b-instruct-2507-maa

* docs qwen models vertexAI
2025-08-20 15:00:33 -07:00
Ishaan Jaff e229a6a796 security fix 2025-08-20 08:35:59 -07:00
mubashir1osmani ef9e50458d removed faq 2025-08-19 17:01:31 -04:00
mubashir1osmani 2edcc51e57 updated claude-code docs and deployment faq 2025-08-19 16:52:46 -04:00
Philip Kiely 7c3d522435 Update Baseten LiteLLM integration 2025-08-19 12:21:05 -07:00
Krish Dholakia 422447b7f1 Responses API - add default api version for openai responses api calls + Openrouter - fix claude-sonnet-4 on openrouter + Azure - Handle openai/v1/responses
Responses API - add default api version for openai responses api calls + Openrouter - fix claude-sonnet-4 on openrouter + Azure - Handle `openai/v1/responses`
2025-08-18 18:59:28 -07:00
Krrish Dholakia 0459604721 docs: document new param 2025-08-18 18:56:39 -07:00
Ishaan Jaff ba0881d728 [Bug Fix] image_edit() function returns APIConnectionError with litellm_proxy - Support for both image edits and image generations (#13735)
* add image edits litellm proxy on SDK

* add image gen provider

* add IMG Gen support for litellm_proxy provider
2025-08-18 18:26:32 -07:00
Ishaan Jaff eee93e250a docs v1.75.5-stable 2025-08-18 09:23:18 -07:00
Krrish Dholakia 21549a3463 docs(index.md): write up on session management 2025-08-16 17:55:18 -07:00
Krrish Dholakia cb95f2d5d7 docs(index.md): clarify risk of upgrade 2025-08-16 17:38:50 -07:00
Krrish Dholakia 788a1d03a1 docs(index.md): cleanup 2025-08-16 17:33:07 -07:00
Krrish Dholakia 235b6b040f docs(index.md): add highlight note on redis latency improvements 2025-08-16 17:32:45 -07:00
Krrish Dholakia 8a75ab8388 docs(index.md): cleanup doc 2025-08-16 17:16:04 -07:00
Ishaan Jaff 5889311b10 [Docs] v1.75.8-stable (#13676)
* v1-75-8

* fix responses API docs

* docs fix

* docs fix new models

* docs links

* docs fix links

* docs sambanova

* docs fix
2025-08-16 09:20:04 -07:00
Krish Dholakia f360e0ead2 Merge pull request #13590 from BerriAI/litellm_bedrock_api_header
[LLM translation] Refactor Anthropic Configurations and Add Support for `anthropic_beta` Headers
2025-08-14 11:32:18 -07:00
Ishaan Jaff 5bb96af818 [Feat] Add reasoning_effort param for hosted_vllm provider (#13620)
* add reasoning_effort to hosted_vllm

* test_hosted_vllm_supports_reasoning_effort

* Reasoning Effort
2025-08-14 10:10:30 -07:00
Ishaan Jaff 76d25926d4 [Feat] New provider - Azure AI Flux Image Generation (#13592)
* init files

* add AzureFoundryModelInfo

* fix api_version property

* add azure_ai img gen

* use AzureFoundryModelInfo

* get_base_image_generation_call_args

* add azure_ai/FLUX-1.1-pro

* add util for route_image_generation_cost_calculator

* docs azure ai flux

* fixes for flux

* fixes for AzureFoundryFluxImageGenerationConfig

* ruff fix
2025-08-13 17:20:30 -07:00
Ishaan Jaff fb325cbb5e fix Build from litellm pip package (#13603) 2025-08-13 17:19:43 -07:00
Jugal Bhatt 4201f0aa79 Enhance Bedrock Provider Configuration and Header Management
- Added `forward_client_headers_to_llm_api` setting in the Bedrock documentation to facilitate client-side header forwarding.
- Updated `completion` function to use merged headers instead of original `extra_headers`.
- Improved request handling in `BedrockConverseLLM` and `AmazonInvokeConfig` to ensure proper header management for `anthropic-beta` parameters.
- Refactored request transformation logic to return the transformed request for better clarity and functionality.
2025-08-13 13:41:34 -07:00
Jugal Bhatt 3990f61bed Refactor Anthropic Configurations and Add Support for anthropic_beta Headers
- Renamed `AmazonAnthropicClaude3Config` and `AmazonAnthropicClaude3MessagesConfig` to `AmazonAnthropicClaudeConfig` and `AmazonAnthropicClaudeMessagesConfig` respectively for consistency.
- Implemented `get_anthropic_beta_from_headers` function to extract and handle `anthropic-beta` headers across various transformations.
- Updated request transformations in `AmazonConverseConfig` and `AmazonInvokeConfig` to include `anthropic_beta` parameters based on user headers.
- Added tests to ensure proper handling of `anthropic_beta` headers in different scenarios.
2025-08-13 11:47:59 -07:00
Krish Dholakia f00e891004 LiteLLM SDK <-> Proxy: support user param + Prisma - remove use_prisma_migrate flag - redundant as this is now default (#13555)
* fix(litellm_proxy/chat/transformation.py): support 'user' and all other openai chat completion params

Fixes issue where 'user' was not being sent in request to litellm proxy via sdk

* fix(prisma_migration.py): remove 'use_prisma_migrate' flag, is now default

* docs: cleanup docs

* fix(proxy_cli.py): remove --use_prisma_migrate flag

* refactor: remove references to use_prisma_migrate env var

This is now the default flow for db migrations
2025-08-12 22:03:39 -07:00
Jorge Piedrahita Ortiz 6a77780c0e Feat/sambanova embeddings (#13308)
* add sambanova embeddings

* fmt

* minor fix

* add sambanova embeddings call

* fmt

* include embeddings in sambanova docs

* add sambanova embeddigns unit test

* remove unused import

* minor fix

* fmt

* update sambanova embeding to inherit from BaseEmbeddingConfig

* fmt

* fmt

* hot fix

* fmt
2025-08-12 17:15:26 -07:00
Ishaan Jaff 828e3e3deb [Feat] Add Streaming support + Docs for bedrock gpt-oss model family (#13346)
* add openai.gpt-oss-20b-1:0

* update BEDROCK_CONVERSE_MODELS

* openai.gpt-oss-20b-1:0 fixes

* fix PDF input

* fix for should_fake_stream

* TestBedrockGPTOSS

* should_fake_stream

* update supports vision field for openai.gpt-oss models

* fixes for bedrock gpt oss

* fixes for should_fake_stream

* docs bedrock gpt oss models
2025-08-12 08:39:36 -07:00
Ishaan Jaff 49ca4dc792 Sending tags to LiteLLM Proxy (#13517) 2025-08-11 17:58:50 -07:00
Ishaan Jaff 9f78287000 [Bug Fix]: Azure OpenAI GPT-5 max_tokens + reasoning param support (#13510)
* add AzureOpenAIGPT5Config

* add AzureOpenAIGPT5Config

* add AzureOpenAIGPT5Config

* add AzureOpenAIGPT5Config

* test_azure_gpt5_supports_reasoning_effort

* test_azure_gpt5_reasoning

* test_azure_gpt5_reasoning

* ruff check fixes

* docs azure gpt5
2025-08-11 15:40:53 -07:00
Krrish Dholakia bd8a0ae0d0 docs: fix order 2025-08-10 09:42:55 -07:00
Krrish Dholakia 1dbac75675 docs(index.md): update release with deployment information 2025-08-10 09:31:28 -07:00
Krish Dholakia c742c76288 Litellm release notes 08 10 2025 (#13479)
* docs(index.md): initial doc

* build(index.md): initial notes

* docs(index.md): add llm translation tickets

* docs(index.md): document new model support

* docs(index.md): document all pricing changes

* docs(index.md): add llm api endpoints

* docs(index.md): add doc on mcp gateway

* docs(index.md): add all remaining rc notes

* docs(index.md): cleanup
2025-08-10 07:32:11 -07:00
Krish Dholakia 9f6f96d76c Litellm dev 08 07 2025 p1 (#13418)
* fix(router.py): support base model for model group usage

allows model group info to show accurate cost information for azure models

* fix(router.py): fix changes

* test: add unit tests

* build(pyproject.toml): bump openai version requirements

support custom tool from responses api

 Closes https://github.com/BerriAI/litellm/issues/13391

* docs(responses_api.md): add verbosity + free-form function calling parameters

* docs(responses_api.md): add cfg + minimal reasoning to docs

Closes https://github.com/BerriAI/litellm/issues/13391

* docs(responses_api.md): add proxy examples to docs

* refactor: fix ruff error
2025-08-09 16:30:04 -07:00
Sannan Nasir 0e53b1feab Add digitalocean provider (#12169)
* Add digitalocean provider

* Add digitalocean provider

* Revert "Add digitalocean provider"

This reverts commit 96dda40f45b3d12ea03e861d060ec81460b7759e.

* changes

* fixes

* Update transformation

* refactoring

* rename provider to Gradient AI

* fixes

* Incorporte review comments

* revert changes

* fix typo

* revert change

* incorporated review comments

* Revert "Incorporte review comments"

This reverts commit 37bd51bd54ef4fd52ccc12866e47f8de9476d597.

* changes

* Revert "Revert "Incorporte review comments"

This reverts commit 37bd51bd54ef4fd52ccc12866e47f8de9476d597."

This reverts commit 68c8a198ee0d6441c3a52f6c6a49c9c95a4cb0a8.

* changes

* fixes

* Update provider_specific_fields.tsx
2025-08-09 16:26:33 -07:00
Ishaan Jaff ee40db7b31 docs native litellm prompts 2025-08-09 09:46:31 -07:00
Ishaan Jaff 94c33200a4 docs - native prompt mgmt (#13463) 2025-08-09 09:39:16 -07:00
Ishaan Jaff 3999e65a97 docs update 2025-08-09 09:24:41 -07:00