Commit Graph

4056 Commits

Author SHA1 Message Date
Krrish Dholakia fb5b2d8d47 docs: remove old doc
prevent showing up on searches
2025-08-20 15:01:32 -07:00
Ishaan Jaff e229a6a796 security fix 2025-08-20 08:35:59 -07:00
Krish Dholakia 422447b7f1 Responses API - add default api version for openai responses api calls + Openrouter - fix claude-sonnet-4 on openrouter + Azure - Handle openai/v1/responses
Responses API - add default api version for openai responses api calls + Openrouter - fix claude-sonnet-4 on openrouter + Azure - Handle `openai/v1/responses`
2025-08-18 18:59:28 -07:00
Krrish Dholakia 0459604721 docs: document new param 2025-08-18 18:56:39 -07:00
Ishaan Jaff ba0881d728 [Bug Fix] image_edit() function returns APIConnectionError with litellm_proxy - Support for both image edits and image generations (#13735)
* add image edits litellm proxy on SDK

* add image gen provider

* add IMG Gen support for litellm_proxy provider
2025-08-18 18:26:32 -07:00
Ishaan Jaff eee93e250a docs v1.75.5-stable 2025-08-18 09:23:18 -07:00
Krrish Dholakia 21549a3463 docs(index.md): write up on session management 2025-08-16 17:55:18 -07:00
Krrish Dholakia cb95f2d5d7 docs(index.md): clarify risk of upgrade 2025-08-16 17:38:50 -07:00
Krrish Dholakia 788a1d03a1 docs(index.md): cleanup 2025-08-16 17:33:07 -07:00
Krrish Dholakia 235b6b040f docs(index.md): add highlight note on redis latency improvements 2025-08-16 17:32:45 -07:00
Krrish Dholakia 8a75ab8388 docs(index.md): cleanup doc 2025-08-16 17:16:04 -07:00
Ishaan Jaff 5889311b10 [Docs] v1.75.8-stable (#13676)
* v1-75-8

* fix responses API docs

* docs fix

* docs fix new models

* docs links

* docs fix links

* docs sambanova

* docs fix
2025-08-16 09:20:04 -07:00
Krish Dholakia f360e0ead2 Merge pull request #13590 from BerriAI/litellm_bedrock_api_header
[LLM translation] Refactor Anthropic Configurations and Add Support for `anthropic_beta` Headers
2025-08-14 11:32:18 -07:00
Ishaan Jaff 5bb96af818 [Feat] Add reasoning_effort param for hosted_vllm provider (#13620)
* add reasoning_effort to hosted_vllm

* test_hosted_vllm_supports_reasoning_effort

* Reasoning Effort
2025-08-14 10:10:30 -07:00
Ishaan Jaff 76d25926d4 [Feat] New provider - Azure AI Flux Image Generation (#13592)
* init files

* add AzureFoundryModelInfo

* fix api_version property

* add azure_ai img gen

* use AzureFoundryModelInfo

* get_base_image_generation_call_args

* add azure_ai/FLUX-1.1-pro

* add util for route_image_generation_cost_calculator

* docs azure ai flux

* fixes for flux

* fixes for AzureFoundryFluxImageGenerationConfig

* ruff fix
2025-08-13 17:20:30 -07:00
Ishaan Jaff fb325cbb5e fix Build from litellm pip package (#13603) 2025-08-13 17:19:43 -07:00
Jugal Bhatt 4201f0aa79 Enhance Bedrock Provider Configuration and Header Management
- Added `forward_client_headers_to_llm_api` setting in the Bedrock documentation to facilitate client-side header forwarding.
- Updated `completion` function to use merged headers instead of original `extra_headers`.
- Improved request handling in `BedrockConverseLLM` and `AmazonInvokeConfig` to ensure proper header management for `anthropic-beta` parameters.
- Refactored request transformation logic to return the transformed request for better clarity and functionality.
2025-08-13 13:41:34 -07:00
Jugal Bhatt 3990f61bed Refactor Anthropic Configurations and Add Support for anthropic_beta Headers
- Renamed `AmazonAnthropicClaude3Config` and `AmazonAnthropicClaude3MessagesConfig` to `AmazonAnthropicClaudeConfig` and `AmazonAnthropicClaudeMessagesConfig` respectively for consistency.
- Implemented `get_anthropic_beta_from_headers` function to extract and handle `anthropic-beta` headers across various transformations.
- Updated request transformations in `AmazonConverseConfig` and `AmazonInvokeConfig` to include `anthropic_beta` parameters based on user headers.
- Added tests to ensure proper handling of `anthropic_beta` headers in different scenarios.
2025-08-13 11:47:59 -07:00
Krish Dholakia f00e891004 LiteLLM SDK <-> Proxy: support user param + Prisma - remove use_prisma_migrate flag - redundant as this is now default (#13555)
* fix(litellm_proxy/chat/transformation.py): support 'user' and all other openai chat completion params

Fixes issue where 'user' was not being sent in request to litellm proxy via sdk

* fix(prisma_migration.py): remove 'use_prisma_migrate' flag, is now default

* docs: cleanup docs

* fix(proxy_cli.py): remove --use_prisma_migrate flag

* refactor: remove references to use_prisma_migrate env var

This is now the default flow for db migrations
2025-08-12 22:03:39 -07:00
Jorge Piedrahita Ortiz 6a77780c0e Feat/sambanova embeddings (#13308)
* add sambanova embeddings

* fmt

* minor fix

* add sambanova embeddings call

* fmt

* include embeddings in sambanova docs

* add sambanova embeddigns unit test

* remove unused import

* minor fix

* fmt

* update sambanova embeding to inherit from BaseEmbeddingConfig

* fmt

* fmt

* hot fix

* fmt
2025-08-12 17:15:26 -07:00
Ishaan Jaff 828e3e3deb [Feat] Add Streaming support + Docs for bedrock gpt-oss model family (#13346)
* add openai.gpt-oss-20b-1:0

* update BEDROCK_CONVERSE_MODELS

* openai.gpt-oss-20b-1:0 fixes

* fix PDF input

* fix for should_fake_stream

* TestBedrockGPTOSS

* should_fake_stream

* update supports vision field for openai.gpt-oss models

* fixes for bedrock gpt oss

* fixes for should_fake_stream

* docs bedrock gpt oss models
2025-08-12 08:39:36 -07:00
Ishaan Jaff 49ca4dc792 Sending tags to LiteLLM Proxy (#13517) 2025-08-11 17:58:50 -07:00
Ishaan Jaff 9f78287000 [Bug Fix]: Azure OpenAI GPT-5 max_tokens + reasoning param support (#13510)
* add AzureOpenAIGPT5Config

* add AzureOpenAIGPT5Config

* add AzureOpenAIGPT5Config

* add AzureOpenAIGPT5Config

* test_azure_gpt5_supports_reasoning_effort

* test_azure_gpt5_reasoning

* test_azure_gpt5_reasoning

* ruff check fixes

* docs azure gpt5
2025-08-11 15:40:53 -07:00
Krrish Dholakia bd8a0ae0d0 docs: fix order 2025-08-10 09:42:55 -07:00
Krrish Dholakia 1dbac75675 docs(index.md): update release with deployment information 2025-08-10 09:31:28 -07:00
Krish Dholakia c742c76288 Litellm release notes 08 10 2025 (#13479)
* docs(index.md): initial doc

* build(index.md): initial notes

* docs(index.md): add llm translation tickets

* docs(index.md): document new model support

* docs(index.md): document all pricing changes

* docs(index.md): add llm api endpoints

* docs(index.md): add doc on mcp gateway

* docs(index.md): add all remaining rc notes

* docs(index.md): cleanup
2025-08-10 07:32:11 -07:00
Krish Dholakia 9f6f96d76c Litellm dev 08 07 2025 p1 (#13418)
* fix(router.py): support base model for model group usage

allows model group info to show accurate cost information for azure models

* fix(router.py): fix changes

* test: add unit tests

* build(pyproject.toml): bump openai version requirements

support custom tool from responses api

 Closes https://github.com/BerriAI/litellm/issues/13391

* docs(responses_api.md): add verbosity + free-form function calling parameters

* docs(responses_api.md): add cfg + minimal reasoning to docs

Closes https://github.com/BerriAI/litellm/issues/13391

* docs(responses_api.md): add proxy examples to docs

* refactor: fix ruff error
2025-08-09 16:30:04 -07:00
Sannan Nasir 0e53b1feab Add digitalocean provider (#12169)
* Add digitalocean provider

* Add digitalocean provider

* Revert "Add digitalocean provider"

This reverts commit 96dda40f45b3d12ea03e861d060ec81460b7759e.

* changes

* fixes

* Update transformation

* refactoring

* rename provider to Gradient AI

* fixes

* Incorporte review comments

* revert changes

* fix typo

* revert change

* incorporated review comments

* Revert "Incorporte review comments"

This reverts commit 37bd51bd54ef4fd52ccc12866e47f8de9476d597.

* changes

* Revert "Revert "Incorporte review comments"

This reverts commit 37bd51bd54ef4fd52ccc12866e47f8de9476d597."

This reverts commit 68c8a198ee0d6441c3a52f6c6a49c9c95a4cb0a8.

* changes

* fixes

* Update provider_specific_fields.tsx
2025-08-09 16:26:33 -07:00
Ishaan Jaff ee40db7b31 docs native litellm prompts 2025-08-09 09:46:31 -07:00
Ishaan Jaff 94c33200a4 docs - native prompt mgmt (#13463) 2025-08-09 09:39:16 -07:00
Ishaan Jaff 3999e65a97 docs update 2025-08-09 09:24:41 -07:00
Jugal D. Bhatt 035e5497e0 added mcp guardrails doc in mcp.md (#13452) 2025-08-08 22:47:31 -07:00
Ishaan Jaff 7e2a00c848 [Docs] Add docs on how router / cooldowns work (#13444)
* add theme-mermaid

* docs cool down

* docs cooldown
2025-08-08 15:13:37 -07:00
Jugal D. Bhatt 3410b293d7 Add presidio docs (#13392) 2025-08-07 16:40:07 -07:00
Ishaan Jaff 2037037258 [Bug Fix] OpenAI gpt-5 series does not support "max_tokens" parameter and temperature values that are not = 1 (#13390)
* add OpenAIGPT5Config

* add map_openai_params for gpt5

* add OpenAIGPT5Config

* add OpenAI gpt 5 transform

* docs gpt 5 openai
2025-08-07 16:35:00 -07:00
breno-aumo e8b4b25774 Update OCI docs (#13336)
* add oci models to model_prices_and_context_window.json

* remove unsupported and unavailable oci models from docs
2025-08-07 09:45:17 -07:00
Edward D'Amato 30fc5b871c feat(integrations): allow setting of braintrust callback base url (#13368)
* feat(integrations): allow setting of braintrust callback base url

* chore(misc): remove extra additions due to merge
2025-08-07 08:40:11 -07:00
Ishaan Jaff 50c4a32c7d docs claude-opus-4-1-20250805 2025-08-05 18:02:22 -07:00
Ishaan Jaff dab8ba03e3 [Feat] - When using custom tags on prometheus allow using wildcard patterns (#13316)
* _tag_matches_wildcard_configured_pattern

* test_get_custom_labels_from_tags_wildcard_patterns

* docs Custom Tags

* docs how custom tags work

* fix
2025-08-05 17:46:13 -07:00
Jugal D. Bhatt 7524aa386c [Redis IAM] Change documentation (#13306)
* change docs

* addde note
2025-08-05 15:04:46 -07:00
Jugal D. Bhatt 29a8c583c2 added redis iam auth (#13275) 2025-08-05 10:56:34 -07:00
Pascal Bro a17d483c89 Add GCS bucket caching support (#13122) 2025-08-04 16:09:33 -07:00
breno-aumo 056b60a9fa Support OCI provider (#13206)
* create OCI required files

* request and response conversion for non-streaming chat

* support tool calling with OCI generic API without streaming

* adaptation of api call for generic and cohere format

* include tool calls and responses in generic api and dropping support for cohere

* fix invalid content-length error

* support streaming for generic api

* fix auth error when using acompletion with streaming

* refactor: use base_llm_http_handler and include API type definitions

* update types and add type safety in different methods

* fix OCIFunction format

* create custom stream wrapper for decoding OCI stream

* remove unused files

* create unit tests for OCI

* lint the code

* remove manual test

* docs: update the docs to include OCI
2025-08-04 15:59:25 -07:00
Krrish Dholakia 747467c819 docs(user_keys.md): add litellm python sdk tab 2025-08-04 15:14:13 -07:00
Krrish Dholakia bfecab2cc5 docs(index.md): cleanup tag 2025-08-04 14:18:04 -07:00
Krrish Dholakia 98102e2804 docs(index.md): cleanup 2025-08-02 23:06:46 -07:00
Ishaan Jaff 4c217c66f5 docs User Agent Activity Tracking 2025-08-02 17:26:40 -07:00
Ishaan Jaff 2ee4e84406 docs fix 2025-08-02 16:47:44 -07:00
Ishaan Jaff 06856b4d37 docs fix 2025-08-02 15:47:09 -07:00
Ishaan Jaff 0f9f5f7a6c docs fix 2025-08-02 15:44:59 -07:00