Commit Graph

27757 Commits

Author SHA1 Message Date
yuneng-jiang 6c79240cd6 Merge pull request #17108 from BerriAI/litellm_user_table_sort_ui
[Feature] UI - User Table Sort by All
2025-11-25 21:35:27 -08:00
yuneng-jiang e9ab20631f Merge pull request #17098 from BerriAI/litellm_broken_links_ui
[Feature] UI - Disable edit, delete, info, for dynamically generated spend tags
2025-11-25 21:34:39 -08:00
ripepersimmon be970735de feat: Add gemini-3-pro-image-preview model support for imageSize parameters (#17019)
- Add model identifier to FLASH_IMAGE_PREVIEW_MODEL_IDENTIFIERS
- Add imageSize parameter support (1K, 2K, 4K) with GeminiImageSize type
- Add tests for imageSize parameter transformation
- Update documentation with new model
2025-11-25 19:38:29 -08:00
Carlo Alberto Ferraris a727f71b19 Optimize date filtering for spend logs queries (#17073)
This should allow postgres to perform a more efficient index scan instead of a sequential table scan.

These two queries consistently show up in the longest-running ones in our instance, and are a major latency source for the usage page on the admin UI.
2025-11-25 19:36:25 -08:00
Igal Boxerman e6e1e8fca4 feat(pillar): add automatic LiteLLM context headers (#17076)
- Automatically pass LiteLLM virtual key context as X-LiteLLM-* headers
- Includes key_alias, user_id, team_id, org_id, and user_email
- No configuration required - always enabled for application/user tracking
- Excludes sensitive data (metadata, API tokens) for security
- Add comprehensive tests (30 tests, all passing)
- Update documentation with header details
2025-11-25 19:35:39 -08:00
Alexsander Hamir 7c09187daf downgrade grpcio (#17090) 2025-11-25 19:33:24 -08:00
yuneng-jiang 7227747a6f Improve Wording for Config Models in Model Table (#17100) 2025-11-25 19:25:23 -08:00
Carlo Alberto Ferraris b50fcc4b56 vertex ai: use the correct domain for the global location when counting tokens (#17116) 2025-11-25 19:22:20 -08:00
Sameer Kankute cd65a84abd Merge pull request #16844 from Chesars/fix/response-format-to-text-format-bridge-conversion
fix: Support response_format parameter in completion -> responses bridge
2025-11-26 08:51:09 +05:30
Ishaan Jaff 5c192a23c3 [Feat] Add new RAG API on LiteLLM AI Gateway (#17109)
* init RAG api types

* add RAG endpoints

* init main.py for RAG ingest API

* init RecursiveCharacterTextSplitter

* add BaseRAGIngestion

* fix OpenAIRAGIngestion

* fix img handler

* init OpenAIRAGIngestion

* init BedrockRAGIngestion

* init BedrockRAGIngestion

* init rag tests

* init BedrockVectorStoreOptions

* implement BedrockRAGIngestion

* add BaseRAGAPI

* add endpoint for RAG ingest

* add ingest RAG endpoints

* add test doc

* add parse_rag_ingest_request

* update endpoints

* docs add docs for new RAG API

* fix qa check

* fix linting

* docs ficx

* docs

* add max depth checks

* docs anthropic
2025-11-25 17:54:29 -08:00
Otavio Brito 6e5c7c0008 fix transcription exception handling - /audio/transcriptions (#16791)
* fix transcription exception handling

* reraise the exception
2025-11-25 16:41:35 -08:00
Krrish Dholakia 5cb5c2a7b7 docs: more doc cleanup 2025-11-25 16:04:27 -08:00
Krrish Dholakia 8ee6812edf docs: cleanup launch post 2025-11-25 15:58:51 -08:00
yuneng-jiang 3da9974a87 Tests 2025-11-25 15:54:55 -08:00
Krrish Dholakia 70a1325847 docs: more doc cleanup 2025-11-25 15:01:22 -08:00
Kerem Turgutlu 8637d74e17 include server_tool_use in streaming usage (#16826)
* include server_tool_use in streaming usage

* add test
2025-11-25 14:50:17 -08:00
Sam Chou c0288d81aa Fix bedrock claude opus 4.5 inference profile - only global currently (#17101) 2025-11-25 14:49:12 -08:00
yuneng-jiang db587926a4 Sorting changes, pending tests and loading state 2025-11-25 14:46:46 -08:00
Krrish Dholakia f3d5775920 fix: fix doc load issue 2025-11-25 14:40:26 -08:00
YutaSaito 52f1bf1a80 fix: missing await (#17103) 2025-11-25 14:33:38 -08:00
Ishaan Jaff be712908a3 [Feat] Add OpenAI compatible bedrock imported models. - qwen etc (#17097)
* test_bedrock_openai_imported_model

* AmazonBedrockOpenAIConfig

* add openai route for bedrock

* docs fix

* fix code qa check
2025-11-25 12:20:39 -08:00
yuneng-jiang 44cde2e48f Disable edit, delete, info, for dynamically generated spend tags 2025-11-25 12:03:01 -08:00
Krrish Dholakia db2c8e3631 docs: initial doc cleanup 2025-11-25 11:57:51 -08:00
Sameer Kankute 67622fb040 Add day 0 support for anthropic new feat (#17091)
* Added tool search support for anthropic

* Add programtic tool calling support

* Add tool use input examples support

* Add anthropic effort param support

* Add anthropic effort param support

* Add blog for new features

* fix mypy and lint errors

* fix mypy and lint errors

* fix mypy and lint errors

* fix mypy and lint errors

* Add better handling

* Add better handling
2025-11-25 11:28:47 -08:00
Sameer Kankute 3249f6dd2d Merge pull request #17070 from BerriAI/litellm_add_vertex_ai_image_support
Add vertex ai image gen support for both gemini and imagen models
2025-11-26 00:04:03 +05:30
Sameer Kankute 83a9dcd2d2 Merge pull request #16886 from BerriAI/litellm_anthopic_azure_support
Added support for azure anthopic models via chat completion
2025-11-26 00:03:52 +05:30
Sameer Kankute 59b4b9a07c fix documentation of anthropic azure 2025-11-26 00:02:48 +05:30
Sameer Kankute 59bcf079fb Merge pull request #17078 from BerriAI/litellm_add_search_logging
Add search API logging and cost tracking in LiteLLM Proxy
2025-11-25 23:59:41 +05:30
Sameer Kankute 2e50db81a5 Merge pull request #17071 from BerriAI/litellm_azure_gpt_5_reasoning
Fix `reasoning_effort="none"` not working on Azure for GPT-5.1
2025-11-25 23:59:25 +05:30
Krish Dholakia 00e17c81a1 Add enforce user param functionality (#17088)
* feat: Add reject_metadata_tags to proxy config

Co-authored-by: krrishdholakia <krrishdholakia@gmail.com>

* Refactor: Rename reject_metadata_tags to reject_clientside_metadata_tags

Co-authored-by: krrishdholakia <krrishdholakia@gmail.com>

---------

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-11-25 09:36:24 -08:00
Sameer Kankute 1c612288bc fix lint errors 2025-11-25 20:20:09 +05:30
Sameer Kankute 255d1bc239 fix lint errors 2025-11-25 20:20:09 +05:30
Sameer Kankute e0396e5fa7 Merge pull request #17082 from BerriAI/main
merge main
2025-11-25 18:49:52 +05:30
Sameer Kankute e2f2ccd913 Add tests related messages api 2025-11-25 18:45:51 +05:30
Sameer Kankute dd4c8ecbef Add v1/messages support for azure anthropic models 2025-11-25 18:36:39 +05:30
Sameer Kankute afe540e88d Fix auth issue 2025-11-25 18:26:25 +05:30
Sameer Kankute 67d69d12b0 Add cost tracking and logging support 2025-11-25 17:14:59 +05:30
Sameer Kankute c149ade6a8 Add tests related to reasoning param none 2025-11-25 13:57:15 +05:30
Sameer Kankute a50083a87b Remove none support from reasoning param 2025-11-25 13:56:30 +05:30
Sameer Kankute b0d511143c remove unsused imports 2025-11-25 13:36:20 +05:30
Sameer Kankute 883cfaeeaf Add tests 2025-11-25 13:32:13 +05:30
Sameer Kankute f52f05748d Update docs related to vertex ai image gen 2025-11-25 13:31:50 +05:30
Sameer Kankute 29ab291cf5 Add vertex ai image support 2025-11-25 13:31:16 +05:30
wcyat 6dcb5425a5 fix(vertex): fix CreateCachedContentRequest enum error (#16965)
* feat: add _fix_enum_types function to remove enums from non-string fields in schema

* test: add test for _fix_enum_types function to validate enum removal from non-string fields
2025-11-24 21:24:29 -08:00
yuneng-jiang babee43dde Merge pull request #17068 from BerriAI/litellm_additional_delete_resource_modal
[Feature] Change Delete Modals to Common Component
2025-11-24 20:58:46 -08:00
Dmitrii Komarov 046b7efbbe Make Bedrock image generation more consistent (#17021) 2025-11-24 20:58:01 -08:00
Saar wintrov cfd35d3b14 Metadata: fix 401 when audio/transcriptions (#17023)
* Metadata: fix 401 when audio/transcriptions

* check if str, CR fixes
2025-11-24 20:56:27 -08:00
Cesar Garcia 650b18974f fix(gemini): skip thinking config for image models (#17027)
* fix(gemini): exclude image models from automatic thinking_level parameter (#17013)

- gemini-3-pro-image-preview does not support thinking_level parameter
- Added check to skip adding thinkingConfig for models containing "image"
- Fixes BadRequestError: "Thinking level is not supported for this model"
- Only affects automatic default behavior, user can still pass reasoning_effort explicitly

Fixes #17013

* test: add tests for gemini-3 image models thinking_level exclusion

* update docs
2025-11-24 20:54:12 -08:00
yuneng-jiang 3aba6d96fd [Fix] UI - Add No Default Models for Team and User Settings (#17037)
* Add No Default Models to Team and User settings

* Removing unused imports

* Adding to Create User and Team flow
2025-11-24 20:53:17 -08:00
Saar wintrov 777ef628d2 Enhancement(helm): ServiceMonitor template rendering (#17038)
* Metadata: fix 401 when audio/transcriptions

* check if str, CR fixes

* Added new helmchart functionality

* .

* .

* adding new tests
2025-11-24 20:53:02 -08:00