Commit Graph

25256 Commits

Author SHA1 Message Date
Krish Dholakia dfdba527e2 Merge pull request #14678 from mritunjaysharma394/patch-1
chore(docker): remove redundant Admin UI build step
2025-09-19 23:28:06 -07:00
Krish Dholakia e24c8b8769 Merge pull request #14742 from FelipeRodriguesGare/litellm_doc_gemini_batch
docs: add Vertex batch provider documentation and sidebar entry
2025-09-19 23:21:29 -07:00
Krish Dholakia d6553045a3 Merge pull request #14530 from BerriAI/litellm_dev_09_10_2025_p1
fix(prometheus.py): make prometheus work for multiple workers
2025-09-19 22:26:56 -07:00
Krish Dholakia 270d612029 Merge branch 'main' into litellm_dev_09_10_2025_p1 2025-09-19 22:01:57 -07:00
Felipe Gare d59e813b58 docs: add Vertex batch provider documentation and sidebar entry 2025-09-20 00:20:51 -03:00
Krish Dholakia 08f708a7d7 Merge pull request #14740 from BerriAI/litellm_dev_09_19_2025_p2
UI - Add team-level sampling setting for tracing Langsmith
2025-09-19 19:01:15 -07:00
Krrish Dholakia 73875310d5 test(ui/): add unit testing 2025-09-19 18:48:03 -07:00
Krrish Dholakia 2a827c515a fix(ui/): add langsmith sampling rate
Closes LIT-879
2025-09-19 18:43:48 -07:00
Ishaan Jaffer a6795a6560 test fix 2025-09-19 18:05:36 -07:00
Ishaan Jaffer eabc9cd415 test test_e2e_bedrock_embedding 2025-09-19 18:03:20 -07:00
Ishaan Jaffer ed5c9f1c69 test fixes for mapped tests 2025-09-19 18:01:36 -07:00
Krish Dholakia 6142c3ac3d Merge pull request #14738 from BerriAI/litellm_dev_09_19_2025_p1
UI SSO - consider token info endpoint on generic SSO route for access control groups
2025-09-19 17:58:37 -07:00
Ishaan Jaffer dec15a80a2 test: mcp test fix 2025-09-19 17:52:32 -07:00
Ishaan Jaffer 15898c89e1 test: test_azure_openai_gpt_5_responses_api 2025-09-19 17:45:34 -07:00
Krrish Dholakia ee5058b5e2 test: add unit testing 2025-09-19 17:41:27 -07:00
Ishaan Jaffer c3f150b13d mcp test fix 2025-09-19 17:41:15 -07:00
Ishaan Jaffer 0ba4c7753a test fix 2025-09-19 17:33:37 -07:00
Krrish Dholakia 21d6c4943e test: add unit test 2025-09-19 17:33:00 -07:00
Ishaan Jaffer 1e20e56715 test fix 2025-09-19 17:32:56 -07:00
Krrish Dholakia 1931335bc3 fix(uiaccesscontrolform.tsx): set ui access mode to none when all_authenticated_users is set 2025-09-19 17:28:45 -07:00
Ishaan Jaffer 98f33843d9 fix: bump litellm-proxy-extras pip 2025-09-19 17:27:29 -07:00
Krrish Dholakia 7694e9f6a9 test: refactoring + testing 2025-09-19 17:20:15 -07:00
Ishaan Jaffer 5992b512fe Revert "feat: add X_LITELLM_DISABLE_PROMPTS_IN_SPEND_LOGS"
This reverts commit c26b1f1a1c.
2025-09-19 17:06:21 -07:00
Ishaan Jaffer c26b1f1a1c feat: add X_LITELLM_DISABLE_PROMPTS_IN_SPEND_LOGS 2025-09-19 17:05:50 -07:00
Krrish Dholakia 75b98a7909 fix(ui_sso.py): initial commit, adding checking token endpoint response for allowed team ids
Allows ui access control to work for the given team ids
2025-09-19 17:02:49 -07:00
Ishaan Jaffer 15ecc363d8 fix: test 2025-09-19 16:35:00 -07:00
Ishaan Jaff c918e18852 Enrich rate limit error message with specific limit type and reset time (#14736)
- Add specific rate limit type (requests/tokens/max_parallel_requests) to error message
- Include current limit value for better context
- Display reset time in human-readable format
- Handle negative remaining values gracefully by showing 0 instead
- Add reset_at header with timestamp for programmatic use

Fixes issue where rate limit errors were ambiguous about which type of limit
was exceeded and when the limit would reset.

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-09-19 16:32:13 -07:00
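The message enrichment listed in the entry above could look roughly like the sketch below. It is an illustration only, not the code from #14736: the function names, the message wording, and the "1m 30s" reset format are assumptions made for this example.

```python
def format_reset(seconds: int) -> str:
    """Render a reset delay such as 90 seconds as '1m 30s' (format is assumed)."""
    seconds = max(0, int(seconds))
    minutes, secs = divmod(seconds, 60)
    hours, minutes = divmod(minutes, 60)
    parts = []
    if hours:
        parts.append(f"{hours}h")
    if minutes:
        parts.append(f"{minutes}m")
    if secs or not parts:
        parts.append(f"{secs}s")
    return " ".join(parts)


def rate_limit_message(
    limit_type: str, limit: int, remaining: int, reset_in_seconds: int
) -> str:
    """Build an error message naming the limit type, its value, and the reset time.

    Hypothetical helper: names and wording are not taken from the repository.
    """
    # A negative remaining count is reported as 0, as the commit describes.
    remaining = max(0, remaining)
    return (
        f"Rate limit exceeded for {limit_type}. "
        f"Limit: {limit}, remaining: {remaining}, "
        f"resets in {format_reset(reset_in_seconds)}."
    )


print(rate_limit_message("tokens", 1000, -5, 90))
```

Clamping with `max(0, remaining)` keeps the message readable when concurrent requests push the counter below zero.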
Ishaan Jaffer d739d226ed fix: test 2025-09-19 16:28:09 -07:00
Ishaan Jaff 90ee9e4587 [Feat] Dynamic Rate Limiter v3 - fixes to ensure priority routing works as expected (#14734)
* fix: dynamic limiter v3

* fix: dynamic limiter v3

* feat: add dynamic limiter v3

* feat: add dynamic limiter v3

* feat: add dynamic limiter v3 in init litellm_logging

* feat: add dynamic limiter v3 in init litellm_logging

* fix: priority rate limiting

* Potential fix for code scanning alert no. 3397: Clear-text logging of sensitive information

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

* fix: priority rate limiting

* fix: ruff

* fix: mypy lint

---------

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
v1.77.3.dynamic_rates
2025-09-19 16:04:45 -07:00
Franklin b39fd688a8 doc: navigate to the correct location (#14722) 2025-09-19 15:25:18 -07:00
Felipe Garé a696ffe4a6 Litellm gemini batch (#14733)
* feat: add Vertex AI support for file content retrieval

- Extended `custom_llm_provider` to include "vertex_ai" in `afile_content` function.
- Implemented file content retrieval logic for Vertex AI in `VertexAIFilesHandler`.
- Added helper method to extract bucket and object from URL-encoded file_id.
- Created comprehensive unit and integration tests for Vertex AI file handling.
- Updated transformation logic to ensure compatibility with Vertex AI file responses.

* fix: update Vertex AI file transformation logic

- Modified the transformation logic in `VertexAIFilesConfig` to return a newline-separated JSON string for batch JSONL files instead of an array of JSON strings.

* fix: enhance Vertex AI output handling in transformation logic

- Updated the transformation logic in `VertexAIBatchTransformation` to utilize the new `OutputInfo` TypedDict for retrieving the GCS output directory.
- Added `OutputInfo` class to type definitions for better structure and clarity in Vertex AI responses.
2025-09-19 15:22:52 -07:00
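Two of the steps in the entry above, splitting a URL-encoded file_id into bucket and object and returning batch output as newline-separated JSON, can be sketched as follows. This is a minimal illustration under stated assumptions, not the repository's implementation: both function names are hypothetical, and the `gs://bucket/object` layout of the decoded file_id is assumed.

```python
import json
from typing import Any, Dict, List, Tuple
from urllib.parse import unquote


def split_gcs_file_id(file_id: str) -> Tuple[str, str]:
    """Split a URL-encoded file_id into (bucket, object_path).

    Assumption: the decoded id looks like 'gs://bucket/path/to/object'.
    """
    decoded = unquote(file_id)
    prefix = "gs://"
    if decoded.startswith(prefix):
        decoded = decoded[len(prefix):]
    # Everything before the first slash is the bucket, the rest is the object.
    bucket, _, object_path = decoded.partition("/")
    return bucket, object_path


def to_jsonl(records: List[Dict[str, Any]]) -> str:
    """Serialize records as one JSON document per line (JSONL)."""
    return "\n".join(json.dumps(record) for record in records)


print(split_gcs_file_id("gs%3A%2F%2Fmy-bucket%2Fbatch%2Fout.jsonl"))
print(to_jsonl([{"id": 1}, {"id": 2}]))
```

A single newline-separated string can be written straight to a `.jsonl` file, whereas an array of JSON strings would need to be joined by the caller first.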
dependabot[bot] 060fa4d82f build(deps): bump esbuild and vite in /ui/litellm-dashboard (#14703)
Bumps [esbuild](https://github.com/evanw/esbuild) to 0.25.10 and updates ancestor dependency [vite](https://github.com/vitejs/vite/tree/HEAD/packages/vite). These dependencies need to be updated together.


Updates `esbuild` from 0.21.5 to 0.25.10
- [Release notes](https://github.com/evanw/esbuild/releases)
- [Changelog](https://github.com/evanw/esbuild/blob/main/CHANGELOG-2024.md)
- [Commits](https://github.com/evanw/esbuild/compare/v0.21.5...v0.25.10)

Updates `vite` from 5.4.20 to 7.1.6
- [Release notes](https://github.com/vitejs/vite/releases)
- [Changelog](https://github.com/vitejs/vite/blob/main/packages/vite/CHANGELOG.md)
- [Commits](https://github.com/vitejs/vite/commits/v7.1.6/packages/vite)

---
updated-dependencies:
- dependency-name: esbuild
  dependency-version: 0.25.10
  dependency-type: indirect
- dependency-name: vite
  dependency-version: 7.1.6
  dependency-type: direct:development
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-19 15:21:30 -07:00
Max Falk 12da4039b9 fix: Prevent AttributeError for _get_tags_from_request_kwargs (#14735)
* fix: avoid NoneType AttributeError when extracting tags

I've been running into this error:
```
21:47:08 - LiteLLM:ERROR: litellm_logging.py:2396 - LiteLLM.LoggingError: [Non-Blocking] Exception occurred while success logging Traceback (most recent call last):
  File "/usr/lib/python3.13/site-packages/litellm/litellm_core_utils/litellm_logging.py", line 2312, in async_success_handler
    await callback.async_log_success_event(
    ...<6 lines>...
    )
  File "/usr/lib/python3.13/site-packages/litellm/router_strategy/budget_limiter.py", line 396, in async_log_success_event
    request_tags = _get_tags_from_request_kwargs(kwargs)
  File "/usr/lib/python3.13/site-packages/litellm/router_strategy/tag_based_routing.py", line 144, in _get_tags_from_request_kwargs
    return _metadata.get("tags", [])
           ^^^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'get'
```

This makes the function more resilient without resorting to try catch.

* add tests

Signed-off-by: Max Falk <gmdfalk@gmail.com>

---------

Signed-off-by: Max Falk <gmdfalk@gmail.com>
2025-09-19 15:21:02 -07:00
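The failure mode in the traceback above, calling `.get` on a metadata value that is `None`, can be avoided without a try/except along these lines. This is a hedged sketch, not the code merged in #14735: `get_tags_safely` is a hypothetical name, and the assumption that tags live under a `metadata` key of the request kwargs is made only for illustration.

```python
from typing import Any, Dict, List, Optional


def get_tags_safely(request_kwargs: Optional[Dict[str, Any]]) -> List[str]:
    """Return request tags, tolerating missing or None metadata.

    Hypothetical helper: the 'metadata' key layout is an assumption.
    """
    if not request_kwargs:
        return []
    # dict.get returns None when the key exists with a None value, so the
    # default argument alone is not enough; fall back to an empty dict.
    metadata = request_kwargs.get("metadata") or {}
    return metadata.get("tags") or []


print(get_tags_safely({"metadata": None}))
print(get_tags_safely({"metadata": {"tags": ["prod"]}}))
```

The `or {}` fallback matters because `kwargs.get("metadata", {})` still returns `None` when the key is present with a `None` value.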
tosi dd89bae2ff fix: add download prometheus.yml to avoid error (#14725) 2025-09-19 14:20:01 -07:00
Ishaan Jaffer afbb2fc20c docs: v1.77.2.rc.2 2025-09-19 08:28:27 -07:00
Krish Dholakia f471cf002d Merge pull request #14715 from timelfrink/fix/issue-14120-gemini-2.5-flash-image-preview
Fix: gemini-2.5-flash-image-preview model routing for image generation
2025-09-19 07:43:03 -07:00
Tim Elfrink 5323ca8346 Fix Gemini 2.5 Flash Image Preview response parsing
- Add response_modalities configuration to request format
- Fix response parsing to use camelCase 'inlineData' instead of snake_case 'inline_data'
- Update test to validate proper request format and response parsing
- All existing Gemini image generation tests pass
2025-09-19 11:20:03 +02:00
Tim Elfrink 3a98fd6096 Remove hardcoded model name and fix breaking change
- Reverted GEMINI_2_5_FLASH_IMAGE_PREVIEW_MODEL constant usage
- Made endpoint selection conditional for gemini-2.5-flash-image-preview only
- Preserved existing Imagen models functionality with :predict endpoint
- Fixed potential breaking change that would affect 6 other Gemini image models
2025-09-19 09:29:14 +02:00
Krish Dholakia aa54994994 Merge pull request #14666 from michaeltansg/feat/add-bangkok-timezone
Added Indochina Time timezone support for budget resets
2025-09-18 23:41:27 -07:00
Krish Dholakia f5838594e0 Merge pull request #14675 from tcx4c70/fix/response-price
fix: cost calculation for responses
2025-09-18 23:41:08 -07:00
Krish Dholakia ad6ba8f5c5 Merge pull request #14695 from uc4w6c/fix/mcp-gateway-tools-list
Fix/mcp gateway tools list
2025-09-18 23:40:14 -07:00
Krish Dholakia d5a839d971 Merge pull request #14700 from BerriAI/litellm_contributor_prs_09_18_2025_p2
Update Bedrock documentation for Titan V2 encoding_format support + Anthropic - account for 1h vs. 5m cache creation token cost difference + UI - add langsmith_sampling_rate as a dynamic param
2025-09-18 23:38:29 -07:00
Krrish Dholakia 92e841e311 fix: fix test 2025-09-18 23:37:38 -07:00
Krish Dholakia b6d5a65fed Merge pull request #14708 from mrFranklin/fix/opik-timezone
fix: timezone issue of opik
2025-09-18 23:35:27 -07:00
Tim Elfrink f5e6246143 Add test for gemini-2.5-flash-image-preview fix
- Test validates correct endpoint routing to :generateContent
- Mock HTTP responses to avoid API limits
- Verify request format uses Gemini contents structure
- Ensure image generation functionality works correctly
2025-09-19 08:06:01 +02:00
Tim Elfrink ea30f752bd Fix Gemini image generation endpoint and request format
- Update endpoint from :predict to :generateContent
- Add Gemini format support for 2.5-flash-image-preview model
- Maintain backward compatibility with existing Imagen models
- Handle response parsing for candidates format
2025-09-19 08:04:54 +02:00
Tim Elfrink 282617f2bc Fix gemini-2.5-flash-image-preview model routing
- Update mode from 'chat' to 'image_generation' for both model variants
- Ensures correct routing to image generation endpoints
- Resolves 400 'request not supported' error for image generation
2025-09-19 07:58:58 +02:00
Ishaan Jaff 80bd8e007f fix contributor PR linting failing (#14710)
* validate fix

* fix linting error
2025-09-18 20:03:27 -07:00
Krish Dholakia 9a6b1651d2 Merge pull request #14707 from ARajan1084/bedrock-guardrail-silent-failure-correction
fix: Bedrock guardrail silent failure correction
2025-09-18 20:03:17 -07:00
Krish Dholakia 9c7e1934cc Merge pull request #14652 from BerriAI/litellm_dev_09_17_2025_p2_v2
Anthropic - account for 1h vs. 5m cache creation token cost difference + UI - add langsmith_sampling_rate as a dynamic param
2025-09-18 19:51:02 -07:00