litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-07-04 07:06:26 +00:00

Author	SHA1	Message	Date
Krish Dholakia	dfdba527e2	Merge pull request #14678 from mritunjaysharma394/patch-1 chore(docker): remove redundant Admin UI build step	2025-09-19 23:28:06 -07:00
Krish Dholakia	e24c8b8769	Merge pull request #14742 from FelipeRodriguesGare/litellm_doc_gemini_batch docs: add Vertex batch provider documentation and sidebar entry	2025-09-19 23:21:29 -07:00
Krish Dholakia	d6553045a3	Merge pull request #14530 from BerriAI/litellm_dev_09_10_2025_p1 fix(prometheus.py): make prometheus work for multiple workers	2025-09-19 22:26:56 -07:00
Krish Dholakia	270d612029	Merge branch 'main' into litellm_dev_09_10_2025_p1	2025-09-19 22:01:57 -07:00
Felipe Gare	d59e813b58	docs: add Vertex batch provider documentation and sidebar entry	2025-09-20 00:20:51 -03:00
Krish Dholakia	08f708a7d7	Merge pull request #14740 from BerriAI/litellm_dev_09_19_2025_p2 UI - Add team-level sampling setting for tracing Langsmith	2025-09-19 19:01:15 -07:00
Krrish Dholakia	73875310d5	test(ui/): add unit testing	2025-09-19 18:48:03 -07:00
Krrish Dholakia	2a827c515a	fix(ui/): add langsmith sampling rate Closes LIT-879	2025-09-19 18:43:48 -07:00
Ishaan Jaffer	a6795a6560	test fix	2025-09-19 18:05:36 -07:00
Ishaan Jaffer	eabc9cd415	test test_e2e_bedrock_embedding	2025-09-19 18:03:20 -07:00
Ishaan Jaffer	ed5c9f1c69	test fixes for mapped tests	2025-09-19 18:01:36 -07:00
Krish Dholakia	6142c3ac3d	Merge pull request #14738 from BerriAI/litellm_dev_09_19_2025_p1 UI SSO - consider token info endpoint on generic SSO route for access control groups	2025-09-19 17:58:37 -07:00
Ishaan Jaffer	dec15a80a2	test: mcp test fix	2025-09-19 17:52:32 -07:00
Ishaan Jaffer	15898c89e1	test: test_azure_openai_gpt_5_responses_api	2025-09-19 17:45:34 -07:00
Krrish Dholakia	ee5058b5e2	test: add unit testing	2025-09-19 17:41:27 -07:00
Ishaan Jaffer	c3f150b13d	mcp test fix	2025-09-19 17:41:15 -07:00
Ishaan Jaffer	0ba4c7753a	test fix	2025-09-19 17:33:37 -07:00
Krrish Dholakia	21d6c4943e	test: add unit test	2025-09-19 17:33:00 -07:00
Ishaan Jaffer	1e20e56715	test fix	2025-09-19 17:32:56 -07:00
Krrish Dholakia	1931335bc3	fix(uiaccesscontrolform.tsx): set ui access mode to none when all_authenticated_users is set	2025-09-19 17:28:45 -07:00
Ishaan Jaffer	98f33843d9	fix: bump litellm-proxy-extras pip	2025-09-19 17:27:29 -07:00
Krrish Dholakia	7694e9f6a9	test: refactoring + testing	2025-09-19 17:20:15 -07:00
Ishaan Jaffer	5992b512fe	Revert "feat: add X_LITELLM_DISABLE_PROMPTS_IN_SPEND_LOGS" This reverts commit `c26b1f1a1c`.	2025-09-19 17:06:21 -07:00
Ishaan Jaffer	c26b1f1a1c	feat: add X_LITELLM_DISABLE_PROMPTS_IN_SPEND_LOGS	2025-09-19 17:05:50 -07:00
Krrish Dholakia	75b98a7909	fix(ui_sso.py): initial commit, adding checking token endpoint response for allowed team ids Allows ui access control to work for the given team ids	2025-09-19 17:02:49 -07:00
Ishaan Jaffer	15ecc363d8	fix: test	2025-09-19 16:35:00 -07:00
Ishaan Jaff	c918e18852	Enrich rate limit error message with specific limit type and reset time (#14736 ) - Add specific rate limit type (requests/tokens/max_parallel_requests) to error message - Include current limit value for better context - Display reset time in human-readable format - Handle negative remaining values gracefully by showing 0 instead - Add reset_at header with timestamp for programmatic use Fixes issue where rate limit errors were ambiguous about which type of limit was exceeded and when the limit would reset. Co-authored-by: Cursor Agent <cursoragent@cursor.com>	2025-09-19 16:32:13 -07:00
Ishaan Jaffer	d739d226ed	fix: test	2025-09-19 16:28:09 -07:00
Ishaan Jaff	90ee9e4587	[Feat] Dynamic Rate Limiter v3 - fixes to ensure priority routing works as expected (#14734 ) * fix: dynamic limiter v3 * fix: dynamic limiter v3 * feat: add dynamic limiter v3 * feat: add dynamic limiter v3 * feat: add dynamic limiter v3 in init litellm_logging * feat: add dynamic limiter v3 in init litellm_logging * fix: priority rate limiting * Potential fix for code scanning alert no. 3397: Clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * fix: priority rate limiting * fix: ruff * fix: mypy lint --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> v1.77.3.dynamic_rates	2025-09-19 16:04:45 -07:00
Franklin	b39fd688a8	doc: navigate to the correct location (#14722 )	2025-09-19 15:25:18 -07:00
Felipe Garé	a696ffe4a6	Litellm gemini batch (#14733 ) * feat: add Vertex AI support for file content retrieval - Extended `custom_llm_provider` to include "vertex_ai" in `afile_content` function. - Implemented file content retrieval logic for Vertex AI in `VertexAIFilesHandler`. - Added helper method to extract bucket and object from URL-encoded file_id. - Created comprehensive unit and integration tests for Vertex AI file handling. - Updated transformation logic to ensure compatibility with Vertex AI file responses. * fix: update Vertex AI file transformation logic - Modified the transformation logic in `VertexAIFilesConfig` to return a newline-separated JSON string for batch JSONL files instead of a array if JSON strings. * fix: enhance Vertex AI output handling in transformation logic - Updated the transformation logic in `VertexAIBatchTransformation` to utilize the new `OutputInfo` TypedDict for retrieving the GCS output directory. - Added `OutputInfo` class to type definitions for better structure and clarity in Vertex AI responses.	2025-09-19 15:22:52 -07:00
dependabot[bot]	060fa4d82f	build(deps): bump esbuild and vite in /ui/litellm-dashboard (#14703 ) Bumps [esbuild](https://github.com/evanw/esbuild) to 0.25.10 and updates ancestor dependency [vite](https://github.com/vitejs/vite/tree/HEAD/packages/vite). These dependencies need to be updated together. Updates `esbuild` from 0.21.5 to 0.25.10 - [Release notes](https://github.com/evanw/esbuild/releases) - [Changelog](https://github.com/evanw/esbuild/blob/main/CHANGELOG-2024.md) - [Commits](https://github.com/evanw/esbuild/compare/v0.21.5...v0.25.10) Updates `vite` from 5.4.20 to 7.1.6 - [Release notes](https://github.com/vitejs/vite/releases) - [Changelog](https://github.com/vitejs/vite/blob/main/packages/vite/CHANGELOG.md) - [Commits](https://github.com/vitejs/vite/commits/v7.1.6/packages/vite) --- updated-dependencies: - dependency-name: esbuild dependency-version: 0.25.10 dependency-type: indirect - dependency-name: vite dependency-version: 7.1.6 dependency-type: direct:development ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-09-19 15:21:30 -07:00
Max Falk	12da4039b9	fix: Prevent AttributeError for _get_tags_from_request_kwargs (#14735 ) * fix: avoid NoneType AttributeError when extracting tags I've been running into this error: ``` 21:47:08 - LiteLLM:ERROR: litellm_logging.py:2396 - LiteLLM.LoggingError: [Non-Blocking] Exception occurred while success logging Traceback (most recent call last): File "/usr/lib/python3.13/site-packages/litellm/litellm_core_utils/litellm_logging.py", line 2312, in async_success_handler await callback.async_log_success_event( ...<6 lines>... ) File "/usr/lib/python3.13/site-packages/litellm/router_strategy/budget_limiter.py", line 396, in async_log_success_event request_tags = _get_tags_from_request_kwargs(kwargs) File "/usr/lib/python3.13/site-packages/litellm/router_strategy/tag_based_routing.py", line 144, in _get_tags_from_request_kwargs return _metadata.get("tags", []) ^^^^^^^^^^^^^ AttributeError: 'NoneType' object has no attribute 'get' ``` This makes the function more resilient without resorting to try catch. * add tests Signed-off-by: Max Falk <gmdfalk@gmail.com> --------- Signed-off-by: Max Falk <gmdfalk@gmail.com>	2025-09-19 15:21:02 -07:00
tosi	dd89bae2ff	fix: add download prometheus.yml to avoid error (#14725 )	2025-09-19 14:20:01 -07:00
Ishaan Jaffer	afbb2fc20c	docs: v1.77.2.rc.2	2025-09-19 08:28:27 -07:00
Krish Dholakia	f471cf002d	Merge pull request #14715 from timelfrink/fix/issue-14120-gemini-2.5-flash-image-preview Fix: gemini-2.5-flash-image-preview model routing for image generation	2025-09-19 07:43:03 -07:00
Tim Elfrink	5323ca8346	Fix Gemini 2.5 Flash Image Preview response parsing - Add response_modalities configuration to request format - Fix response parsing to use camelCase 'inlineData' instead of snake_case 'inline_data' - Update test to validate proper request format and response parsing - All existing Gemini image generation tests pass	2025-09-19 11:20:03 +02:00
Tim Elfrink	3a98fd6096	Remove hardcoded model name and fix breaking change - Reverted GEMINI_2_5_FLASH_IMAGE_PREVIEW_MODEL constant usage - Made endpoint selection conditional for gemini-2.5-flash-image-preview only - Preserved existing Imagen models functionality with :predict endpoint - Fixed potential breaking change that would affect 6 other Gemini image models	2025-09-19 09:29:14 +02:00
Krish Dholakia	aa54994994	Merge pull request #14666 from michaeltansg/feat/add-bangkok-timezone Added Indochina Time timezone support for budget resets	2025-09-18 23:41:27 -07:00
Krish Dholakia	f5838594e0	Merge pull request #14675 from tcx4c70/fix/response-price fix: cost calculation for responses	2025-09-18 23:41:08 -07:00
Krish Dholakia	ad6ba8f5c5	Merge pull request #14695 from uc4w6c/fix/mcp-gateway-tools-list Fix/mcp gateway tools list	2025-09-18 23:40:14 -07:00
Krish Dholakia	d5a839d971	Merge pull request #14700 from BerriAI/litellm_contributor_prs_09_18_2025_p2 Update Bedrock documentation for Titan V2 encoding_format support + Anthropic - account for 1h vs. 5m cache creation token cost difference + UI - add langsmith_sampling_rate as a dynamic param	2025-09-18 23:38:29 -07:00
Krrish Dholakia	92e841e311	fix: fix test	2025-09-18 23:37:38 -07:00
Krish Dholakia	b6d5a65fed	Merge pull request #14708 from mrFranklin/fix/opik-timezone fix: timezone issue of opik	2025-09-18 23:35:27 -07:00
Tim Elfrink	f5e6246143	Add test for gemini-2.5-flash-image-preview fix - Test validates correct endpoint routing to :generateContent - Mock HTTP responses to avoid API limits - Verify request format uses Gemini contents structure - Ensure image generation functionality works correctly	2025-09-19 08:06:01 +02:00
Tim Elfrink	ea30f752bd	Fix Gemini image generation endpoint and request format - Update endpoint from :predict to :generateContent - Add Gemini format support for 2.5-flash-image-preview model - Maintain backward compatibility with existing Imagen models - Handle response parsing for candidates format	2025-09-19 08:04:54 +02:00
Tim Elfrink	282617f2bc	Fix gemini-2.5-flash-image-preview model routing - Update mode from 'chat' to 'image_generation' for both model variants - Ensures correct routing to image generation endpoints - Resolves 400 'request not supported' error for image generation	2025-09-19 07:58:58 +02:00
Ishaan Jaff	80bd8e007f	fix contributor PR linting failing (#14710 ) * validate fix * fix linting error	2025-09-18 20:03:27 -07:00
Krish Dholakia	9a6b1651d2	Merge pull request #14707 from ARajan1084/bedrock-guardrail-silent-failure-correction fix: Bedrock guardrail silent failure correction	2025-09-18 20:03:17 -07:00
Krish Dholakia	9c7e1934cc	Merge pull request #14652 from BerriAI/litellm_dev_09_17_2025_p2_v2 Anthropic - account for 1h vs. 5m cache creation token cost difference + UI - add langsmith_sampling_rate as a dynamic param	2025-09-18 19:51:02 -07:00

1 2 3 4 5 ...

25256 Commits