litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-29 11:13:09 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	49ca4dc792	Sending `tags` to LiteLLM Proxy (#13517 )	2025-08-11 17:58:50 -07:00
Krrish Dholakia	21e067e47f	fix: fix check	2025-08-11 17:56:19 -07:00
Krrish Dholakia	92ebf5b918	fix(router.py): fix print statement	2025-08-11 17:46:14 -07:00
Jugal D. Bhatt	5efe59e6e2	[Draft] [LLM Translation] Add model id check (#13507 ) * Add model id check * add ruff fix	2025-08-11 15:41:08 -07:00
Ishaan Jaff	9f78287000	[Bug Fix]: Azure OpenAI GPT-5 max_tokens + `reasoning` param support (#13510 ) * add AzureOpenAIGPT5Config * add AzureOpenAIGPT5Config * add AzureOpenAIGPT5Config * add AzureOpenAIGPT5Config * test_azure_gpt5_supports_reasoning_effort * test_azure_gpt5_reasoning * test_azure_gpt5_reasoning * ruff check fixes * docs azure gpt5	2025-08-11 15:40:53 -07:00
Ishaan Jaff	1cd827874f	[Bug Fix] - Allow using `reasoning_effort` for gpt-5 model family and `reasoning` for Responses API (#13475 ) * test_openai_gpt5_reasoning * test_openai_gpt5_reasoning_effort_parameter * add OpenAIGPT5ResponsesAPIConfig * test_openai_gpt5_reasoning_effort_parameter * fixes	2025-08-10 09:55:36 -07:00
Krrish Dholakia	bd8a0ae0d0	docs: fix order	2025-08-10 09:42:55 -07:00
Krrish Dholakia	1dbac75675	docs(index.md): update release with deployment information	2025-08-10 09:31:28 -07:00
Krish Dholakia	0aeb4f1653	fix(health_check_helpers.py): set max tokens for wildcard call to 10, fixes calling gpt-5-nano via wildcard on openai (#13482 ) gpt-5-nano raises errors for max_tokens=1 v1.75.5.rc.1	2025-08-10 09:23:36 -07:00
Krish Dholakia	184687157e	Litellm model cost map fixes (#13480 ) * build(model_prices_and_context_window.json): fix max token values * build(model_prices_and_context_window.json): fix max token values * build(model_prices_and_context_window.json): fix azure gpt-5-chat pricing	2025-08-10 07:38:35 -07:00
Krish Dholakia	c742c76288	Litellm release notes 08 10 2025 (#13479 ) * docs(index.md): initial doc * build(index.md): initial notes * docs(index.md): add llm translation tickets * docs(index.md): document new model support * docs(index.md): document all pricing changes * docs(index.md): add llm api endpoints * docs(index.md): add doc on mcp gateway * docs(index.md): add all remaining rc notes * docs(index.md): cleanup	2025-08-10 07:32:11 -07:00
Krrish Dholakia	ece2c9c65d	bump: version 1.75.4 → 1.75.5 v1.75.5-stable.rc-draft	2025-08-09 16:31:51 -07:00
Krrish Dholakia	0eedf7c447	build: update local model cost map	2025-08-09 16:31:41 -07:00
Krish Dholakia	9f6f96d76c	Litellm dev 08 07 2025 p1 (#13418 ) * fix(router.py): support base model for model group usage allows model group info to show accurate cost information for azure models * fix(router.py): fix changes * test: add unit tests * build(pyproject.toml): bump openai version requirements support custom tool from responses api Closes https://github.com/BerriAI/litellm/issues/13391 * docs(responses_api.md): add verbosity + free-form function calling parameters * docs(responses_api.md): add cfg + minimal reasoning to docs Closes https://github.com/BerriAI/litellm/issues/13391 * docs(responses_api.md): add proxy examples to docs * refactor: fix ruff error	2025-08-09 16:30:04 -07:00
Sannan Nasir	0e53b1feab	Add digitalocean provider (#12169 ) * Add digitalocean provider * Add digitalocean provider * Revert "Add digitalocean provider" This reverts commit 96dda40f45b3d12ea03e861d060ec81460b7759e. * changes * fixes * Update transformation * refactoring * rename provider to Gradient AI * fixes * Incorporte review comments * revert changes * fix typo * revert change * incorporated review comments * Revert "Incorporte review comments" This reverts commit 37bd51bd54ef4fd52ccc12866e47f8de9476d597. * changes * Revert "Revert "Incorporte review comments" This reverts commit 37bd51bd54ef4fd52ccc12866e47f8de9476d597." This reverts commit 68c8a198ee0d6441c3a52f6c6a49c9c95a4cb0a8. * changes * fixes * Update provider_specific_fields.tsx	2025-08-09 16:26:33 -07:00
Ishaan Jaff	f60a9cf908	[Bug]: Fix JWTs access not working with model groups (#13474 ) * fix can_team_access_model * test_find_team_with_model_access_model_group	2025-08-09 16:14:51 -07:00
Jugal D. Bhatt	95fbe59c46	Add local storage auth (#13473 )	2025-08-09 16:13:56 -07:00
Jugal D. Bhatt	67833590d6	[Proxy changes] Litellm add model price reload schedule for multi-pod (#13470 ) * added mcp guardrails doc in mcp.md * add button to reload models * Added button changes * added button for scheduling reload * add multi pod support to reloading the model price json * fix ruff	2025-08-09 16:12:13 -07:00
Krish Dholakia	1c8761111f	Router - reduce p99 latency w/ redis enabled by 50% + OTEL - track pre_call hook latency (#13362 ) * feat(proxy/utils.py): track pre-call hooks in OTEL some pre call hooks can cause latency in high traffic - make sure this is tracked * fix(router.py): move redis call on deployment_callback_on_success to pipeline operation reduces p99 latency by half when redis is enabled * fix(parallel_request_limiter_v3.py): only run check if any item has rate limits set Prevents unnecessary latency added by rate limit checks * test: add unit tests * Latency Improvements: only track tpm/rpm usage when set on deployment+ LLM Caching - use an in-memory cache to reduce redis calls + OTEL - track time spent on LLM caching (#13472) * fix(router.py): only track usage for deployments with tpm/rpm set ensures additional latency avoided for non-tpm/rpm models * fix(caching_handler.py): log time spent on request get cache to OTEL enables easy debugging of call latency * fix(caching_handler.py): use dual cache object for in-memory caching + trace redis call within caching handler * fix(caching_handler.py): working in-memory cache for redis calls ensures dual cache works when redis cache setup for llm calls makes calls quicker by only checking redis when in-memory cache missed for llm api call * test: remove redundant test * test: add unit tests	2025-08-09 16:09:51 -07:00
Ishaan Jaff	60306d34a0	[Bug Fix] Allow using Swagger for /chat/completions (#13469 ) * fix get_openapi_schema * fixes for ProxyChatCompletionRequest * TestSwaggerChatCompletions * fix working request body * fix - add "messages" * fix messages * TestSwaggerChatCompletions * test_messages_field_has_example * ruff check fix	2025-08-09 15:35:45 -07:00
Jugal D. Bhatt	1270df08a4	[Proxy + UI] Litellm add reload model api and button (#13464 ) * added mcp guardrails doc in mcp.md * add button to reload models * Added button changes * remove the model_reload	2025-08-09 13:52:56 -07:00
Jugal D. Bhatt	10a1fe21c5	[LLM Translation] Litellm azure o series drop params (#13353 ) * added route check * fix ruff * Added support for dropping o_series params * Added ruff fix * fix tests	2025-08-09 13:52:45 -07:00
Ishaan Jaff	6184e898b7	Generate unique IDs for litellm_call_id and function_id using UUID (#13468 ) Co-authored-by: Cursor Agent <cursoragent@cursor.com> Co-authored-by: ishaan <ishaan@berri.ai>	2025-08-09 12:59:09 -07:00
Ishaan Jaff	eb4bd26f24	[Bug Fix] - Get Routes (#13466 ) * fixes get_routes_for_mounted_app * fix - use _safe_get_endpoint_name * fix code QA check * test_get_routes_for_mounted_app_with_static_files * test fixes	2025-08-09 12:52:23 -07:00
Ishaan Jaff	825ea65b96	[Bug Fix] Responses API - Responses API failed if input containing ResponseReasoningItem (#13465 ) * add test_responses_api_multi_turn_with_reasoning_and_structured_output * fix transform_responses_api_request	2025-08-09 11:20:34 -07:00
Ishaan Jaff	ee40db7b31	docs native litellm prompts	2025-08-09 09:46:31 -07:00
Ishaan Jaff	94c33200a4	docs - native prompt mgmt (#13463 )	2025-08-09 09:39:16 -07:00
Ishaan Jaff	3999e65a97	docs update	2025-08-09 09:24:41 -07:00
Cole McIntosh	d874bec480	feat(models): add OpenRouter and Cerebras GPT-OSS models (20b, 120b) with pricing and context windows; update backup; refs #13428 (#13442 )	2025-08-08 22:47:51 -07:00
Jugal D. Bhatt	035e5497e0	added mcp guardrails doc in mcp.md (#13452 ) v1.75.4-nightly	2025-08-08 22:47:31 -07:00
Ishaan Jaff	3905cee579	test fixes	2025-08-08 18:50:09 -07:00
Ishaan Jaff	05b48eba62	fix security issue	2025-08-08 18:32:50 -07:00
Ishaan Jaff	32db7f1508	bump: version 1.75.3 → 1.75.4	2025-08-08 18:30:27 -07:00
Ishaan Jaff	edc38b73f9	UI new build	2025-08-08 18:30:15 -07:00
Ishaan Jaff	a843e876a8	[Feat] Working e2e flow for Responses API session management with media (#13456 ) * add MultimodalContent on chat UI * add multi modal img on chat ui * utils for responses API imgs * add code snippet with imgs * chat UI add imgs * add imge upload * chat ui allow adding images * fix chat send button * fix button styles * fix clear chat * fixes session management * fixes for session management * QA fix _should_check_cold_storage_for_full_payload * test_should_check_cold_storage_for_full_payload	2025-08-08 18:28:10 -07:00
Cole McIntosh	1d514cc68b	feat(reasoning): support 'minimal' effort type for OpenAI (#13447 ) * feat(reasoning): support 'minimal' effort type for OpenAI * fix(reasoning): correctly map 'minimal' effort to Reasoning object * chore(dependencies): update OpenAI package version to 1.99.5 in pyproject.toml and requirements.txt * chore(dependencies): update poetry.lock for OpenAI package version 1.99.5 and Poetry version 2.1.3	2025-08-08 17:56:23 -07:00
tanjiro	4571002e19	disable logging settings for non-enterprise users (#13431 )	2025-08-08 17:35:29 -07:00
Ishaan Jaff	840db3fe48	LiteLLM UI - Test Key Page - allow uploading images for /chat/completions and /responses (#13445 ) * add MultimodalContent on chat UI * add multi modal img on chat ui * utils for responses API imgs * add code snippet with imgs * chat UI add imgs * add imge upload * chat ui allow adding images * fix chat send button * fix button styles * fix clear chat	2025-08-08 16:57:08 -07:00
Ishaan Jaff	7e2a00c848	[Docs] Add docs on how router / cooldowns work (#13444 ) * add theme-mermaid * docs cool down * docs cooldown	2025-08-08 15:13:37 -07:00
Ishaan Jaff	d0aa12f3bf	Enhance team member permission error message with guidance for key creation (#13443 ) Co-authored-by: Cursor Agent <cursoragent@cursor.com> Co-authored-by: ishaan <ishaan@berri.ai>	2025-08-08 14:35:06 -07:00
Low Jian Sheng	7f55bbc296	add support for reasoning_effort minimal (#13401 )	2025-08-08 14:34:46 -07:00
Jugal D. Bhatt	f55666bf5f	fix prices for oai gpt 5 (#13441 )	2025-08-08 14:33:21 -07:00
Ishaan Jaff	3b65733af8	[Bug fix] - Error creating standard logging object - can't register atexit after shutdownLitellm fixes standard logging payload (#13436 ) * fix: _generate_cold_storage_object_key * _get_configured_cold_storage_custom_logger * test_e2e_generate_cold_storage_object_key_runtime_error_handled	2025-08-08 12:38:26 -07:00
tanjiro	4fdc866fcb	Display Error from Backend on the UI - Notification (#13427 ) * fix sso logout - add a new login page with sso button * lint fix * lint fix * lint fix * fix tests * fix test * Revert "fix test" This reverts commit 74eb7345710892d5a9d02baec0ef389b98d0dde3. * Reapply "fix test" This reverts commit 72d0b2d4c62f6bb9351a7656ff88efc2ba91aef7. * add host to add modal * close modal after save is clicked. and auto-refresh * show old values in edit modal * send the whole payload on edit * Update settings.tsx * resolve conflict * fix conflict * merge main * first draft of notifications added to settings * add error compatibility by taking errors from the backend - db errors - auth errors * add support for different types of errors * minor * name change * email alerts page notifications modified * remove unused code	2025-08-08 12:34:16 -07:00
Jugal D. Bhatt	51c2ff7c15	fix user membership issue (#13433 )	2025-08-08 12:00:58 -07:00
Ishaan Jaff	3a35c82884	[Feat] Add `reasoning_effort` to OpenAIGPT5Config (#13434 ) * add reasoning_effort toi OpenAIGPT5Config * test_gpt5_supports_reasoning_effort	2025-08-08 11:57:12 -07:00
Edward D'Amato	793e1aa7c7	fix(proxy): add missing braintrust api base to env vars (#13412 )	2025-08-08 08:59:33 -07:00
Thiago Salvatore	c2ad858c83	fix(access group): allow access group on mcp tool retrieval (#13425 ) * fix(access group): allow access group on mcp tool retrieval * fix(test): fix broken tests and add test case for access group * fix(mypy): fix typing issues	2025-08-08 08:55:46 -07:00
Emerson Gomes	aea5af2165	Correct GPT-5 token limits and price (#13423 )	2025-08-08 08:55:33 -07:00
Ishaan Jaff	aefa71a300	bump: version 1.75.2 → 1.75.3 v1.75.3-nightly	2025-08-07 21:32:44 -07:00

1 2 3 4 5 ...

23925 Commits