litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-18 00:48:01 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	daa682e125	fix(tests): add missing start_db_health_watchdog_task mock (#21804 ) * fix(tests): add missing start_db_health_watchdog_task mock in test_proxy_server_prisma_setup * fix(tests): add missing start_db_health_watchdog_task mock in test_health_check_not_called_when_disabled	2026-02-21 12:31:52 -08:00
Shivam Rawat	c0e87f7ffb	fixed byok models for teams issue (#21408 )	2026-02-17 15:26:03 -08:00
Julio Quinteros Pro	2d41b03f8b	fix(test): mock environment variables for callback validation test The test test_proxy_config_state_post_init_callback_call was failing with: ``` ValidationError: 2 validation errors for TeamCallbackMetadata callback_vars.langfuse_public_key Input should be a valid string [type=string_type, input_value=None, input_type=NoneType] ``` Root cause: The test uses environment variable references like "os.environ/LANGFUSE_PUBLIC_KEY" which get resolved at runtime. In parallel execution with --dist=loadscope, these environment variables may not be set in all worker processes, causing the resolution to return None, which fails Pydantic validation expecting strings. Solution: Use monkeypatch to set the required environment variables before the test runs. This ensures consistent behavior across all test execution environments (local, CI, parallel workers). Fixes test failure exposed by PR #21277. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-15 20:44:17 -03:00
Sameer Kankute	9083b06ba7	Fix test_provider_specific_header_in_request	2026-02-11 16:56:26 +05:30
Harshit Jain	51d565f619	fix conflicts with main- (this PR is from upstream/main)	2026-02-07 03:10:53 +05:30
Ishaan Jaffer	35c636ba97	test_health_check_not_called_when_disabled	2026-01-10 13:55:11 -08:00
Alexsander Hamir	5534038e93	Fix CI: Revert security scan changes and add GitGuardian ignore rules (#18358 )	2025-12-22 17:03:53 -08:00
yuneng-jiang	81dc70673a	Merge remote-tracking branch 'origin' into litellm_ui_unset_values	2025-12-22 11:44:41 -08:00
Ishaan Jaffer	6112160a16	Revert "[Fix] Security - Remove example API keys with high entropy (#18255 )" This reverts commit `24edbccf5c`.	2025-12-20 20:48:11 +05:30
yuneng-jiang	ffcac2eebc	Allow deleting key expiry	2025-12-19 18:04:04 -08:00
Alexsander Hamir	24edbccf5c	[Fix] Security - Remove example API keys with high entropy (#18255 )	2025-12-19 10:09:50 -08:00
Alexsander Hamir	e9baa83a0f	[Fix] CI/CD – Clean Up Performance PR Changes & others (#17838 )	2025-12-11 12:50:03 -08:00
Petre Alexandru	911e802969	feat: add parallel execution handling in during_call_hook (#16279 )	2025-11-05 18:35:25 -08:00
Ishaan Jaffer	cb57455172	test_foward_litellm_user_info_to_backend_llm_call	2025-10-27 13:48:23 -07:00
Krish Dholakia	2bd41dc034	Guardrails - Responses API, Image Gen, Text completions, Audio transcriptions, Audio Speech, Rerank, Anthropic Messages API support via the unified `apply_guardrails` function (#15706 ) * fix(presidio.py): handle content as a list of texts covers openai + anthropic messages api * fix(presidio.py): safe get messages * test: add unit testing for presidio guardrails * fix(unified_guardrail.py): initial commit * fix(enkryptai.py): implement apply_guardrail to enkrypt guardrail * fix(unified_guardrail.py): support unified guardrail on input * feat(unified_guardrail.py): add post call success hook implementation allows us to just have 1 place to handle llm translation to guardrail api spec * refactor: refactor initial unified guardrail component * refactor: more refactoring * feat(responses/): add guardrails to responses api allows existing guardrails to work for new llm endpoints * docs(adding_guardrail_support.md): document new guardrail endpoint support * test: add unit tests * feat(image_generation/): add guardrail support for image generation endpoint * feat(openai/text_completion): support guardrails on `/v1/completions` API * docs: document guardrails support on new endpoints * docs: clarify when guardrails run * feat(openai/speech): add guardrail support for input * docs(rerank/): add guardrail support on input query * fix: fix ruff check	2025-10-25 13:38:57 -07:00
Ishaan Jaff	f55745fc5e	[Fix] Forward anthropic-beta headers to Bedrock, VertexAI (#15700 ) * [Fix] Forward anthropic-beta headers to Bedrock and other cross-provider scenarios (#15623) * add_provider_specific_headers_to_request * fix add_provider_specific_headers_to_request * test_provider_specific_header_multi_provider * test_provider_specific_header_in_request --------- Co-authored-by: Jack Venberg <jack.venberg@rover.com>	2025-10-18 16:26:32 -07:00
Mubashir Osmani	8b804303ed	fix: ci/cd tests + lint errors (#14646 ) * fix: lint errors + tests * fixed ci tests * fixed tests --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2025-09-17 17:06:43 -07:00
Krrish Dholakia	7e5bc8af28	test: update test	2025-07-29 21:35:44 -07:00
Ishaan Jaff	ff7dd1756a	[Security Bug Fix] Ensure only LLM API route fails get logged on Langfuse (and other loggers) (#12308 ) * _is_proxy_only_llm_api_error * test_proxy_only_error_true_for_llm_route * add not on change * Update tests/test_litellm/proxy/test_proxy_utils.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * add test_post_call_failure_hook_auth_error_key_info_route * test fix _is_proxy_only_llm_api_error * test_chat_completion_request_with_redaction * test_post_call_failure_hook_auth_error_llm_api_route --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-07-04 14:42:42 -07:00
Youfu Zhang	1c68c24358	introduce new environment variable NO_REDOC to opt-out Redoc (#12092 ) Signed-off-by: Youfu Zhang <zhangyoufu@gmail.com>	2025-06-27 21:26:37 -07:00
Krish Dholakia	cb90f8e613	Allow `/models` to return correct models for custom wildcard prefixes (#11784 ) * fix(model_checks.py): cleanup logic support wildcard models with non-provider prefix's for model discovery Closes https://github.com/BerriAI/litellm/pull/10358 * feat(model_checks.py): delegate wildcard prefix appending to the get_known_models_from_wildcard function remove from the 'get_provider_models' function * fix(model_checks.py): don't double add the wildcard prefix * test: update tests	2025-06-16 22:11:36 -07:00
Laurien	0c50f8bcc9	Update enduser spend and budget reset date based on budget duration (#8460 )	2025-06-08 08:39:14 -07:00
Ishaan Jaff	ea841eeb9b	[Feat] UI - show vector store permissions for Key, Team, Org (#11277 ) * fix LiteLLM_ObjectPermissionTable * fix include object_permission for list key * fix key list to inclue obj permissions * fix object permissions for vector stores on key info * add key edit view with vector stores * allow editing vector stores permissions * fixes obj permissions * feat: add obj permission on UI * fix: add object_permission:true * ui show org vector stores on org info * fix: show object permissions on /org/info * feat: allow updating obj permissions for keys * fixes: key object permissions * fixes: team object permissions * fixes: org object permissions * fix vector store selector for Orgs	2025-05-30 17:23:50 -07:00
Krish Dholakia	1caefb0ce0	fix(ui_sso.py): maintain backwards compatibility for older user id va… (#11106 ) * fix(ui_sso.py): maintain backwards compatibility for older user id variations Fixes issue in later SSO checks which only checked id from result * fix(internal_user_endpoints.py): handle trailing whitespace in new user email * fix(internal_user_endpoints.py): apply default_internal_user_settings on all new user calls (even when role not set) allows role undefined users to be assigned the correct role on sign up * feat(proxy_server.py): load default user settings from db - update litellm correctly updates the litellm module with default internal user settings ensures updated settings actually apply * test: add unit test * fix(internal_user_endpoints.py): fix internal user default param role * fix(ui_sso.py): fix linting error	2025-05-23 23:46:29 -07:00
Krrish Dholakia	b54e2ae98b	test: update unit test	2025-05-15 22:18:15 -07:00
Damian Gleumes	384a7ba94d	[Feat]: Configure LiteLLM to Parse User Headers from Open Web UI (#9802 ) * add user_header_name * docs: add per-user tracking to Open WebUI with LiteLLM doc * docs: standardize "OpenWeb UI" spelling across openweb_ui.md * docs: improve wording for openweb_ui guide * fix end_user_id not being set - move user header parsing to add_litellm_data_to_request - also set user_api_key_dict.end_user_id from user header	2025-05-15 22:01:12 -07:00
Ishaan Jaff	4ddca7a79c	Merge branch 'main' into litellm_fix_service_account_behavior	2025-04-01 12:04:28 -07:00
Ishaan Jaff	c2c5dbf24f	test_get_enforced_params	2025-04-01 08:41:53 -07:00
Ishaan Jaff	13aa7f75f6	test_enforced_params_check	2025-04-01 07:40:31 -07:00
Ishaan Jaff	55763ae276	test_end_user_transactions_reset	2025-04-01 07:13:25 -07:00
Ishaan Jaff	923ac2303b	test_end_user_transactions_reset	2025-03-31 20:55:13 -07:00
Ishaan Jaff	758182fc7f	fix typo on codebase	2025-03-27 22:36:00 -07:00
Krrish Dholakia	6ed995952f	fix: fix test	2025-03-14 20:28:50 -07:00
Krish Dholakia	51cb3c84e3	Litellm stable UI 02 17 2025 p1 (#8599 ) * fix(key_management_endpoints.py): initial commit with logic to get all keys for teams user is an admin for * fix(key_managements_endpoints.py): return all keys for teams user is an admin for * fix(key_management_endpoints.py): add query param to ensure user opts into seeing all team keys (not just their own) * fix(regenerate_key_modal.tsx): fix key regenerate * fix(proxy_server.py): fix model metrics check on none api base * test(test_key_generate_prisma.py): remove redundant test * test(test_proxy_utils.py): add unit test covering new management endpoint helper util * fix: fix test * test(test_proxy_server.py): fix test	2025-02-17 17:55:05 -08:00
Krish Dholakia	57e5ec07cc	Improved wildcard route handling on `/models` and `/model_group/info` (#8473 ) * fix(model_checks.py): update returning known model from wildcard to filter based on given model prefix ensures wildcard route - `vertex_ai/gemini-` just returns known vertex_ai/gemini- models test(test_proxy_utils.py): add unit testing for new 'get_known_models_from_wildcard' helper * test(test_models.py): add e2e testing for `/model_group/info` endpoint * feat(prometheus.py): support tracking total requests by user_email on prometheus adds initial support for tracking total requests by user_email * test(test_prometheus.py): add testing to ensure user email is always tracked * test: update testing for new prometheus metric * test(test_prometheus_unit_tests.py): add user email to total proxy metric * test: update tests * test: fix spend tests * test: fix test * fix(pagerduty.py): fix linting error	2025-02-11 19:37:43 -08:00
Ishaan Jaff	81109893ec	(round 4 fixes) - Team model alias setting (#8474 ) * update team info endpoint * clean up model alias * fix model alias * fix model alias card * clean up naming on docs * fix model alias card * fix _model_in_team_aliases * team alias - fix litellm.model_alias_map * fix _update_model_if_team_alias_exists * fix test_aview_spend_per_user * Test model alias functionality with teams: * complete e2e test * test_update_model_if_team_alias_exists	2025-02-11 16:40:01 -08:00
Krish Dholakia	df93debbc7	Internal User Endpoint - vulnerability fix + response type fix (#8228 ) * fix(key_management_endpoints.py): fix vulnerability where a user could update another user's keys Resolves https://github.com/BerriAI/litellm/issues/8031 * test(key_management_endpoints.py): return consistent 403 forbidden error when modifying key that doesn't belong to user * fix(internal_user_endpoints.py): return model max budget in internal user create response Fixes https://github.com/BerriAI/litellm/issues/7047 * test: fix test * test: update test to handle gemini token counter change * fix(factory.py): fix bedrock http:// handling * docs: fix typo in lm_studio.md (#8222) * test: fix testing * test: fix test --------- Co-authored-by: foreign-sub <51928805+foreign-sub@users.noreply.github.com>	2025-02-04 06:41:14 -08:00
Krish Dholakia	1e011b66d3	Ollama ssl verify = False + Spend Logs reliability fixes (#7931 ) * fix(http_handler.py): support passing ssl verify dynamically and using the correct httpx client based on passed ssl verify param Fixes https://github.com/BerriAI/litellm/issues/6499 * feat(llm_http_handler.py): support passing `ssl_verify=False` dynamically in call args Closes https://github.com/BerriAI/litellm/issues/6499 * fix(proxy/utils.py): prevent bad logs from breaking all cost tracking + reset list regardless of success/failure prevents malformed logs from causing all spend tracking to break since they're constantly retried * test(test_proxy_utils.py): add test to ensure bad log is dropped * test(test_proxy_utils.py): ensure in-memory spend logs reset after bad log error * test(test_user_api_key_auth.py): add unit test to ensure end user id as str works * fix(auth_utils.py): ensure extracted end user id is always a str prevents db cost tracking errors * test(test_auth_utils.py): ensure get end user id from request body always returns a string * test: update tests * test: skip bedrock test- behaviour now supported * test: fix testing * refactor(spend_tracking_utils.py): reduce size of get_logging_payload * test: fix test * bump: version 1.59.4 → 1.59.5 * Revert "bump: version 1.59.4 → 1.59.5" This reverts commit 1182b46b2ed814064f55f438c11b590cd7248596. * fix(utils.py): fix spend logs retry logic * fix(spend_tracking_utils.py): fix get tags * fix(spend_tracking_utils.py): fix end user id spend tracking on pass-through endpoints	2025-01-23 23:05:41 -08:00
Krish Dholakia	27560bd5ad	Litellm dev 01 22 2025 p4 (#7932 ) * feat(main.py): add new 'provider_specific_header' param allows passing extra header for specific provider * fix(litellm_pre_call_utils.py): add unit test for pre call utils * test(test_bedrock_completion.py): skip test now that bedrock supports this	2025-01-22 21:52:07 -08:00
Krish Dholakia	866fffb50d	Litellm dev 01 21 2025 p1 (#7898 ) * fix(utils.py): don't pass 'anthropic-beta' header to vertex - will cause request to fail * fix(utils.py): add flag to allow user to disable filtering invalid headers ensure user can control behaviour * style(utils.py): cleanup message * test(test_utils.py): add unit test to cover invalid header filtering * fix(proxy_server.py): fix custom openapi schema generation * fix(utils.py): pass extra headers if set * fix(main.py): fix image variation to use 'client' param	2025-01-21 20:36:11 -08:00
Krish Dholakia	fe60a38c8e	Litellm dev 01 2025 p4 (#7776 ) * fix(gemini/): support gemini 'frequency_penalty' and 'presence_penalty' Closes https://github.com/BerriAI/litellm/issues/7748 * feat(proxy_server.py): new env var to disable prisma health check on startup * test: fix test	2025-01-14 21:49:25 -08:00
Krish Dholakia	7b27cfb0ae	Support temporary budget increases on keys (#7754 ) * fix(gpt_transformation.py): fix response_format translation check for 4o models Fixes https://github.com/BerriAI/litellm/issues/7616 * feat(key_management_endpoints.py): support 'temp_budget_increase' and 'temp_budget_expiry' fields Allow proxy admin to grant temporary budget increases to keys * fix(proxy/_types.py): enforce temp_budget_increase and temp_budget_expiry are always passed together * feat(user_api_key_auth.py): initial working temp budget increase logic ensures key budget exceeded error checks for temp budget in key metadata * feat(proxy_server.py): return the key max budget and key spend in the response headers Allows clientside user to know their remaining limits * test: add unit testing for new proxy utils Ensures new key budget is correctly handled * docs(temporary_budget_increase.md): add doc on temporary budget increase * fix(utils.py): remove 3.5 from response_format check for now not all azure 3.5 models support response_format * fix(user_api_key_auth.py): return valid user api key auth object on all paths	2025-01-14 17:03:11 -08:00
Krish Dholakia	ec5a354eac	add azure o1 pricing (#7715 ) * build(model_prices_and_context_window.json): add azure o1 pricing Closes https://github.com/BerriAI/litellm/issues/7712 * refactor: replace regex with string method for whitespace check in stop-sequences handling (#7713) * Allows overriding keep_alive time in ollama (#7079) * Allows overriding keep_alive time in ollama * Also adds to ollama_chat * Adds some info on the docs about this parameter * fix: together ai warning (#7688) Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com> * fix(proxy_server.py): handle config containing thread locked objects when using get_config_state * fix(proxy_server.py): add exception to debug * build(model_prices_and_context_window.json): update 'supports_vision' for azure o1 --------- Co-authored-by: Wolfram Ravenwolf <52386626+WolframRavenwolf@users.noreply.github.com> Co-authored-by: Regis David Souza Mesquita <github@rdsm.dev> Co-authored-by: Carl <45709281+capsenz@users.noreply.github.com> Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com>	2025-01-12 18:15:35 -08:00
Krish Dholakia	907bcd3a62	Litellm dev 01 08 2025 p1 (#7640 ) * feat(ui_sso.py): support reading team ids from sso token * feat(ui_sso.py): working upsert sso user teams membership in litellm - if team exists Adds user to relevant teams, if user is part of teams and team exists on litellm * fix(ui_sso.py): safely handle add team member task * build(ui/): support setting team id when creating team on UI * build(ui/): teams.tsx allow setting team id on ui * build(circle_ci/requirements.txt): add fastapi-sso to ci/cd testing * fix: fix linting errors	2025-01-08 22:08:20 -08:00
Krish Dholakia	d43d83f9ef	feat(router.py): support request prioritization for text completion c… (#7540 ) * feat(router.py): support request prioritization for text completion calls * fix(internal_user_endpoints.py): fix sql query to return all keys, including null team id keys on `/user/info` Fixes https://github.com/BerriAI/litellm/issues/7485 * fix: fix linting errors * fix: fix linting error * test(test_router_helper_utils.py): add direct test for '_schedule_factory' Fixes code qa test	2025-01-03 19:35:44 -08:00
Krish Dholakia	39cbd9d878	Litellm dev 12 31 2024 p1 (#7488 ) * fix(internal_user_endpoints.py): fix team list sort - handle team_alias being set + None * fix(key_management_endpoints.py): allow team admin to create key for member via admin ui Fixes https://github.com/BerriAI/litellm/issues/7482 * fix(proxy_server.py): allow querying info on specific model group via `/model_group/info` allows client-side user to get model info from proxy * fix(proxy_server.py): add docstring on `/model_group/info` showing how to filter by model name * test(test_proxy_utils.py): add unit test for returning model group info filtered * fix(proxy_server.py): fix query param * fix(test_Get_model_info.py): handle no whitelisted bedrock modells	2024-12-31 23:21:51 -08:00
Krish Dholakia	539f166166	Support budget/rate limit tiers for keys (#7429 ) * feat(proxy/utils.py): get associated litellm budget from db in combined_view for key allows user to create rate limit tiers and associate those to keys * feat(proxy/_types.py): update the value of key-level tpm/rpm/model max budget metrics with the associated budget table values if set allows rate limit tiers to be easily applied to keys * docs(rate_limit_tiers.md): add doc on setting rate limit / budget tiers make feature discoverable * feat(key_management_endpoints.py): return litellm_budget_table value in key generate make it easy for user to know associated budget on key creation * fix(key_management_endpoints.py): document 'budget_id' param in `/key/generate` * docs(key_management_endpoints.py): document budget_id usage * refactor(budget_management_endpoints.py): refactor budget endpoints into separate file - makes it easier to run documentation testing against it * docs(test_api_docs.py): add budget endpoints to ci/cd doc test + add missing param info to docs * fix(customer_endpoints.py): use new pydantic obj name * docs(user_management_heirarchy.md): add simple doc explaining teams/keys/org/users on litellm * Litellm dev 12 26 2024 p2 (#7432) * (Feat) Add logging for `POST v1/fine_tuning/jobs` (#7426) * init commit ft jobs logging * add ft logging * add logging for FineTuningJob * simple FT Job create test * (docs) - show all supported Azure OpenAI endpoints in overview (#7428) * azure batches * update doc * docs azure endpoints * docs endpoints on azure * docs azure batches api * docs azure batches api * fix(key_management_endpoints.py): fix key update to actually work * test(test_key_management.py): add e2e test asserting ui key update call works * fix: proxy/_types - fix linting erros * test: update test --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> * fix: test * fix(parallel_request_limiter.py): enforce tpm/rpm limits on key from tiers * fix: fix linting errors * test: fix test * fix: remove unused import * test: update test * docs(customer_endpoints.py): document new model_max_budget param * test: specify unique key alias * docs(budget_management_endpoints.py): document new model_max_budget param * test: fix test * test: fix tests --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2024-12-26 19:05:27 -08:00
Krish Dholakia	2e86a4806d	Litellm dev 12 24 2024 p2 (#7400 ) * fix(utils.py): default custom_llm_provider=None for 'supports_response_schema' Closes https://github.com/BerriAI/litellm/issues/7397 * refactor(langfuse/): call langfuse logger inside customlogger compatible langfuse class, refactor langfuse logger to use verbose_logger.debug instead of print_verbose * refactor(litellm_pre_call_utils.py): move config based team callbacks inside dynamic team callback logic enables simpler unit testing for config-based team callbacks * fix(proxy/_types.py): handle teamcallbackmetadata - none values drop none values if present. if all none, use default dict to avoid downstream errors * test(test_proxy_utils.py): add unit test preventing future issues - asserts team_id in config state not popped off across calls Fixes https://github.com/BerriAI/litellm/issues/6787 * fix(langfuse_prompt_management.py): add success + failure logging event support * fix: fix linting error * test: fix test * test: fix test * test: override o1 prompt caching - openai currently not working * test: fix test	2024-12-24 20:33:41 -08:00
Ishaan Jaff	56d9427fdb	(Admin UI) correctly render provider name in /models with wildcard routing (#7349 ) * ui fix - allow searching model list + fix bug on filtering * qa fix - use correct provider name for azure_text * ui wrap content onto next line * ui fix - allow selecting current UI session when logging in * ui session budgets * ui show provider models on wildcard models * test provider name appears in model list * ui fix auto scroll on chat ui tab	2024-12-21 14:19:12 -08:00
Krish Dholakia	4c7a3931b7	Litellm dev 12 19 2024 p2 (#7315 ) * fix(proxy_server.py): only update k,v pair if v is not empty/null Fixes https://github.com/BerriAI/litellm/issues/6787 * test(test_router.py): cleanup duplicate calls * test: add new test stream options drop params test * test: update optional params / stream options test to test for vertex ai mistral route specifically Addresses https://github.com/BerriAI/litellm/issues/7309 * fix(proxy_server.py): fix linting errors * fix: fix linting errors	2024-12-19 20:28:16 -08:00

58 Commits
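
Commit 2d41b03f8b in the log above traces a test failure to "os.environ/LANGFUSE_PUBLIC_KEY" references resolving to None in parallel workers where the variable is unset, and fixes it by setting the variable before the test runs. The sketch below illustrates that failure mode and the fix in isolation. It is not the code from the commit: `resolve_env_reference` and `resolve_callback_vars` are hypothetical helpers written for this example, the value "pk-test" is a placeholder, and the standard library's `unittest.mock.patch.dict` stands in for the pytest `monkeypatch` fixture that the commit message names.

```python
import os
from unittest import mock

ENV_PREFIX = "os.environ/"


def resolve_env_reference(value):
    """Hypothetical resolver: map 'os.environ/NAME' to the value of NAME.

    Returns None when NAME is not set, which is the case the commit
    message describes as failing string validation downstream.
    """
    if isinstance(value, str) and value.startswith(ENV_PREFIX):
        return os.environ.get(value[len(ENV_PREFIX):])
    return value


def resolve_callback_vars(callback_vars):
    """Resolve every value in a callback_vars-style dict."""
    return {key: resolve_env_reference(val) for key, val in callback_vars.items()}


callback_vars = {"langfuse_public_key": "os.environ/LANGFUSE_PUBLIC_KEY"}

# Simulate a worker process where the variable was never exported.
with mock.patch.dict(os.environ, {}, clear=True):
    unset_result = resolve_callback_vars(callback_vars)

# Set the variable for the duration of the test, as the fix does.
with mock.patch.dict(os.environ, {"LANGFUSE_PUBLIC_KEY": "pk-test"}):
    patched_result = resolve_callback_vars(callback_vars)

print(unset_result)
print(patched_result)
```

In a pytest test, the same effect would typically come from calling `monkeypatch.setenv("LANGFUSE_PUBLIC_KEY", ...)` at the top of the test, so the value is present in whichever worker runs it and is restored afterwards.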