litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-18 00:48:01 +00:00

Author	SHA1	Message	Date
Krish Dholakia	749051105b	Team Members - reset budget, if duration set + Prometheus - support tag based metrics (#12534 ) * fix(internal_user_endpoints.py): initial commit removing logic to create new budget for new user if default max budget in team set * feat(proxy_setting_endpoints.py): update team member budget when set via default internal user endpoint removes need to create a unique budget per user * feat(proxy_server.py): set team max member budget on startup, if set on config.yaml * fix(prometheus.py): support custom tags for tracking on prometheus Allows tracking user agent values on prometheus metrics * test(test_internal_user_endpoints.py): fix test	2025-07-11 22:54:16 -07:00
Krish Dholakia	c4af2eb5e2	MCP - usage tracking (#12397 ) * fix(common_daily_activity.py): initial commit with working mock BE endpoint for mcp usage * feat(ui/): show mcp server activity on UI allows admin to know which mcp's are being used * feat(common_daily_activity.py): return activity by key * feat(ui/): show top api keys for a given model / mcp server allow user to know which key is driving spend * fix(common_daily_activity.py): use known mcp server names * feat(server.py): log the namespaced tool name (includes server prefix) allow accurate cost tracking * feat(db_spend_update_writer.py): log by mcp_namespaced_tool_name store aggregate daily activity by mcp_namespaced_tool_name Enables cost / usage tracking by mcp tool name * fix(server.py): add key/user metadata to mcp calls * refactor(common_daily_activity.py): update to return mcp activity in API * fix(common_daily_activity.py): handle empty key * fix(common_daily_activity.py): track when api key is empty * test(test_spend_management_endpoints.py): update tests * fix: fix ui linting error * fix: fix linting errors * test: add missing key * build(schema.prisma): add mcp tool tracking * fix(migration.sql): add schema migration file * feat(server.py): add request logging for mcp calls enables storing the mcp calls * fix(new_usage.tsx): fix linting errors * fix: fix code qa errors * fix(activity_metrics.tsx): fix ui linting errors post-merge * fix(types/utils.py): fix linting error * fix(server.py): always have name	2025-07-08 22:08:16 -07:00
Jugal D. Bhatt	6f27385edf	Ensure message redaction works for responses API logging (#12291 ) * add fixes to choice implementation redaction * add isInstance check on responses API * change datadog to revert back * change datadog to revert back * fix type errors * Redaction test changes * Redaction test changes * Redaction test changes - remove changes	2025-07-04 15:11:20 -07:00
Ishaan Jaff	3d71b49d11	[Feat] Add failure logging support for s3 logger (#12299 ) * add async_log_failure_event * test_basic_s3_v2_logging_failure	2025-07-04 11:11:30 -07:00
Ishaan Jaff	66fafa3a7f	[Feat] Polish - add better error validation when users configure prometheus metrics and labels to control cardinality (#12182 ) * self._pretty_print_invalid_metric_error * docs prometheus.md * test prom validation checks * update metric name * fix _pretty_print_validation_errors * fix linting * test prometheus * test fixes - prometheus	2025-07-01 20:17:17 -07:00
Ishaan Jaff	d727d63a81	[Feat] Add new AWS SQS Logging Integration (#12176 ) * add aws_sqs * add sqs controls * add SQS to registry * fix url lib parse * fixes AWS SQS * test_async_sqs_logger_flush * fix test * fix SQS logger auth * add AWS SQS * add aws sqs * docs logging * test_async_sqs_logger_flush * test_async_sqs_logger_flush * add SQS logger * update SQS logging * use constants for SQS	2025-06-30 14:02:49 -07:00
Ishaan Jaff	2bb8048864	[Feat] Add OpenAI Search Vector Store Operation (#12018 ) * add BaseVectorStoreTransformation * fix BaseVectorStoreTransformation * add OpenAIVectorStoreTransformation * fix transform * add search, asearch vector stores * add skeleton for vector store searching * fix VectorStoreSearchOptionalRequestParams * fix VectorStoreRequestUtils * fix litellm.asearch/litellm.search * fix BaseVectorStoreConfig * add vector_store_search_handler to llm http handler * use llm http handler for searching vector stores * fix base vector store config * fix vector_store_search_handler * async_vector_store_search_handler * add conftest * add BaseVectorStoreTest * move litellm.integrations.vector_store_integrations * fix working OAI OpenAIVectorStoreConfig * add Search vector store * add OpenAI Vector Stores	2025-06-24 15:52:43 -07:00
Ishaan Jaff	8c5fb6f539	[Feat] Enterprise - Allow dynamically disabling callbacks in request headers (#11985 ) * Add support for disabling callbacks via x-litellm-disable-callbacks header * add _is_callback_disabled_via_headers * add get_proxy_server_request_headers * _is_callback_disabled_via_headers * X_LITELLM_DISABLE_CALLBACKS * add EnterpriseCallbackControls * use EnterpriseCallbackControls * use CustomLoggerRegistry * use CustomLoggerRegistry * CustomLoggerRegistry * EnterpriseCallbackControls * TestEnterpriseCallbackControls * docs clean up * docs dynamic callbacks * doc fixes * fix code qa checks * fix CustomLoggerRegistry --------- Co-authored-by: Cursor Agent <cursoragent@cursor.com>	2025-06-23 14:32:05 -07:00
Ishaan Jaff	931b2e4875	[Bug Fix] Fix model_group tracked for /v1/messages and /moderations (#11933 ) * fixes _get_router_metadata_variable_name * fixes _update_kwargs_before_fallbacks * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_moderations_api_logging * fix _pass_through_moderation_endpoint_factory	2025-06-20 14:51:50 -07:00
Krrish Dholakia	3540984965	test: fix tests	2025-06-19 16:35:31 -07:00
Ishaan Jaff	bcdb53920a	[Fix] Bug Fix for using prom metrics config (#11779 ) * fixes for using config for prom metrics * test_set_llm_deployment_success_metrics_with_label_filtering * fixes for deployment failure * fix code qa checks * test_async_post_call_success_hook	2025-06-17 14:44:24 -07:00
Krrish Dholakia	bb907b5ecc	test: fix test	2025-06-16 18:49:41 -07:00
Krish Dholakia	7a128e2017	VertexAI Anthropic - streaming passthrough cost tracking (#11734 ) * feat(vertex_passthrough_logging_handler.py): initial anthropic passthrough streaming cost tracking support * fix: fix linting errors * test: update test	2025-06-15 01:16:43 -07:00
Krish Dholakia	c92b6c175c	Prometheus - fix request increment + add route tracking for streaming requests (#11731 ) * fix(prometheus.py): remove request increment from inside the log success event it's only done on post-call success/failure * fix(litellm_logging.py): add additional validation step for checking if 'stream' is true prevent double counting on non-stream requests * test: add unit testing to ensure stream is not incorrectly set to true * feat(litellm_logging.py): emit request route in standard logging payload used by prometheus streaming metrics for route * fix: fix otel test * fix: fix linting errors * test: update test * fix: fix linting error	2025-06-14 16:26:48 -07:00
Ishaan Jaff	ad82792c4b	fix lf OTEL	2025-06-14 15:43:38 -07:00
Krrish Dholakia	3608db5ffe	fix(prometheus.py): update tests	2025-06-06 09:12:54 -07:00
Ishaan Jaff	23627d6a26	[Fix] [Bug]: Knowledge Base Call returning error (#11467 ) * fix:get_and_pop_recognised_vector_store_tools * test: tools wwith vector stores * test - bedrock kb tools * fix: add clear comment * fix: vector store tools	2025-06-05 18:24:36 -07:00
Ishaan Jaff	a1f3a1c5dc	[Feat] Performance - Don't create 1 task for every hanging request alert (#11385 ) * feat: add async_get_oldest_n_keys in memory cache * fix: add add_request_to_hanging_request_check * test: alerting * feat: v2 hanging request check * fix: HangingRequestData * fix: AlertingHangingRequestCheck * fix: check_for_hanging_requests * fix: use correct metadata location for hanging requests * fix: formatting alert * test hanging request check * fix: add guard flags for background tasks alerting	2025-06-03 21:12:54 -07:00
Krrish Dholakia	2fe0a2750b	test: ensure aws region correctly set	2025-06-03 20:58:21 -07:00
Ishaan Jaff	41a2a62511	fix: bedrock kb test	2025-06-03 11:55:41 -07:00
Ishaan Jaff	3db272b6d2	[Perf] - Add Async + Batched S3 Logging (#11340 ) * fix: add s3 v2 async * fix: add s3 v2 async * fix: add s3 v2 async * test: s3 v2 logging * fixes: s3 logging * fixes: s3 logging use max upload batch size * fixes: s3 logging tests * fixes: s3 logging tests * fixes: s3 logging tests	2025-06-02 21:52:34 -07:00
Ishaan Jaff	7d47417906	test: fixes	2025-05-31 12:42:56 -07:00
Krish Dholakia	1995c7aad5	fix(utils.py): support non default params for audio transcription (#11212 ) * fix(utils.py): support non default params for audio transcription allows passing provider specific params straight through on transcription calls * fix(gpt_transformation.py): fix o_series model routing call _transform_request on async event * refactor: refactor tests * test(test_azure_chat_o_series_transformation.py): add unit test for azure o series error * test: update test * test: update json * fix: fix mutiple keyword error	2025-05-28 22:24:02 -07:00
Ishaan Jaff	0590b1eb3a	[Fix] Prometheus Metrics - Do not track end_user by default + expose flag to enable tracking end_user on prometheus (#11192 ) * fix: testing for disabling end user on metrics * fix: fixes for test_prometheus_factory * Delete litellm/model_prices_and_context_window_backup.json * fix: issues with merge conflicts * fix: test_get_end_user_id_for_cost_tracking_prometheus_only * Update tests/test_litellm/integrations/test_prometheus.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-05-27 17:06:58 -07:00
Krish Dholakia	066a502b89	Litellm dev 05 26 2025 p1 (#11165 ) * fix(utils.py): prevent leaking sensitive keys to langfuse Fixes https://github.com/BerriAI/litellm/issues/11150 * test(langfuse/): unit test preventing future bedrock key leaks Fixes https://github.com/BerriAI/litellm/issues/11150 * test(test_langfuse_e2e_test.py): add unit test for vertex - make sure no key leaks occur * ci(test-litellm.yml): add pytest retry to github workflow * fix(proxy_server.py): support forwarding `/sso/key/generate` to the server root path url Fixes https://github.com/BerriAI/litellm/issues/10761 * fix(proxy_server.py): don't rewrite absolute path (PROXY_BASE_URL) with relative path (SERVER_ROOT_PATH) This causes issues when using a custom path with sso, when doing redirects * fix(utils.py): ignore token - will mistakenly redact 'max_tokens' as well	2025-05-26 22:00:48 -07:00
Krish Dholakia	010a4d44af	Fix passing standard optional params (#11124 ) * fix(main.py): use processed non-default-params as standard input params for langfuse Fixes https://github.com/BerriAI/litellm/issues/11072 Fixes https://github.com/BerriAI/litellm/issues/11096 * fix(main.py): rename variable to be more accurate * test(test_langfuse_e2e_test.py): add router unit test for langfuse e2e testing Prevent https://github.com/BerriAI/litellm/issues/11072 from happening again * build: update lock * fix(utils.py): refactor optional params function make it easier to get the standardized non default params * fix(utils.py): improve process non default params function * fix(main.py): include provider specific params in processed non default params used in logging ensures user can see any provider specific params on langfuse ensures user can see any provider specific params on langfus e	2025-05-24 12:12:31 -07:00
Ishaan Jaff	86cdb8382b	[Feat] Use aiohttp transport by default - 97% lower median latency (#11097 ) * fix: add flag for disabling use_aiohttp_transport * feat: add _create_async_transport * feat: fixes for transport * add httpx-aiohttp * feat: fixes for transport * refactor: fixes for transport * build: fix deps * fixes: test fixes * fix: ensure aiohttp does not auto set content type * test: test fixes * feat: add LiteLLMAiohttpTransport * fix: fixes for responses API handling * test: fixes for responses API handling * test: fixes for responses API handling * feat: fixes for transport * fix: base embedding handler * test: test_async_http_handler_force_ipv4 * test: fix failing deepeval test * fix: add YARL for bedrock urls * fix: issues with transport * fix: comment out linting issues * test fix * test: XAI is unstable * test: fixes for using respx * test: XAI fixes * test: XAI fixes * test: infinity testing fixes * docs(config_settings.md): document param * test: test_openai_image_edit_litellm_sdk * test: remove deprecated test * bump respx==0.22.0 * test: test_xai_message_name_filtering * test: fix anthropic test after bumping httpx * use n 4 for mapped tests (#11109) * fix: use 1 session per event loop * test: test_client_session_helper * fix: linting error * fix: resolving GET requests on httpx 0.28.1 * test fixes proxy unit tests * fix: add ssl verify settings * fix: proxy unit tests * fix: refactor * tests: basic unit tests for aiohttp transports * tests: fixes xai --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>	2025-05-23 22:55:35 -07:00
Krish Dholakia	2efaa3cf36	Expose `/list` and `/info` endpoints for Audit Log events (#11102 ) * feat(audit_logging_endpoints.py): expose list endpoint to show all audit logs make it easier for user to retrieve individual endpoints * feat(enterprise/): add audit logging endpoint * feat(audit_logging_endpoints.py): expose new GET `/audit/{id}` endpoint make it easier to retrieve view individual audit logs * feat(key_management_event_hooks.py): correctly show the key of the user who initiated the change * fix(key_management_event_hooks.py): add key rotations as an audit log event ' * test(test_audit_logging_endpoints.py): add simple unit testing for audit log endpoint * fix: testing fixes * fix: fix ruff check	2025-05-23 22:54:59 -07:00
Ishaan Jaff	13bbf11ab0	test: fix failing deepeval test	2025-05-23 14:40:39 -07:00
Ishaan Jaff	754a94db97	Revert "Revert "Support passing `prompt_label` to langfuse (#11018 )"" This reverts commit `0be7e7d088`.	2025-05-22 14:14:39 -07:00
Ishaan Jaff	0be7e7d088	Revert "Support passing `prompt_label` to langfuse (#11018 )" This reverts commit `2b50b43ae2`.	2025-05-22 14:11:19 -07:00
Krish Dholakia	2b50b43ae2	Support passing `prompt_label` to langfuse (#11018 ) * fix: add prompt label support to prompt management hook * feat: support 'prompt_label' parameter for langfuse prompt management Closes https://github.com/BerriAI/litellm/discussions/9003#discussioncomment-13221555 * fix(litellm_logging.py): deep copy optional params to avoid mutation while logging * fix(log-consistent-optional-param-values-across-providers): ensures params can be used for finetuning from providers * fix: fix linting error * test: update test * test: update langfuse tests * fix(litellm_logging.py): avoid deepcopying optional params might contain thread object	2025-05-21 22:27:36 -07:00
Ishaan Jaff	14321a2708	[Feat] Prometheus - Track `route` on proxy_* metrics (#10992 ) * fix: trace route on prometheus metrics * fix: show route on prometheus metrics for total fails * test: trace route on metrics * fix: tests for route in prom metrics * test: fix test metrics * test: fix test_proxy_failure_metrics	2025-05-20 22:55:55 -07:00
Ishaan Jaff	298912bd38	[UI] - Add Guardrail Tracing to LiteLLM SpendLogs (#10893 ) * feat: trace guardrail SLP in spendLogs * test: trace guardrail SLP in spendLogs * add guardrail viewer * checkpoint - working guardrail view on logs * ui add guardrail view to SpendLogs * test: fixes guardrails * trace: fixes guardrails	2025-05-16 12:20:20 -07:00
Ishaan Jaff	42e6e664b2	[Refactor] Make Pagerduty a free feature (#10857 ) * refactor: make pagerduty free * refactor: make pagerduty free * fix: pagerduty loc * fix: linting error	2025-05-15 10:12:06 -07:00
Ishaan Jaff	2a994d7016	[Feat] Presidio Improvements - Allow adding presidio api base on UI, Test presidio on Test Key, fixes for running presidio hook (#10840 ) * feat: add GuardrailProviderSpecificParams * feat: add add apply_guardrail helper for presidio * ui cleanup * fixes pii config on ui * fixes for adding presidio pii * refactor: InitializeGuardrails * feat: init guardrails from DB * allow running guardrails from test key pg * fix: running a request with guardrails on UI * fix: types/guardrails.py * test: test_presidio_pre_call_hook_with_different_call_types * test: test_initialize_presidio_guardrail * test: fix custom guardrail tests	2025-05-14 17:41:33 -07:00
Krish Dholakia	a421316e56	fix(litellm_logging.py): log custom headers in requester metadata (#10818 ) * fix(litellm_logging.py): log custom headers in requester metadata allows passing along custom headers from client to logging integration - e.g. `x-correlation-id` * refactor: move enterprise code out of OSS package work towards simplified CE version of docker image * test: update test * fix: fix linting error	2025-05-13 23:04:37 -07:00
Ishaan Jaff	a4fb1da2d9	fix: pass application/json for GenericAPILogger (#10772 ) * fix: pass application/json for GenericAPILogger * fix: test_generic_api_callback	2025-05-12 14:15:33 -07:00
Ishaan Jaff	d13117992c	fix: test_init_custom_logger_compatible_class_as_callback	2025-05-10 17:26:12 -07:00
Ishaan Jaff	3731ee436a	[Refactor] Use pip package for enterprise/ folder (#10709 ) * init enterprise pip * init enterprise pip * init enterprise pip * test: enterprise pip * add litellm-enterprise to pip * litellm ent check * litellm ent check * fix import email router * fix setup_litellm_enterprise_pip * fix local testing with enterprise pip	2025-05-09 17:18:48 -07:00
Ishaan Jaff	d9f2415f65	[Refactor - Filtering Spend Logs] Add `status` to root of SpendLogs table (#10661 ) * add status to spendlogs * add status to spendlogs * add status to spendlogs * add status to spendlogs * fix tests with spendlogs	2025-05-08 11:29:25 -07:00
Ishaan Jaff	dd32860d62	[Feat] V2 Emails - Fixes for sending emails when creating keys + Resend API support (#10602 ) * working email integration * fix get_custom_loggers_for_type * add SendKeyCreatedEmailEvent type * bug fix, only send 1 email when creating key for user * polish for emails for key created * polish for key created email * fix test_init_custom_logger_compatible_class_as_callback * testing resend email integration * testing fixes for email integration	2025-05-06 22:50:48 -07:00
Krish Dholakia	500e6cddf5	Fix otel - follow genai semantic conventions + support 'instructions' param for tts (#10608 ) * fix(opentelemetry.py): fix passing otel semantic conventions Fixes SpanAttributes.LLM_PROMPTS to SpanAttributes.LLM_PROMPTS.value * fix(opentelemetry.py): ensure spanattributes always pass the actual enum value * fix(main.py): support passing 'instructions' param for gpt-4o-mini-tts * test: update tests	2025-05-06 21:57:01 -07:00
Ishaan Jaff	489f1a6c25	[Feat] v2 Custom Logger API Endpoints (#10575 ) * fixes for generic api logger * tests for generic api logger * test_generic_api_callback_multiple_logs * allow health checking generic api endpoints * docs generic api endpoint for logging * allow setting headers for generic api callback * fix for test_init_custom_logger_compatible_class_as_callback * fix linting	2025-05-05 16:57:55 -07:00
Ishaan Jaff	3f45200802	test_openai_responses_api_web_search_cost_tracking	2025-05-03 20:22:42 -07:00
Ishaan Jaff	8faa7dbc39	fixes for web search tests	2025-05-03 18:10:30 -07:00
Ishaan Jaff	a37e4cb0b8	fix typo (#10535 )	2025-05-03 11:39:40 -07:00
Ishaan Jaff	988e20aa36	[QA] Bedrock Vector Stores Integration - Allow using with registry + in OpenAI API spec with tools (#10516 ) * refactor KB implementation to use central registry * allow passing tools when making KB calls * test fixes * linting fix * fix kb tests * QA for KB stored in DB * fix, use litellm_credential_name when adding KB on litellm UI * QA list endpoint vector stores * allow using UI creds with KBs	2025-05-03 08:30:38 -07:00
Ishaan Jaff	28cb7cc0ed	[Feat] KnowledgeBase/Vector Store - Log `StandardLoggingVectorStoreRequest` for requests made when a vector store is used (#10509 ) * ensure vector store results are logged in SLP * fix tests * fix tests with vector_store_request_metadata * fix linting	2025-05-02 13:43:20 -07:00
Ishaan Jaff	96e75628d6	[Fix + Refactor] Trigger Soft Budget Webhooks When Key Crosses Threshold (#10491 ) * fix slack alerting with webhooks * emit correct event group/entity on webhooks * refactor to use a common class of alerts with abc methods * fixes for tests * refactor to use a common class of alerts with abc methods * Send a budget alert on slack or webhook * unit test slack alerting * fix code qa	2025-05-02 07:06:07 -07:00

1 2 3 4

199 Commits