litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-07-04 05:05:28 +00:00

Author	SHA1	Message	Date
Krish Dholakia	2b50b43ae2	Support passing `prompt_label` to langfuse (#11018 ) * fix: add prompt label support to prompt management hook * feat: support 'prompt_label' parameter for langfuse prompt management Closes https://github.com/BerriAI/litellm/discussions/9003#discussioncomment-13221555 * fix(litellm_logging.py): deep copy optional params to avoid mutation while logging * fix(log-consistent-optional-param-values-across-providers): ensures params can be used for finetuning from providers * fix: fix linting error * test: update test * test: update langfuse tests * fix(litellm_logging.py): avoid deepcopying optional params might contain thread object	2025-05-21 22:27:36 -07:00
Ishaan Jaff	14321a2708	[Feat] Prometheus - Track `route` on proxy_* metrics (#10992 ) * fix: trace route on prometheus metrics * fix: show route on prometheus metrics for total fails * test: trace route on metrics * fix: tests for route in prom metrics * test: fix test metrics * test: fix test_proxy_failure_metrics	2025-05-20 22:55:55 -07:00
Ishaan Jaff	298912bd38	[UI] - Add Guardrail Tracing to LiteLLM SpendLogs (#10893 ) * feat: trace guardrail SLP in spendLogs * test: trace guardrail SLP in spendLogs * add guardrail viewer * checkpoint - working guardrail view on logs * ui add guardrail view to SpendLogs * test: fixes guardrails * trace: fixes guardrails	2025-05-16 12:20:20 -07:00
Ishaan Jaff	42e6e664b2	[Refactor] Make Pagerduty a free feature (#10857 ) * refactor: make pagerduty free * refactor: make pagerduty free * fix: pagerduty loc * fix: linting error	2025-05-15 10:12:06 -07:00
Ishaan Jaff	2a994d7016	[Feat] Presidio Improvements - Allow adding presidio api base on UI, Test presidio on Test Key, fixes for running presidio hook (#10840 ) * feat: add GuardrailProviderSpecificParams * feat: add add apply_guardrail helper for presidio * ui cleanup * fixes pii config on ui * fixes for adding presidio pii * refactor: InitializeGuardrails * feat: init guardrails from DB * allow running guardrails from test key pg * fix: running a request with guardrails on UI * fix: types/guardrails.py * test: test_presidio_pre_call_hook_with_different_call_types * test: test_initialize_presidio_guardrail * test: fix custom guardrail tests	2025-05-14 17:41:33 -07:00
Krish Dholakia	a421316e56	fix(litellm_logging.py): log custom headers in requester metadata (#10818 ) * fix(litellm_logging.py): log custom headers in requester metadata allows passing along custom headers from client to logging integration - e.g. `x-correlation-id` * refactor: move enterprise code out of OSS package work towards simplified CE version of docker image * test: update test * fix: fix linting error	2025-05-13 23:04:37 -07:00
Ishaan Jaff	a4fb1da2d9	fix: pass application/json for GenericAPILogger (#10772 ) * fix: pass application/json for GenericAPILogger * fix: test_generic_api_callback	2025-05-12 14:15:33 -07:00
Ishaan Jaff	d13117992c	fix: test_init_custom_logger_compatible_class_as_callback	2025-05-10 17:26:12 -07:00
Ishaan Jaff	3731ee436a	[Refactor] Use pip package for enterprise/ folder (#10709 ) * init enterprise pip * init enterprise pip * init enterprise pip * test: enterprise pip * add litellm-enterprise to pip * litellm ent check * litellm ent check * fix import email router * fix setup_litellm_enterprise_pip * fix local testing with enterprise pip	2025-05-09 17:18:48 -07:00
Ishaan Jaff	d9f2415f65	[Refactor - Filtering Spend Logs] Add `status` to root of SpendLogs table (#10661 ) * add status to spendlogs * add status to spendlogs * add status to spendlogs * add status to spendlogs * fix tests with spendlogs	2025-05-08 11:29:25 -07:00
Ishaan Jaff	dd32860d62	[Feat] V2 Emails - Fixes for sending emails when creating keys + Resend API support (#10602 ) * working email integration * fix get_custom_loggers_for_type * add SendKeyCreatedEmailEvent type * bug fix, only send 1 email when creating key for user * polish for emails for key created * polish for key created email * fix test_init_custom_logger_compatible_class_as_callback * testing resend email integration * testing fixes for email integration	2025-05-06 22:50:48 -07:00
Krish Dholakia	500e6cddf5	Fix otel - follow genai semantic conventions + support 'instructions' param for tts (#10608 ) * fix(opentelemetry.py): fix passing otel semantic conventions Fixes SpanAttributes.LLM_PROMPTS to SpanAttributes.LLM_PROMPTS.value * fix(opentelemetry.py): ensure spanattributes always pass the actual enum value * fix(main.py): support passing 'instructions' param for gpt-4o-mini-tts * test: update tests	2025-05-06 21:57:01 -07:00
Ishaan Jaff	489f1a6c25	[Feat] v2 Custom Logger API Endpoints (#10575 ) * fixes for generic api logger * tests for generic api logger * test_generic_api_callback_multiple_logs * allow health checking generic api endpoints * docs generic api endpoint for logging * allow setting headers for generic api callback * fix for test_init_custom_logger_compatible_class_as_callback * fix linting	2025-05-05 16:57:55 -07:00
Ishaan Jaff	3f45200802	test_openai_responses_api_web_search_cost_tracking	2025-05-03 20:22:42 -07:00
Ishaan Jaff	8faa7dbc39	fixes for web search tests	2025-05-03 18:10:30 -07:00
Ishaan Jaff	a37e4cb0b8	fix typo (#10535 )	2025-05-03 11:39:40 -07:00
Ishaan Jaff	988e20aa36	[QA] Bedrock Vector Stores Integration - Allow using with registry + in OpenAI API spec with tools (#10516 ) * refactor KB implementation to use central registry * allow passing tools when making KB calls * test fixes * linting fix * fix kb tests * QA for KB stored in DB * fix, use litellm_credential_name when adding KB on litellm UI * QA list endpoint vector stores * allow using UI creds with KBs	2025-05-03 08:30:38 -07:00
Ishaan Jaff	28cb7cc0ed	[Feat] KnowledgeBase/Vector Store - Log `StandardLoggingVectorStoreRequest` for requests made when a vector store is used (#10509 ) * ensure vector store results are logged in SLP * fix tests * fix tests with vector_store_request_metadata * fix linting	2025-05-02 13:43:20 -07:00
Ishaan Jaff	96e75628d6	[Fix + Refactor] Trigger Soft Budget Webhooks When Key Crosses Threshold (#10491 ) * fix slack alerting with webhooks * emit correct event group/entity on webhooks * refactor to use a common class of alerts with abc methods * fixes for tests * refactor to use a common class of alerts with abc methods * Send a budget alert on slack or webhook * unit test slack alerting * fix code qa	2025-05-02 07:06:07 -07:00
Ishaan Jaff	be885e4a17	[Feat] Vector Stores/KnowledgeBases - Allow defining Vector Store Configs (#10448 ) * init vector store configs * working kb init * add vector store endpoints * use litellm_credential_name * working CRUD vector stores litellm * working creds with vector DB * ui cleanup * clean up vector store id * fix delete button * refactored vector store component * working selector for KBs * ui vector stores * add vector store tool calls usage on chat ui * fixes for vector stores litellm * test fix * docs Knowledge Bases * fixes for vector stores litellm * fix linting * add managed vectorstores * fix orjson ci/cd test * fix linting * add types.tsx file	2025-04-30 21:49:59 -07:00
Ishaan Jaff	f30871ef13	[Feat] Add support for using Bedrock Knowledge Bases with LiteLLM /chat/completions requests (#10413 ) * add make_bedrock_kb_retrieve_request * working bedrock KB hook * working bedrock KB hook * test_openai_with_knowledge_base_mock_openai * fix linting * fix BedrockKnowledgeBaseHook * docs using bedrock kb with litellm * docs kb with litellm * fix bedrock kb test * DynamicPromptManagementParamLiteral * fix _should_run_prompt_management_hooks_without_prompt_id * test_init_custom_logger_compatible_class_as_callback	2025-04-29 17:29:02 -07:00
Ishaan Jaff	34cdefbb77	[Feat] Add logging callback support for /moderations API (#10390 ) * add OpenAIModerationResponse as a type * Add typing for litellm.moderation response obj * Add typing for litellm.moderation response obj * test_moderations_api_logging * add assertions for logged slp object * test_moderations_api_logging * refactor moderations api usage * Revert "test_moderations_api_logging" This reverts commit 5a9001f04c592b063f016d2bb854416255ec6cdf. * Revert "refactor moderations api usage" This reverts commit b076d3174ca14251ee3fc28bd4744e25685c1e5f. * test_moderations_api_logging	2025-04-28 20:24:17 -07:00
Ishaan Jaff	331e784db4	[Feat] Responses API - Add session management support for non-openai models (#10321 ) * add session id in spendLogs * fix log proxy server request as independant field * use trace id for SpendLogs * add _ENTERPRISE_ResponsesSessionHandler * use _ENTERPRISE_ResponsesSessionHandler * working session_ids * working session management * working session_ids * test_async_gcs_pub_sub_v1 * test_spend_logs_payload_e2e * working session_ids * test_get_standard_logging_payload_trace_id * test_get_standard_logging_payload_trace_id * test_gcs_pub_sub.py * fix all linting errors * test_spend_logs_payload_with_prompts_enabled * _ENTERPRISE_ResponsesSessionHandler * _ENTERPRISE_ResponsesSessionHandler * expose session id on ui * get spend logs by session * add sessionSpendLogsCall * add session handling * session logs * ui session details * fix on rowExpandDetails * ui working sessions	2025-04-25 23:24:24 -07:00
Ishaan Jaff	96e31d205c	feat: Added Missing Attributes For Arize & Phoenix Integration (#10043 ) (#10215 ) * feat: Added Missing Attributes For Arize & Phoenix Integration * chore: Added noqa for PLR0915 to suppress warning * chore: Moved Contributor Test to Correct Location * chore: Removed Redundant Fallback Co-authored-by: Ali Saleh <saleh.a@turing.com>	2025-04-22 21:34:51 -07:00
Ishaan Jaff	44264ab6d6	fix failing agent ops test	2025-04-22 14:39:50 -07:00
Krrish Dholakia	b4f2b3dad1	test: update test to be more robust to usage updates	2025-04-19 09:26:26 -07:00
Krish Dholakia	1ea046cc61	test: update tests to new deployment model (#10142 ) * test: update tests to new deployment model * test: update model name * test: skip cohere rbac issue test * test: update test - replace gpt-4o model	2025-04-18 14:22:12 -07:00
Ishaan Jaff	c1a642ce20	[UI] Allow setting prompt `cache_control_injection_points` (#10000 ) * test_anthropic_cache_control_hook_system_message * test_anthropic_cache_control_hook.py * should_run_prompt_management_hooks * fix should_run_prompt_management_hooks * test_anthropic_cache_control_hook_specific_index * fix test * fix linting errors * ChatCompletionCachedContent * initial commit for cache control * fixes ui design * fix inserting cache_control_injection_points * fix entering cache control points * fixes for using cache control on ui + backend * update cache control settings on edit model page * fix init custom logger compatible class * fix linting errors * fix linting errors * fix get_chat_completion_prompt	2025-04-14 21:17:42 -07:00
Ishaan Jaff	f9ce754817	[Feat] Add litellm.supports_reasoning() util to track if an llm supports reasoning (#9923 ) * add supports_reasoning for xai models * add "supports_reasoning": true for o1 series models * add supports_reasoning util * add litellm.supports_reasoning * add supports reasoning for claude 3-7 models * add deepseek as supports reasoning * test_supports_reasoning * add supports reasoning to model group info * add supports_reasoning * docs supports reasoning * fix supports_reasoning test * "supports_reasoning": false, * fix test * supports_reasoning	2025-04-11 17:56:04 -07:00
Ishaan Jaff	94a553dbb2	[Feat] Emit Key, Team Budget metrics on a cron job schedule (#9528 ) * _initialize_remaining_budget_metrics * initialize_budget_metrics_cron_job * initialize_budget_metrics_cron_job * initialize_budget_metrics_cron_job * test_initialize_budget_metrics_cron_job * LITELLM_PROXY_ADMIN_NAME * fix code qa checks * test_initialize_budget_metrics_cron_job * test_initialize_budget_metrics_cron_job * pod lock manager allow dynamic cron job ID * fix pod lock manager * require cronjobid for PodLockManager * fix DB_SPEND_UPDATE_JOB_NAME acquire / release lock * add comment on prometheus logger * add debug statements for emitting key, team budget metrics * test_pod_lock_manager.py * test_initialize_budget_metrics_cron_job * initialize_budget_metrics_cron_job * initialize_remaining_budget_metrics * remove outdated test	2025-04-10 16:59:14 -07:00
Krrish Dholakia	b168f8b744	test: update test	2025-04-10 14:04:57 -07:00
Krish Dholakia	0c5b4aa96d	feat(realtime/): add token tracking + log usage object in spend logs … (#9843 ) * feat(realtime/): add token tracking + log usage object in spend logs metadata * test: fix test * test: update tests * test: update testing * test: update test * test: update test * test: update test * test: update test * test: update tesdt * test: update test	2025-04-09 22:11:00 -07:00
Krish Dholakia	4a128cfd64	Realtime API Cost tracking (#9795 ) * fix(proxy_server.py): log realtime calls to spendlogs Fixes https://github.com/BerriAI/litellm/issues/8410 * feat(realtime/): OpenAI Realtime API cost tracking Closes https://github.com/BerriAI/litellm/issues/8410 * test: add unit testing for coverage * test: add more unit testing * fix: handle edge cases	2025-04-07 16:43:12 -07:00
Ishaan Jaff	b7cd4cef07	test_get_exception_class_name	2025-04-04 21:32:55 -07:00
Ishaan Jaff	df4593d58b	test prom unit tests	2025-04-04 21:30:05 -07:00
Ishaan Jaff	a3df0269bb	fix tests	2025-03-29 17:38:24 -07:00
Ishaan Jaff	047d767947	fix tests for gcs pub sub	2025-03-29 17:06:36 -07:00
Ishaan Jaff	d17ab7da2a	Merge pull request #9508 from BerriAI/litellm_fix_gcs_pub_sub [Fix] Use StandardLoggingPayload for GCS Pub Sub Logging Integration	2025-03-24 18:22:43 -07:00
Ishaan Jaff	36d17c23ac	fix flaky test test_openai_responses_api_web_search_cost_tracking	2025-03-24 15:51:33 -07:00
Ishaan Jaff	6a0ff944e4	assert_gcs_pubsub_request_matches_expected_standard_logging_payload	2025-03-24 15:41:05 -07:00
Ishaan Jaff	43a6c4c635	test_openai_responses_api_web_search_cost_tracking	2025-03-22 17:29:17 -07:00
Ishaan Jaff	81033dc8c9	test_openai_responses_api_web_search_cost_tracking	2025-03-22 17:21:41 -07:00
Ishaan Jaff	beb68037d0	move web search cost tracking	2025-03-22 17:02:43 -07:00
Ishaan Jaff	475dfaa156	fixes for web search cost tracking	2025-03-22 16:56:32 -07:00
Ishaan Jaff	cf22d31b2b	search_context_cost_per_query	2025-03-22 14:52:58 -07:00
Ishaan Jaff	bddbeff717	test_openai_web_search_logging_cost_tracking	2025-03-22 14:29:02 -07:00
Ishaan Jaff	8690873488	test_arize_dynamic_params	2025-03-18 23:22:55 -07:00
Ishaan Jaff	8568caf532	test_arize_dynamic_params	2025-03-18 23:18:07 -07:00
Ishaan Jaff	19a7bfa6b5	test_arize_callback	2025-03-18 18:49:06 -07:00
Ishaan Jaff	00795f2bbf	test arize logging	2025-03-18 18:40:39 -07:00

1 2 3 4

168 Commits