litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-26 23:05:45 +00:00

Author	SHA1	Message	Date
Ishaan Jaffer	7ae980410b	docs fix	2026-02-02 19:50:22 -08:00
Ishaan Jaffer	5aa8725c63	docs Tracing Tools	2026-02-02 19:48:00 -08:00
Sameer Kankute	333419b4d2	Add documentation correctly for nova sonic	2026-02-03 09:03:27 +05:30
Ishaan Jaff	0ef506a54a	Litellm docs mcp filtering semantic (#20316 ) * init: SemanticMCPToolFilter * init: SemanticToolFilterHook * test_e2e_semantic_filter * mock tests: test_semantic_filter_basic_filtering * Update litellm/proxy/_experimental/mcp_server/semantic_tool_filter.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * refactor folder/file organization * docs fix * fix filter * fix: filter_tools * fix linting tool filrer * initialize_from_config * fix: _expand_mcp_tools * _initialize_semantic_tool_filter * working: async_post_call_response_headers_hook * clean up semantic tool filter * add _initialize_semantic_tool_filter * build_router_from_mcp_registry * _get_tools_by_names * fiix config * async_post_call_response_headers_hook * docs mcp filter * docs fix --------- Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2026-02-02 18:29:07 -08:00
Alexsander Hamir	1b9631d260	Add blog post: Achieving Sub-Millisecond Proxy Overhead (#20309 )	2026-02-02 17:46:36 -08:00
shin-bot-litellm	0614ff9fda	docs: add Prisma migration troubleshooting guide (#20300 ) * docs: add Prisma migration troubleshooting guide Add troubleshooting documentation for common Prisma migration errors encountered when upgrading/downgrading LiteLLM proxy versions. Covers: - 'relation does not exist' errors after version rollback - Blocked migrations from previous failures - Migration state mismatch after version rollback - General tips for prisma migrate resolve, db push, and migrate deploy * docs: simplify prisma migration troubleshooting - focus on delete + restart	2026-02-02 14:39:18 -08:00
Ishaan Jaff	73691fb373	Model request tags documentation (#20290 ) * Add request tags documentation for spend tracking - Add new concise doc explaining how to tag model requests - Include Python SDK and cURL examples - Show where tags appear in spend logs - Add common use cases table (AWS accounts, teams, projects) - Include how to set default tags on API keys - Add to Spend Tracking section in sidebar Co-authored-by: ishaan <ishaan@berri.ai> * Simplify request tags doc for AI Gateway usage - Focus on config.yaml setup with default_key_generate_params - Show both request body and header methods for sending tags - Remove SDK examples, keep concise cURL examples - Streamline for quick reference Co-authored-by: ishaan <ishaan@berri.ai> * Update request tags doc to show model-level config - Set tags directly on model deployments in litellm_params - Requests just specify model, tags applied automatically - Use clear naming: AWS_IAM_PROD, AWS_IAM_DEV Co-authored-by: ishaan <ishaan@berri.ai> --------- Co-authored-by: Cursor Agent <cursoragent@cursor.com> Co-authored-by: ishaan <ishaan@berri.ai>	2026-02-02 11:32:00 -08:00
shin-bot-litellm	0a1b98895b	docs: Add FAQ for setting up and verifying LITELLM_LICENSE (#20284 ) * docs: add FAQ for setting up and verifying LITELLM_LICENSE Added two new FAQ entries to the Enterprise docs page: - How to set up your Enterprise License (LITELLM_LICENSE) via .env, Docker, or docker-compose - How to verify the license is active by checking for 'Enterprise Edition' in the Swagger UI * docs: trim license FAQ to essential steps only	2026-02-02 11:03:45 -08:00
Sameer Kankute	7329fa8e7a	fix: litellm_oss_staging_01_31_2026_3 failing tests	2026-02-02 18:50:58 +05:30
Harshit Jain	6d86808eaf	feat: enforce model-level TPM/RPM limits (enforce_model_rate_limits) … (#19230 ) * feat: enforce model-level TPM/RPM limits (enforce_model_rate_limits) flag * fix lint errors	2026-02-02 18:18:46 +05:30
Sameer Kankute	27b4052231	Merge pull request #20244 from BerriAI/litellm_nova_sonic_imp_2_feb [Feat]Add support for nova sonic Speech to speech model	2026-02-02 18:04:02 +05:30
Sameer Kankute	19f6640309	Merge pull request #20249 from BerriAI/litellm_add_0_budget_model_bypass2 [Feat] Add support for 0 cost models	2026-02-02 17:45:05 +05:30
Sameer Kankute	fff35ee032	Merge pull request #20257 from BerriAI/litellm_slack_alert_fix_jfeb_02 Fix: Slack alert issue	2026-02-02 17:43:05 +05:30
Sameer Kankute	72482c0cb5	Fix: Slack alert issue	2026-02-02 16:49:10 +05:30
Sameer Kankute	c6f178eeae	Update Vertex AI Text to Speech doc to show use of audio	2026-02-02 16:15:48 +05:30
Sameer Kankute	14c2932387	Add docs on Zero-Cost Models	2026-02-02 13:29:37 +05:30
Sameer Kankute	ea6c31a02a	Add documentation on nova sonic	2026-02-02 12:19:04 +05:30
shin-bot-litellm	df387c39f6	docs: Update v1.81.6 release notes - focus on Logs v2 with Tool Call Tracing (#20225 ) - Updated title to highlight Logs v2 feature - Simplified Key Highlights to focus on Logs v2 / tool call tracing - Rewrote Logs v2 description with improved language style - Removed Claude Agents SDK and RAG API from key highlights section - TODO: Add image (logs_v2_tool_tracing.png) Co-authored-by: shin-bot-litellm <shin-bot-litellm@users.noreply.github.com>	2026-01-31 19:29:06 -08:00
shin-bot-litellm	93a0631ea3	docs: fix dead links in v1.81.6 release notes (#20218 ) - Fix /docs/search/index -> /docs/search (404 error) - Fix /cookbook/ -> GitHub cookbook URL (404 error) Co-authored-by: shin-bot-litellm <shin-bot-litellm@users.noreply.github.com>	2026-01-31 17:46:56 -08:00
Ishaan Jaffer	8a57ee5efb	docs fix	2026-01-31 17:34:25 -08:00
Ishaan Jaffer	01b96f1272	fixes	2026-01-31 17:15:53 -08:00
yuneng-jiang	0bf471b071	Merge pull request #20198 from BerriAI/spend_logs_docs [Doc] Fixing Image	2026-01-31 15:28:50 -08:00
yuneng-jiang	93dfac78ed	doc fix	2026-01-31 15:27:21 -08:00
yuneng-jiang	543e85a5b4	Merge pull request #20197 from BerriAI/spend_logs_docs [Docs] UI Spend Logs Settings Docs	2026-01-31 15:20:31 -08:00
yuneng-jiang	af015fe4f0	UI spend logs setting docs	2026-01-31 15:16:59 -08:00
ryan-crabbe	82383cde74	docs/blog index page (#20188 ) * docs: add card-based blog index page for mobile navigation Fixes #20100 - the blog landing page showed post content directly instead of an index, with no way to navigate between posts on mobile. - Swizzle BlogListPage with card-based grid layout - Featured latest post spans full width with badge - Responsive 2-column grid with orphan handling - Pagination, SEO metadata, accessibility (aria-label, dateTime, heading hierarchy) - Add description frontmatter to existing blog posts * docs: add deterministic fallback colors for unknown blog tags * docs: rename blog heading to The LiteLLM Blog	2026-01-31 15:11:45 -08:00
Cesar Garcia	3fa7ab1012	docs(embeddings): add supported input formats section (#20073 ) Document valid input formats for /v1/embeddings endpoint per OpenAI spec. Clarifies that array of string arrays is not a valid format.	2026-01-31 10:45:56 -08:00
shin-bot-litellm	1c757dee14	litellm_docs: add missing environment variable documentation (#20138 )	2026-01-31 07:33:24 -08:00
Krrish Dholakia	0edd50fe3c	docs: cleanup docs	2026-01-30 19:39:17 -08:00
Ishaan Jaff	5345a763c2	[Feat] v2 - Logs view with side panel and improved UX (#20091 ) * init: azure_ai/azure-model-router * show additional_costs in CostBreakdown * UI show cost breakdown fields * feat: dedicated cost calc for azure ai * test_azure_ai_model_router * docs azure model router * test azure model router * fix transfrom * Add transform file * fix:feat: route to config * v0 - looks decen view * refactored code * fix ui * fixes ui * complete v2 viewer * address feedback * address feedback	2026-01-30 18:34:13 -08:00
Ishaan Jaff	6897d5f59e	[Feat] Add async_post_call_response_headers_hook to CustomLogger (#20083 ) * Add async_post_call_response_headers_hook to CustomLogger (#20070) Allow CustomLogger callbacks to inject custom HTTP response headers into streaming, non-streaming, and failure responses via a new async_post_call_response_headers_hook method. * async_post_call_response_headers_hook --------- Co-authored-by: michelligabriele <gabriele.michelli@icloud.com>	2026-01-30 12:44:44 -08:00
Ishaan Jaff	3910161a02	Realtime API benchmarks (#20074 ) * Add /realtime API benchmarks to Benchmarks documentation - Added new section showing performance improvements for /realtime endpoint - Included before/after metrics showing 182× faster p99 latency - Added test setup specifications and key optimizations - Referenced from v1.80.5-stable release notes Co-authored-by: ishaan <ishaan@berri.ai> * Update /realtime benchmarks to show current performance only - Removed before/after comparison, showing only current metrics - Clarified that benchmarks are e2e latency against fake realtime endpoint - Simplified table format for better readability Co-authored-by: ishaan <ishaan@berri.ai> --------- Co-authored-by: Cursor Agent <cursoragent@cursor.com> Co-authored-by: ishaan <ishaan@berri.ai>	2026-01-30 11:55:29 -08:00
Alexsander Hamir	0acfcb494b	Add mock client factory pattern and mock support for PostHog, Helicone, and Braintrust integrations (#19707 ) * Add LangSmith mock client support - Create langsmith_mock_client.py following GCS and Langfuse patterns - Add mock mode detection via LANGSMITH_MOCK environment variable - Intercept LangSmith API calls via AsyncHTTPHandler.post patching - Add verbose logging throughout mock implementation - Update LangsmithLogger to initialize mock client when mock mode enabled - Supports configurable mock latency via LANGSMITH_MOCK_LATENCY_MS * Add Datadog mock client support - Create datadog_mock_client.py following GCS, Langfuse, and LangSmith patterns - Add mock mode detection via DATADOG_MOCK environment variable - Intercept Datadog API calls via AsyncHTTPHandler.post and httpx.Client.post patching - Add verbose logging throughout mock implementation - Update DataDogLogger and DataDogLLMObsLogger to initialize mock client when mock mode enabled - Supports both async and sync logging paths - Supports configurable mock latency via DATADOG_MOCK_LATENCY_MS * refactor: consolidate mock client logic into factory pattern - Create mock_client_factory.py to centralize common mock HTTP client logic - Refactor GCS, Langfuse, LangSmith, and Datadog mock clients to use factory - Improve GET/DELETE mock accuracy for GCS (return valid StandardLoggingPayload) - Fix DELETE mock to return empty body (204 No Content) instead of JSON - Reduce code duplication across integration mock clients * feat: add PostHog mock client support - Create posthog_mock_client.py using factory pattern - Integrate mock client into PostHogLogger with mock mode detection - Add verbose logging for mock mode initialization and batch operations - Enable mock mode via POSTHOG_MOCK environment variable * Add Helicone mock client support - Created helicone_mock_client.py using factory pattern (similar to GCS) - Integrated mock mode detection and initialization in HeliconeLogger - Mock client patches HTTPHandler.post to intercept Helicone API calls - Uses factory pattern for should_use_mock and MockResponse utilities - Custom HTTPHandler.post patching required since HTTPHandler uses self.client.send() * Add mock support for Braintrust integration and extend mock client factory - Add braintrust_mock_client.py with mock HTTP client for Braintrust integration testing - Integrate mock client into BraintrustLogger with mock mode detection - Refactor Helicone mock client to fully utilize factory's HTTPHandler.post patching - Extend mock_client_factory to support patching HTTPHandler.post for sync calls - Enable endpoint-specific mock responses for Braintrust (/project vs /project_logs) - All mock clients now properly handle both async (AsyncHTTPHandler) and sync (HTTPHandler) calls * Fix linter errors: remove unused imports and suppress complexity warning - Remove unused imports from gcs_bucket_mock_client.py (httpx, json, timedelta, Dict, Optional) - Remove unused Callable import from mock_client_factory.py - Add noqa comment to suppress PLR0915 complexity warning for create_mock_client_factory function * Document mock environment variables for PostHog, Helicone, Braintrust, Datadog, and Langsmith integrations - Add POSTHOG_MOCK and POSTHOG_MOCK_LATENCY_MS documentation - Add HELICONE_MOCK and HELICONE_MOCK_LATENCY_MS documentation - Add BRAINTRUST_MOCK and BRAINTRUST_MOCK_LATENCY_MS documentation - Add DATADOG_MOCK and DATADOG_MOCK_LATENCY_MS documentation - Add LANGSMITH_MOCK and LANGSMITH_MOCK_LATENCY_MS documentation All mock env vars follow the same pattern: enable mock mode for integration testing by intercepting API calls and returning mock responses without making actual network calls. * Fix security issue	2026-01-30 09:52:53 -08:00
Sameer Kankute	c509ffcea8	Merge pull request #20042 from BerriAI/main merge main in passthrough	2026-01-30 08:44:00 +05:30
Ishaan Jaff	c9658f877e	[Docs] Claude Agents SDK x LiteLLM Guide (#20036 ) * docs claude agent SDK * docs fix * docs * docs	2026-01-29 18:04:54 -08:00
Sameer Kankute	8808e4d7ac	Add /openai_passthrough route for openai passthrough requests:	2026-01-29 16:07:45 +05:30
Alexsander Hamir	69bd4426e8	[Release Day] - Fixed CI/CD issues & changed processes (#19902 )	2026-01-28 17:57:24 -08:00
Sameer Kankute	f5e5569e40	Merge pull request #19636 from BerriAI/litellm_langfuse_callback Add litellm_callback_logging_failures_metric for Langfuse, Langfuse Otel and other Otel providers	2026-01-28 18:02:17 +05:30
Sameer Kankute	7386621d04	Merge pull request #19839 from BerriAI/litellm_oss_staging_01_27_2026 Litellm oss staging 01 27 2026	2026-01-28 17:33:27 +05:30
Ishaan Jaff	3080e04180	[Feat] UI: Allow Admins to control what pages are visible on LeftNav (#19907 ) * feat: enabled_ui_pages_internal_users * init ui for internal user controsl * fix ui settings * fix build * fix leftnav * fix leftnav * test fixes * fix leftnav * isPageAccessibleToInternalUsers * docs fix * docs ui viz	2026-01-27 19:31:24 -08:00
Sameer Kankute	42a0d576f3	Merge pull request #19910 from BerriAI/main merge 01 27	2026-01-28 08:30:47 +05:30
Ryan Wilson	70eb732b41	docs: fix guardrail logging docs (#19833 )	2026-01-27 18:13:04 -08:00
Ishaan Jaff	fe444f3ed5	[Feat] RAG API - Add support for using s3 Vectors as Vector Store Provider for /rag/ingest (#19888 ) * init S3VectorsRAGIngestion as a supported ingestion provider for RAG API * test: TestRAGS3Vectors * init S3VectorsVectorStoreOptions * init s3 vectors * code clean up + QA * fix: get_credentials * S3VectorsRAGIngestion * TestRAGS3Vectors * docs: AWS S3 Vectors * add asyncio QA checks * fix: S3_VECTORS_DEFAULT_DIMENSION	2026-01-27 14:45:26 -08:00
Harshit Jain	0f0b71e6d9	feat: add feature to make silent calls (#19544 ) * feat: add feature to make silent calls * add test or silent feat * add docs for silent feat * fix lint issues and UI logs * add docs of ab testing and deep copy	2026-01-27 09:16:53 -08:00
Sameer Kankute	81020d05d2	Merge pull request #19844 from BerriAI/litellm_sarvam_doc [Docs]Add sarvam usage documentation	2026-01-27 17:47:59 +05:30
Sameer Kankute	c8a93c7d81	Merge pull request #19845 from BerriAI/litellm_gemini-robotics-er-1.5-preview2 Add Gemini Robotics-ER 1.5 preview support	2026-01-27 17:47:44 +05:30
Sameer Kankute	c834d7d1fe	Merge branch 'main' into litellm_oss_staging_01_27_2026	2026-01-27 17:11:15 +05:30
Sameer Kankute	0214cb04cd	Merge branch 'main' into litellm_oss_staging_01_26_2026	2026-01-27 17:00:58 +05:30
Sameer Kankute	adf6d7e1db	Merge pull request #19692 from BerriAI/litellm_oss_staging_01_24_2026 Litellm oss staging 01 24 2026	2026-01-27 16:59:28 +05:30
Sameer Kankute	9a2750f8ec	Merge pull request #19617 from BerriAI/litellm_oss_staging_01_23_2026 Litellm oss staging 01 23 2026	2026-01-27 16:55:32 +05:30

1 2 3 4 5 ...

5383 Commits