Commit Graph

5362 Commits

Author SHA1 Message Date
yuneng-jiang 0bf471b071 Merge pull request #20198 from BerriAI/spend_logs_docs
[Doc] Fixing Image
2026-01-31 15:28:50 -08:00
yuneng-jiang 93dfac78ed doc fix 2026-01-31 15:27:21 -08:00
yuneng-jiang 543e85a5b4 Merge pull request #20197 from BerriAI/spend_logs_docs
[Docs] UI Spend Logs Settings Docs
2026-01-31 15:20:31 -08:00
yuneng-jiang af015fe4f0 UI spend logs setting docs 2026-01-31 15:16:59 -08:00
ryan-crabbe 82383cde74 docs/blog index page (#20188)
* docs: add card-based blog index page for mobile navigation

Fixes #20100 - the blog landing page showed post content directly
instead of an index, with no way to navigate between posts on mobile.

- Swizzle BlogListPage with card-based grid layout
- Featured latest post spans full width with badge
- Responsive 2-column grid with orphan handling
- Pagination, SEO metadata, accessibility (aria-label, dateTime, heading hierarchy)
- Add description frontmatter to existing blog posts

* docs: add deterministic fallback colors for unknown blog tags

* docs: rename blog heading to The LiteLLM Blog
2026-01-31 15:11:45 -08:00
Cesar Garcia 3fa7ab1012 docs(embeddings): add supported input formats section (#20073)
Document valid input formats for /v1/embeddings endpoint per OpenAI spec.
Clarifies that array of string arrays is not a valid format.
2026-01-31 10:45:56 -08:00
shin-bot-litellm 1c757dee14 litellm_docs: add missing environment variable documentation (#20138) 2026-01-31 07:33:24 -08:00
Krrish Dholakia 0edd50fe3c docs: cleanup docs 2026-01-30 19:39:17 -08:00
Ishaan Jaff 5345a763c2 [Feat] v2 - Logs view with side panel and improved UX (#20091)
* init: azure_ai/azure-model-router

* show additional_costs in CostBreakdown

* UI show cost breakdown fields

* feat: dedicated cost calc for azure ai

* test_azure_ai_model_router

* docs azure model router

* test azure model router

* fix transfrom

* Add transform file

* fix:feat: route to config

* v0 - looks decen view

* refactored code

* fix ui

* fixes ui

* complete v2 viewer

* address feedback

* address feedback
2026-01-30 18:34:13 -08:00
Ishaan Jaff 6897d5f59e [Feat] Add async_post_call_response_headers_hook to CustomLogger (#20083)
* Add async_post_call_response_headers_hook to CustomLogger (#20070)

Allow CustomLogger callbacks to inject custom HTTP response headers
into streaming, non-streaming, and failure responses via a new
async_post_call_response_headers_hook method.

* async_post_call_response_headers_hook

---------

Co-authored-by: michelligabriele <gabriele.michelli@icloud.com>
2026-01-30 12:44:44 -08:00
Ishaan Jaff 3910161a02 Realtime API benchmarks (#20074)
* Add /realtime API benchmarks to Benchmarks documentation

- Added new section showing performance improvements for /realtime endpoint
- Included before/after metrics showing 182× faster p99 latency
- Added test setup specifications and key optimizations
- Referenced from v1.80.5-stable release notes

Co-authored-by: ishaan <ishaan@berri.ai>

* Update /realtime benchmarks to show current performance only

- Removed before/after comparison, showing only current metrics
- Clarified that benchmarks are e2e latency against fake realtime endpoint
- Simplified table format for better readability

Co-authored-by: ishaan <ishaan@berri.ai>

---------

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
Co-authored-by: ishaan <ishaan@berri.ai>
2026-01-30 11:55:29 -08:00
Alexsander Hamir 0acfcb494b Add mock client factory pattern and mock support for PostHog, Helicone, and Braintrust integrations (#19707)
* Add LangSmith mock client support

- Create langsmith_mock_client.py following GCS and Langfuse patterns
- Add mock mode detection via LANGSMITH_MOCK environment variable
- Intercept LangSmith API calls via AsyncHTTPHandler.post patching
- Add verbose logging throughout mock implementation
- Update LangsmithLogger to initialize mock client when mock mode enabled
- Supports configurable mock latency via LANGSMITH_MOCK_LATENCY_MS

* Add Datadog mock client support

- Create datadog_mock_client.py following GCS, Langfuse, and LangSmith patterns
- Add mock mode detection via DATADOG_MOCK environment variable
- Intercept Datadog API calls via AsyncHTTPHandler.post and httpx.Client.post patching
- Add verbose logging throughout mock implementation
- Update DataDogLogger and DataDogLLMObsLogger to initialize mock client when mock mode enabled
- Supports both async and sync logging paths
- Supports configurable mock latency via DATADOG_MOCK_LATENCY_MS

* refactor: consolidate mock client logic into factory pattern

- Create mock_client_factory.py to centralize common mock HTTP client logic
- Refactor GCS, Langfuse, LangSmith, and Datadog mock clients to use factory
- Improve GET/DELETE mock accuracy for GCS (return valid StandardLoggingPayload)
- Fix DELETE mock to return empty body (204 No Content) instead of JSON
- Reduce code duplication across integration mock clients

* feat: add PostHog mock client support

- Create posthog_mock_client.py using factory pattern
- Integrate mock client into PostHogLogger with mock mode detection
- Add verbose logging for mock mode initialization and batch operations
- Enable mock mode via POSTHOG_MOCK environment variable

* Add Helicone mock client support

- Created helicone_mock_client.py using factory pattern (similar to GCS)
- Integrated mock mode detection and initialization in HeliconeLogger
- Mock client patches HTTPHandler.post to intercept Helicone API calls
- Uses factory pattern for should_use_mock and MockResponse utilities
- Custom HTTPHandler.post patching required since HTTPHandler uses self.client.send()

* Add mock support for Braintrust integration and extend mock client factory

- Add braintrust_mock_client.py with mock HTTP client for Braintrust integration testing
- Integrate mock client into BraintrustLogger with mock mode detection
- Refactor Helicone mock client to fully utilize factory's HTTPHandler.post patching
- Extend mock_client_factory to support patching HTTPHandler.post for sync calls
- Enable endpoint-specific mock responses for Braintrust (/project vs /project_logs)
- All mock clients now properly handle both async (AsyncHTTPHandler) and sync (HTTPHandler) calls

* Fix linter errors: remove unused imports and suppress complexity warning

- Remove unused imports from gcs_bucket_mock_client.py (httpx, json, timedelta, Dict, Optional)
- Remove unused Callable import from mock_client_factory.py
- Add noqa comment to suppress PLR0915 complexity warning for create_mock_client_factory function

* Document mock environment variables for PostHog, Helicone, Braintrust, Datadog, and Langsmith integrations

- Add POSTHOG_MOCK and POSTHOG_MOCK_LATENCY_MS documentation
- Add HELICONE_MOCK and HELICONE_MOCK_LATENCY_MS documentation
- Add BRAINTRUST_MOCK and BRAINTRUST_MOCK_LATENCY_MS documentation
- Add DATADOG_MOCK and DATADOG_MOCK_LATENCY_MS documentation
- Add LANGSMITH_MOCK and LANGSMITH_MOCK_LATENCY_MS documentation

All mock env vars follow the same pattern: enable mock mode for integration testing by intercepting API calls and returning mock responses without making actual network calls.

* Fix security issue
2026-01-30 09:52:53 -08:00
Sameer Kankute c509ffcea8 Merge pull request #20042 from BerriAI/main
merge main in passthrough
2026-01-30 08:44:00 +05:30
Ishaan Jaff c9658f877e [Docs] Claude Agents SDK x LiteLLM Guide (#20036)
* docs claude agent SDK

* docs fix

* docs

* docs
2026-01-29 18:04:54 -08:00
Sameer Kankute 8808e4d7ac Add /openai_passthrough route for openai passthrough requests: 2026-01-29 16:07:45 +05:30
Alexsander Hamir 69bd4426e8 [Release Day] - Fixed CI/CD issues & changed processes (#19902) 2026-01-28 17:57:24 -08:00
Sameer Kankute f5e5569e40 Merge pull request #19636 from BerriAI/litellm_langfuse_callback
Add litellm_callback_logging_failures_metric for Langfuse, Langfuse Otel and other Otel providers
2026-01-28 18:02:17 +05:30
Sameer Kankute 7386621d04 Merge pull request #19839 from BerriAI/litellm_oss_staging_01_27_2026
Litellm oss staging 01 27 2026
2026-01-28 17:33:27 +05:30
Ishaan Jaff 3080e04180 [Feat] UI: Allow Admins to control what pages are visible on LeftNav (#19907)
* feat: enabled_ui_pages_internal_users

* init ui for internal user controsl

* fix ui settings

* fix build

* fix leftnav

* fix leftnav

* test fixes

* fix leftnav

* isPageAccessibleToInternalUsers

* docs fix

* docs ui viz
2026-01-27 19:31:24 -08:00
Sameer Kankute 42a0d576f3 Merge pull request #19910 from BerriAI/main
merge 01 27
2026-01-28 08:30:47 +05:30
Ryan Wilson 70eb732b41 docs: fix guardrail logging docs (#19833) 2026-01-27 18:13:04 -08:00
Ishaan Jaff fe444f3ed5 [Feat] RAG API - Add support for using s3 Vectors as Vector Store Provider for /rag/ingest (#19888)
* init S3VectorsRAGIngestion as a supported ingestion provider for RAG API

* test: TestRAGS3Vectors

* init S3VectorsVectorStoreOptions

* init s3 vectors

* code clean up + QA

* fix: get_credentials

* S3VectorsRAGIngestion

* TestRAGS3Vectors

* docs: AWS S3 Vectors

* add asyncio QA checks

* fix: S3_VECTORS_DEFAULT_DIMENSION
2026-01-27 14:45:26 -08:00
Harshit Jain 0f0b71e6d9 feat: add feature to make silent calls (#19544)
* feat: add feature to make silent calls

* add test or silent feat

* add docs for silent feat

* fix lint issues and  UI logs

* add docs of ab testing and deep copy
2026-01-27 09:16:53 -08:00
Sameer Kankute 81020d05d2 Merge pull request #19844 from BerriAI/litellm_sarvam_doc
[Docs]Add sarvam usage documentation
2026-01-27 17:47:59 +05:30
Sameer Kankute c8a93c7d81 Merge pull request #19845 from BerriAI/litellm_gemini-robotics-er-1.5-preview2
Add  Gemini Robotics-ER 1.5 preview support
2026-01-27 17:47:44 +05:30
Sameer Kankute c834d7d1fe Merge branch 'main' into litellm_oss_staging_01_27_2026 2026-01-27 17:11:15 +05:30
Sameer Kankute 0214cb04cd Merge branch 'main' into litellm_oss_staging_01_26_2026 2026-01-27 17:00:58 +05:30
Sameer Kankute adf6d7e1db Merge pull request #19692 from BerriAI/litellm_oss_staging_01_24_2026
Litellm oss staging 01 24 2026
2026-01-27 16:59:28 +05:30
Sameer Kankute 9a2750f8ec Merge pull request #19617 from BerriAI/litellm_oss_staging_01_23_2026
Litellm oss staging 01 23 2026
2026-01-27 16:55:32 +05:30
Sameer Kankute bd95712a22 Merge pull request #19832 from BerriAI/litellm_fix_a2a_package
Fix: A2A Python SDK URL
2026-01-27 15:16:52 +05:30
Sameer Kankute e273c85848 Add gemini-robotics-er-1.5-preview model documentation 2026-01-27 13:58:13 +05:30
Sameer Kankute faf9c9ba76 Add sarvam doc 2026-01-27 13:11:29 +05:30
Krish Dholakia 6a54dcfa93 feat: Add model_id label to Prometheus metrics (#18048) (#19678)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2026-01-26 20:32:08 -08:00
Krish Dholakia 7ba0235a50 Litellm release notes 01 26 2026 (#19838)
* docs: document new models/endpoints

* docs: cleanup

* feat: update model table

* fix: cleanup
2026-01-26 20:20:20 -08:00
Krish Dholakia 664715b68e Litellm release notes 01 26 2026 (#19836)
* docs: document new models/endpoints

* docs: cleanup

* feat: update model table
2026-01-26 20:11:56 -08:00
Sameer Kankute c70d04d85f Fix: A2A Python SDK URL 2026-01-27 08:07:36 +05:30
Ishaan Jaff cec1a3c858 [Feat] CLI Auth - Add configurable CLI JWT expiration via environment variable (#19780)
* fix: add CLI_JWT_EXPIRATION_HOURS

* docs: CLI_JWT_EXPIRATION_HOURS

* fix: get_cli_jwt_auth_token

* test_get_cli_jwt_auth_token_custom_expiration
2026-01-26 14:56:17 -08:00
Tamir Kiviti aa8134fee9 add timeout to onyx guardrail (#19731)
* add timeout to onyx guardrail

* add tests
2026-01-25 23:13:46 -08:00
yuneng-jiang 8094aff8c5 Merge pull request #19715 from BerriAI/key_teams_fallback_docs
[Docs] UI Keys Teams Router Settings docs
2026-01-24 16:26:28 -08:00
yuneng-jiang 937ccf1977 UI Keys Teams Router Settings docs 2026-01-24 16:23:46 -08:00
Ishaan Jaffer 241c0c6d2a docs fix 2026-01-24 12:32:18 -08:00
Ishaan Jaffer 6c6ed0dad2 docs fix 2026-01-24 12:11:39 -08:00
Ishaan Jaffer 28a9003103 docs fix 2026-01-24 12:10:42 -08:00
Ishaan Jaffer bbeb007f4e docs fix 2026-01-24 12:09:16 -08:00
Ishaan Jaffer a081dc2ee2 docs fix 2026-01-24 11:41:16 -08:00
houdataali 7ffc1a718a a2a agent Header-Based Context Propagation (#19504)
* a2a agent Header-Based Context Propagation

* docs: add guide for A2A context header forwarding
2026-01-23 19:56:04 -08:00
Harshit Jain f4ba5b9209 docs: add litellm-enterprise requirement for managed files (#19689) 2026-01-23 19:51:39 -08:00
Alexsander Hamir 5c61586e65 Add GCS mock mode for testing without API calls (#19683) 2026-01-23 16:25:32 -08:00
Alexsander Hamir 56883add3c Add Langfuse mock mode for testing without API calls (#19676) 2026-01-23 15:33:40 -08:00
mubashir1osmani 8e060593bf feat(vercel_ai_gateway): add embeddings support
feat(vercel_ai_gateway): add embeddings support
2026-01-23 18:28:25 -05:00