4269 Commits

Author SHA1 Message Date
Michael Tan 6721189b7c feat: add Asia/Bangkok timezone support for budget resets
- Add Asia/Bangkok (UTC+7) to timezone_map in duration_parser.py
- Update documentation to include Bangkok in common timezone values
- Add test case to verify Bangkok timezone functionality
2025-09-18 14:56:12 +07:00
Alexsander Hamir 8d96626044 fix: iscoroutine removed from hot path +50 RPS (#14649)
* fix: iscoroutine removed from hot path

* fix: replace all instances & separate concerns

1. Replaced all instances of iscoroutine with is_async_callable
2. Place the coroutine checker in its own file

* fix: PR comment changes

* fix: missing config setting declaration

* fix: revert non-performance related changes

* fix: revert to initial implementation

* fix: remove dead const
2025-09-17 19:01:43 -07:00
Ishaan Jaffer e2be7e0b1f docs fix posthog 2025-09-17 17:02:23 -07:00
Carlos Marchal e168161e64 Feat/add posthog observability (#14610)
* feat: add posthog observability

* docs: add posthog logging docs

* docs: posthog integration in proxy mode
2025-09-17 08:24:04 -07:00
Krish Dholakia a74fda1159 Merge pull request #14625 from timelfrink/fix/mdx-compilation-error-compactifai
Fix: MDX compilation error in CompactifAI documentation
2025-09-16 23:06:50 -07:00
Tim Elfrink 79d1992de0 Fix: MDX compilation error in CompactifAI documentation
- Replace (<5%) with (under 5%) in two locations to fix JSX parsing
- Resolves webpack build failure in Docusaurus documentation
- Maintains same meaning while avoiding MDX syntax conflicts

Fixes #14624
2025-09-17 07:50:38 +02:00
Krish Dholakia ef14270dec Merge pull request #14448 from vvidovic/propagate_metadata_to_vertexai_labels_docs
Describing the `labels` field use in the Vertex AI
2025-09-16 22:46:28 -07:00
Krish Dholakia 701b4fffee Merge pull request #14600 from mubashir1osmani/fix_all_docs
docs: helicone integration and mcp
2025-09-16 22:37:38 -07:00
Sameer Kankute ab1fb2b2e7 Add Support for Bedrock Guardrails to support selective Guarding (#14575)
* Add Support for Bedrock Guardrails to support selective Guarding

* Add method for better handling

* Add guarded_text content type

* Add guarded_text content type

* Update Dockerfile

* Update Dockerfile
2025-09-16 16:43:34 -07:00
mubashir1osmani a2d5bde837 changed docs 2025-09-16 00:58:08 -04:00
Krish Dholakia 3f90fe938d Merge pull request #14514 from iabhi4/fix-13581
feat(proxy): Assign default budget to auto-generated JWT teams
2025-09-15 21:22:25 -07:00
Krish Dholakia f4e2870490 Merge pull request #14532 from timelfrink/feat/issue-14476-compactifai-provider
Add CompactifAI provider support
2025-09-15 21:15:00 -07:00
Ishaan Jaffer eb3e159b7c docs update 2025-09-15 17:22:03 -07:00
Tim Elfrink 9521414efa Resolve merge conflict by including both CompactifAI and OVHCloud providers
- Keep CompactifAI provider detection logic
- Include new OVHCloud provider from main branch
- Both providers now work correctly with model prefix detection
2025-09-14 23:03:18 +02:00
Tim Elfrink 6ac37093e5 Update CompactifAI model references and move tests to unit test directory
- Update all model references from llama-2-7b-compressed to cai-llama-3-1-8b-slim
- Move CompactifAI tests from tests/llm_translation to tests/test_litellm/llms/compactifai/
- Update documentation examples to use the new model name
- Remove integration test inheritance to make tests pure mock tests

This addresses review feedback to use mock tests and updated model naming.
2025-09-14 23:00:27 +02:00
iabhi4 dc27bccb45 feat(proxy): Assign default budget to auto-generated JWT teams 2025-09-14 12:04:43 -07:00
Krish Dholakia 56fd60b140 Merge pull request #14494 from eliasto/feat/ovhcloud-ai-edpoints-provider
feat: Add OVHCloud AI Endpoints as a provider
2025-09-14 00:45:08 -07:00
Krrish Dholakia 7ef8c808cf docs: update doc 2025-09-14 00:28:11 -07:00
Krish Dholakia 11822e63f1 Merge pull request #14519 from uc4w6c/feat/add_tools_permission_guardrail
feat: add tool-permission guardrail
2025-09-13 23:22:31 -07:00
Ishaan Jaff f37dd6bb95 Litellm 1.77.2 stable notes (#14544)
* fix release notes instructions

* docs v1

* fix doc

* fix highlights

* docs fix

* docs fix
2025-09-13 18:41:34 -07:00
Boopesh Shanmugam 95da14cb96 Docs update on user header mapping (#14527) 2025-09-13 16:58:10 -07:00
Krish Dholakia 269515e525 Merge branch 'main' into litellm_dev_09_12_2025_p1 2025-09-13 10:10:30 -07:00
Krrish Dholakia 9ee9745781 docs(vllm.md): document new endpoint 2025-09-13 09:46:39 -07:00
Krish Dholakia 38efd358eb Merge pull request #14401 from Noma-Security/noma_non_blocking_monitor_mode
Noma non blocking monitor mode & anonymize input support
2025-09-13 09:41:41 -07:00
Krish Dholakia ad9515a81e Merge branch 'main' into litellm_contributor_prs_09_12_2025_p1 2025-09-13 09:38:43 -07:00
Tim Elfrink 1987556a50 Add CompactifAI provider documentation and config
- Create comprehensive provider documentation with usage examples
- Cover basic completion, streaming, async, and function calling
- Document AWS Marketplace subscription and API key setup process
- Include proxy configuration and advanced parameter examples
- Add error handling examples and model information
- Update website sidebar to include CompactifAI in provider list
- Update README.md with CompactifAI provider reference
2025-09-13 08:42:56 +02:00
Krish Dholakia de8cf40ffa Merge pull request #14482 from BerriAI/litellm_dev_09_11_2025_p3
Feature - new `litellm_request_debug=true` flag, enables emitting raw request/response log on single request
2025-09-12 19:56:53 -07:00
Ishaan Jaff 8b3b943557 docs fix 2025-09-12 19:51:14 -07:00
Krrish Dholakia 05e3e3685b docs(forward_client_headers.md): document new forwarding client headers to llm api feature 2025-09-12 19:38:07 -07:00
Ishaan Jaff 93af8fd6ba [QA] E2E - Testing for bedrock batches api (#14525)
* add bedrock/batch-anthropic.claude-3-5-sonnet-20240620-v1:0

* test_bedrock_batches_api

* fix

* fix import

* test_bedrock_batches_api
2025-09-12 19:31:19 -07:00
Ishaan Jaff 20c390dc71 [Docs] - Bedrock Batches API (#14526)
* docs bedrock batch

* docs fix

* docs bedrock batches

* fix Create Batch Request

* docs fix
2025-09-12 19:08:13 -07:00
Krrish Dholakia bdd960c451 docs(admin_ui_sso.md): clarify sso usage on docs 2025-09-12 18:03:03 -07:00
Yuta Saito 1de7f076ac feat: add tool-permission guardrail 2025-09-13 07:34:29 +09:00
Elias TOURNEUX 54f49b430d fix: edit readme to add OVHCloud 2025-09-12 14:29:33 +02:00
Elias TOURNEUX ef9d1ddc40 feat: Add OVHCloud AI Endpoints as a provider 2025-09-12 13:37:03 +02:00
Krrish Dholakia c802c472b5 docs(debugging.md): document new feature
Closes https://github.com/BerriAI/litellm/issues/13814
2025-09-11 20:17:39 -07:00
Ishaan Jaff dda115cc6d [Feat] Cost Tracking - Add support for Tiered Cost Tracking for Qwen API (Dashscope) (#14471)
* add dashscope logo

* docs fix

* docs fix

* fix supports_batch_calling

* fix naming

* fix input_cost_per_audio_token

* use output_cost_per_reasoning_token

* add tiered_pricing in get_model_info

* test fixes

* fix cost calc

* ruff fix
2025-09-11 18:14:39 -07:00
Vedran Vidović e6a6300b70 Describing the labels field use in the Vertex AI 2025-09-11 15:28:55 +02:00
Tom Alon b473344f70 Implement anonymization logic 2025-09-11 11:47:19 +03:00
Ishaan Jaff a13aa4740a [Fixes] Bug fixes to using LiteLLM MCP Gateway (#14392)
* fix: use _get_mcp_servers_in_path

* fix checks for using litellm_proxy as MCP tool provider

* fix: fix mcp_tools_with_litellm_proxy

* fix: fix aresponses_api_with_mcp

* aresponses_api_with_mcp

* test_mcp_allowed_tools_filtering

* fix: _filter_mcp_tools_by_allowed_tools

* fix: _filter_mcp_tools_by_allowed_tools

* test_streaming_responses_api_with_mcp_tools

* fixes: test tools transform MCP->OpenAI spec

* test_streaming_responses_api_with_mcp_tools

* fix: chat ui allow multi select with allowed tools

* fix: use correct MCP events with litellm proxy response API

* fix get_event_model_class

* fix litellm proxy MCP handler

* fix MCPEnhancedStreamingIterator

* chat ui show list tools result

* UI: show MCP events

* fix stream iterator

* fixes: litellm proxy mcp handler

* test responses + mcp

* fix: update responses api with mcp handling

* ruff check fix

* central: _process_mcp_tools_to_openai_format

* fix: refactor code

* test_mcp_allowed_tools_filtering

* test mcp with litellm proxy

* fix mcp call

* demo: video using MCP ui

* fixes for using stream iterator

* test_no_duplicate_mcp_tools_in_streaming_e2e

* docs fix

* fix code snippet
2025-09-10 19:12:11 -07:00
Ishaan Jaff 1f42e41c8d [Bug]: Fix Authorization header not being sent to configured MCP servers (#14422)
* test: test_mcp_server_config_auth_value_header_used

* fix: authentication_token

* docs: fix instructions on using responses api with MCPs

* mcp fixes
2025-09-10 16:41:08 -07:00
Ishaan Jaff 199c262e33 EXPERIMENTAL_MULTI_INSTANCE_RATE_LIMITING 2025-09-10 15:55:27 -07:00
Krish Dholakia d72081113e Merge pull request #14111 from dharamendrak/feature/aiohttp-dependency-injection
feat: Add dependency injection support to BaseLLMAIOHTTPHandler
2025-09-09 22:35:30 -07:00
Krish Dholakia 34c5bc7bb0 Merge pull request #14391 from mubashir1osmani/fix_all_docs
added tags to langchain
2025-09-09 22:15:16 -07:00
mubashir1osmani c20a5b0bbd more info for turn_off_logging 2025-09-10 00:51:37 -04:00
mubashir1osmani f51036e092 added tags to langchain 2025-09-10 00:32:26 -04:00
Krrish Dholakia 98444a900b fix(config_settings.md): document new env var 2025-09-09 19:25:38 -07:00
Krish Dholakia b9ce3a1587 Merge pull request #12416 from dotmobo/feature/fix-alloy
feat: add a health_check_voice parameter in model_info
2025-09-08 23:12:48 -07:00
Krish Dholakia 5ab091f1a4 Merge pull request #14335 from TeddyAmkie/custom-sso-doc-update
Scim docs updated with user removal behavior
2025-09-08 19:04:45 -07:00
Ishaan Jaff ddfbf7c36c docs v1.76.3 2025-09-08 18:37:28 -07:00