litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-17 22:48:35 +00:00

Author	SHA1	Message	Date
Krish Dholakia	67f90254ed	feat(guardrails): team-based guardrail registration and approval workflow (#22459 ) * feat(guardrails): team-based guardrail registration and approval workflow Add team-based guardrail submission system where teams can register Generic Guardrail API guardrails for admin review. Includes: - POST /guardrails/register endpoint for team-scoped submissions - Admin review endpoints (list/get/approve/reject submissions) - Team Guardrails tab in the UI dashboard - extra_headers support for forwarding client headers to guardrail APIs - Prisma schema migration for status, submitted_at, reviewed_at fields - Documentation for team-based guardrails and static/dynamic headers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(guardrails): address review feedback - SSRF, silent failure, redundant query - Validate api_base URL scheme (http/https only) and hostname in register_guardrail to prevent SSRF via team submissions - Return warning field in approve response when in-memory initialization fails so admins know the guardrail won't work until next sync cycle - Eliminate redundant DB query in list_guardrail_submissions by fetching all team guardrails once and deriving both filtered list and summary counts from the single result set Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(guardrails): add pending_review status guard to reject endpoint Prevent rejecting already-active or already-rejected guardrails, which would create a DB/memory inconsistency (active in memory but rejected in DB). Now mirrors the approve endpoint's status check. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-02 22:06:49 -08:00
Krish Dholakia	12c4876891	Agents - assign tools (#22064 ) * feat(proxy): add max_iterations limiter for agent session loops (#22058) Adds a new proxy hook that enforces a per-session cap on the number of LLM calls an agentic loop can make. Callers send a session_id with each request, and the hook counts calls per session, returning 429 when the configured max_iterations limit is exceeded. - Uses Redis Lua script for atomic increment (multi-instance safe) - Falls back to in-memory cache when Redis unavailable - Follows parallel_request_limiter_v3 pattern - Configurable via key metadata: {"max_iterations": 25} - Session counters auto-expire via TTL (default 1hr) Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> * feat: add new code execution dataset * feat(agent_endpoints/): allow giving agents keys * fix: ui fixes * feat: allow assigning mcp servers to agents * fix: eliminate duplicate DB queries in MCP agent auth and N+1 in agent listing (#22110) - Extract _get_agent_object_permission helper so _get_allowed_mcp_servers_for_agent and _get_agent_tool_permissions_for_server share a single DB fetch instead of each independently querying the same agent row (was 1+N queries per MCP request) - Use include={"object_permission": True} on find_many in get_all_agents_from_db to eagerly load permissions in one query instead of N+1 - Use include={"object_permission": True} on create/update/find_unique in all agent CRUD operations, removing attach_object_permission_to_dict follow-up calls Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 11:44:30 -08:00
Ryan Crabbe	e0ddb2a525	fix: guard print_aggregate against empty latencies	2026-02-23 10:06:10 -08:00
Ryan Crabbe	95d9514054	fix: add auth headers and empty latencies guard to benchmark script	2026-02-23 09:58:40 -08:00
Ryan Crabbe	94b76ea9ad	feat: add network_mock transport for benchmarking proxy overhead without real API calls Intercepts at httpx transport layer so the full proxy path (auth, routing, OpenAI SDK, response transformation) is exercised with zero-latency responses. Activated via `litellm_settings: { network_mock: true }` in proxy config.	2026-02-21 17:52:39 -08:00
Alexsander Hamir	7f81dea8b3	Add custom auth header support and increase default prompt size to 100k chars (#19436 )	2026-01-20 13:25:12 -08:00
Alexsander Hamir	270b41b0f4	Simplify file comments (#19382 )	2026-01-19 17:01:38 -08:00
Alexsander Hamir	0cd7763d5f	Add health check scripts and parallel execution support (#19295 ) - Add health_check_client.py for monitoring model availability - Add health_check_client_README.md with usage documentation - Add health_check_requirements.txt for dependencies - Add run_parallel_health_checks.ps1 (PowerShell version) - Add run_parallel_health_checks.sh (Bash version) - Organize all scripts under scripts/health_check/ directory	2026-01-19 08:38:38 -08:00
Xiaohan Fu	07fe9e8604	implement failopen option default to True on grayswan guardrail (#18266 ) * implement failopen option default to True * introduce a config to set the timeout limit (default to 30)	2026-01-06 15:17:05 +05:30
Alexsander Hamir	b635f92d90	Add benchmark_proxy_vs_provider.py script to scripts directory with usage examples (#17889 )	2025-12-12 11:26:34 -08:00
Alexsander Hamir	762b429d6c	enhance: create_litellm_branch tool to be more robust (#17874 )	2025-12-12 05:35:50 -08:00
Cesar Garcia	a7ad8a36a4	chore: cleanup unused scripts and fix misplaced test file (#17611 ) Remove scripts/ directory containing unused development/debug scripts: - mock_ibm_guardrails_server.py - test_groq_streaming_issue.py (debug for #12660) - test_mock_ibm_guardrails.py - update_readme_providers_table.py Move misplaced test file to correct location: - test_litellm/ -> tests/test_litellm/ (from PR #17221)	2025-12-08 19:00:55 -08:00
Alexsander Hamir	c44e075b2d	feat: add script to create branches with litellm_ prefix (#17606 ) Add utility scripts to create branches with litellm_ prefix from contributor branches. This helps maintain consistent branch naming conventions for CI/CD. - scripts/create_litellm_branch.sh (Bash for macOS/Linux) - scripts/create_litellm_branch.ps1 (PowerShell for Windows) Usage: ./scripts/create_litellm_branch.sh [source_branch] [new_branch_name] ./scripts/create_litellm_branch.ps1 [source_branch] [new_branch_name] Features: - Auto-prefixes branch names with litellm_ - Handles existing branches gracefully - Validates branch names - Supports local and remote source branches	2025-12-06 10:41:39 -08:00
Rob Geada	d35d9008c9	Ensure detector-id is passed as header to IBM detector server (#16649 )	2025-11-14 19:35:49 -08:00
Ishaan Jaff	0428229032	[Docs] readme fixes add supported providers (#16109 ) * add provider test * docs readme.md * docs providers * order providers * test_providers_alphabetically_ordered * docs endpoint * fix config * add ENDPOINT_COLUMNS * add provider endpoints * docs fix	2025-10-30 17:23:20 -07:00
Krish Dholakia	ddacaf6c32	(feat) Organizations: allow org admins to create teams on UI + (feat) IBM Guardrails (#15924 ) * fix(oldteams.tsx): allow org admin to create team on ui * fix(oldteams.tsx): show org admin a dropdown of allowed orgs for team creation * docs(access_control.md): cleanup doc * feat(ibm_guardrails/): initial commit adding support for ibm guardrails on litellm allows user to use self-hosted ibm guardrails * feat(ibm_detector.py): working detector * docs(ibm_guardrails.md): document new ibm guardrails * fix: fix linting errors	2025-10-25 11:13:39 -07:00
Cole McIntosh	000ecad4e2	Fix Groq streaming ASCII encoding issue Replace iter_lines()/aiter_lines() with iter_text()/aiter_text() using explicit UTF-8 encoding to handle non-ASCII characters like µ in streaming responses. - Added utf8_iter_lines() and utf8_aiter_lines() helper functions - Ensures proper UTF-8 decoding of streaming response content - Added comprehensive tests for Unicode character handling Fixes #12660	2025-08-16 08:32:22 -05:00

17 Commits