litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-18 07:33:58 +00:00

Author	SHA1	Message	Date
yuneng-jiang	195c0ee54d	Merge pull request #23917 from BerriAI/litellm_/loving-noyce [Fix] Add contents:write permission to ghcr_deploy release job	2026-03-17 17:27:36 -07:00
yuneng-jiang	b8ffbba352	[Fix] Add contents:write permission to release job in ghcr_deploy workflow The release job was failing with "Resource not accessible by integration" because other jobs explicitly set permissions, causing GitHub to scope the default token down for all jobs. The release job needs contents:write to create GitHub releases. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 17:23:39 -07:00
codspeed-hq[bot]	be20a8a93d	Add CodSpeed performance benchmarks (#23676 ) Co-authored-by: codspeed-hq[bot] <117304815+codspeed-hq[bot]@users.noreply.github.com>	2026-03-14 18:44:36 -07:00
Chesars	0fc407cfdd	ci: exclude enterprise/ from black --check in linting workflow Contributors don't have local access to enterprise/ files, so the check would always fail on unformatted enterprise code.	2026-03-12 14:27:00 -03:00
Cesar Garcia	f79744cee2	Merge pull request #18648 from Chesars/fix-black-check-ci fix: check Black formatting in CI instead of auto-formatting	2026-03-12 14:24:37 -03:00
yuneng-jiang	202b5b29de	Add daily internal dev branch creation job Adds a new job to the existing daily staging branch workflow that creates a `litellm_internal_dev_MM_DD_YYYY` branch from main twice a day. This branch serves as a staging area before merging into main to improve stability. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 15:53:42 -07:00
Joe Reyna	40210ce750	fix(codeql): remove ruby from language matrix (#23227 )	2026-03-10 20:45:23 -07:00
Sameer Kankute	0ee4d90d7e	Fix enterpise bump yml	2026-03-09 16:43:40 +05:30
Sameer Kankute	4d92c720c7	Fix enterpise bump yml	2026-03-09 16:39:38 +05:30
Sameer Kankute	a52a4fd28a	fix(enterprise): create PR for version bump instead of pushing to protected main Made-with: Cursor	2026-03-09 16:31:27 +05:30
Julio Quinteros Pro	512a5fa3c7	Merge pull request #22788 from BerriAI/fix/azure-batches-add-tenacity-ci Add tenacity to e2e Azure batch CI and revert importorskip	2026-03-04 11:50:44 -03:00
Julio Quinteros Pro	75b2e40cd3	Remove incompatible openai==1.100.1 pin from linting CI The linting workflow force-installed openai==1.100.1 which conflicts with litellm's requirement of openai>=2.8.0, causing pip dependency resolver errors and CI cancellation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-04 11:46:31 -03:00
Julio Quinteros Pro	aa62ddaf0a	Add tenacity to e2e Azure batch CI and revert importorskip PR #22785 used pytest.importorskip which causes exit code 5 (all skipped) in CI. Instead, add tenacity to the CI workflow pip install and restore direct imports. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-04 11:45:14 -03:00
Sameer Kankute	213bf11ede	Merge pull request #22763 from BerriAI/litellm_test_e2e_batches_test feat(tests): add proxy e2e azure batches test	2026-03-04 18:28:52 +05:30
Sameer Kankute	7b6a972fed	Add this test in cicd	2026-03-04 17:21:00 +05:30
Sameer Kankute	49738bb3e3	ci: add proxy e2e azure batches workflow - Run test_e2e_managed_batch with -vv -s for terminal output on failure - PostgreSQL, Poetry, Prisma setup - Upload logs as artifact on failure Made-with: Cursor	2026-03-04 17:15:33 +05:30
Chesars	dad7805b42	fix(deps): update python-multipart version to 0.0.22 in all files Align requirements.txt, CI workflow, liccheck, and license cache with the >=0.0.22 constraint already set in pyproject.toml.	2026-03-03 15:09:33 -03:00
Julio Quinteros Pro	2f6298d00f	Fix observatory tunnel flaky DNS and suppress PLR0915 in router The observatory test workflow failed because the "Verify tunnel connectivity" step used a single curl with no retries. Cloudflare quick tunnels need time for DNS propagation, and the first lookup can return NXDOMAIN (curl exit 6). Replace with a retry loop (10 attempts, 5s apart) matching the pattern already used in the health check step. Also add `# noqa: PLR0915` to `_completion_streaming_iterator` in router.py, matching the suppression already on its async twin. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 17:45:49 -03:00
Julio Quinteros Pro	cc0b1323d7	Fix observatory checkout failing on commit hash ref actions/checkout treats short commit hashes as branch names, causing fetch failures. The checkout only needs the config file from the repo, so use the default branch instead of a specific ref. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 17:05:49 -03:00
Julio Quinteros Pro	b40b1e6a4b	Fix invalid secrets context in test-linting workflow The secrets context is not available in step-level if: conditions, causing the workflow file to fail validation. Move the conditional check into the shell script instead. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 16:54:53 -03:00
Julio Quinteros Pro	369edb2afb	Move all secrets to env blocks instead of direct interpolation Pass AZURE_API_KEY, AZURE_API_BASE, OBSERVATORY_URL, OBSERVATORY_API_KEY, and REQUEST_ID through step-level env blocks so they are never interpolated directly into shell scripts. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 16:34:10 -03:00
Julio Quinteros Pro	a24ba226ba	Validate tag input and add explicit cleanup step - Validate inputs.tag matches vX.Y.Z format to prevent script injection via workflow_dispatch - Pass tag via env var instead of direct interpolation in shell - Add cleanup step to kill cloudflared and remove docker container Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 16:19:30 -03:00
Julio Quinteros Pro	a2946e2cc8	Add job timeout and use jq for safe JSON construction - Add timeout-minutes: 30 to prevent runaway jobs - Build /run-test payload with jq --arg to safely escape TUNNEL_URL and LITELLM_MASTER_KEY values Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 15:30:09 -03:00
Julio Quinteros Pro	7a46aaff2b	Pin cloudflared to v2025.2.1 for reproducible builds Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 15:30:09 -03:00
Julio Quinteros Pro	58264aadb7	Validate request_id before polling Fail early if request_id is missing or null from the /run-test response instead of polling /run-status/null for 15 minutes. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 15:30:09 -03:00
Julio Quinteros Pro	b4e0c4db07	Use temp file for JSON result passing between steps Avoids shell quoting issues with single quotes in JSON and multi-line output truncation when using GITHUB_OUTPUT. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 15:30:09 -03:00
Julio Quinteros Pro	1fdaa1588d	Address PR review comments on observatory workflow - Add permissions block (contents: read) per GitHub security scan - Poll /run-status/{request_id} instead of global /queue-status to avoid race conditions with concurrent test runs - Add result verification step that fails the workflow if tests did not pass or the run errored - Fix auth header to use X-LiteLLM-Observatory-API-Key Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 15:30:09 -03:00
Julio Quinteros Pro	d7dd7ef33b	Add observatory test workflow for RC/stable releases - New reusable workflow that spins up a LiteLLM container from the release image, exposes it via cloudflared tunnel, and triggers test runs on the Railway-hosted observatory - Integrates into ghcr_deploy.yml for RC and stable releases - Can also be triggered manually via workflow_dispatch - Add placeholder litellm_config.yaml for observatory test models Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 15:30:09 -03:00
Julio Quinteros Pro	bc9c28eb80	Merge pull request #22397 from BerriAI/fix/codeql-custom-workflow fix(ci): replace default CodeQL with custom workflow to unblock CI	2026-02-28 17:19:42 -03:00
Ishaan Jaff	b5f5b42035	bump: litellm-enterprise 0.1.32 → 0.1.33 + manual publish workflow (#22421 ) * bump: litellm-enterprise 0.1.32 → 0.1.33 * ci: add manual workflow to publish litellm-enterprise to PyPI * Apply suggestion from @greptile-apps[bot] Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * Apply suggestion from @greptile-apps[bot] Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * ci: add manual workflow to publish litellm-proxy-extras to PyPI * fix(ci): commit before publish, add poetry.lock update to enterprise + proxy-extras workflows --------- Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2026-02-28 10:56:15 -08:00
Julio Quinteros Pro	ce0753243b	Merge pull request #22193 from BerriAI/test/secret-scan-ci test(ci): add secret scan test and CI job	2026-02-28 14:05:55 -03:00
Chesars	10a91c5199	fix(ci): remove duplicate env key in scan_duplicate_issues workflow The greptile suggestion in #22034 was applied without removing the original env block, leaving a duplicate env key that makes the YAML invalid. GitHub fails to parse the workflow on every push to main, creating failed run entries ("No jobs were run").	2026-02-28 13:27:59 -03:00
Cesar Garcia	7f5c8653f0	Merge pull request #18478 from Chesars/fix/prevent-scheduled-workflow-in-forks fix: update_price_and_context_window workflow from running in forks	2026-02-28 13:10:15 -03:00
Julio Quinteros Pro	d7340b595b	Update .github/workflows/codeql.yml Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2026-02-28 12:16:42 -03:00
Julio Quinteros Pro	53f3123030	fix(ci): add custom CodeQL workflow to replace expensive default setup The default CodeQL setup runs all 45 Python security queries against the entire codebase. Two queries (CleartextLogging, PolynomialReDoS) produce result sets > 2 GiB, causing 49+ minute runs that fail and block CI. - Add custom workflow with 30-minute timeout and concurrency limits - Exclude py/clear-text-logging-sensitive-data (CWE-312) - Exclude py/polynomial-redos (CWE-730) - Skip scanning tests/, docs/, and UI build output NOTE: The Default Setup must be disabled in repo Settings > Code security before merging, otherwise both will run simultaneously. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 11:40:22 -03:00
Julio Quinteros Pro	5a28ca985c	Update .github/workflows/scan_duplicate_issues.yml Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2026-02-28 00:17:31 -03:00
Julio Quinteros Pro	94b7342da8	Update .github/workflows/check_duplicate_issues.yml Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2026-02-28 00:17:22 -03:00
Julio Quinteros Pro	1c376afc85	fix(ci): use secrets context in ggshield step condition Step-level env is not visible to the if condition — reference secrets directly so ggshield actually runs when the key is configured. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:51:28 -03:00
Julio Quinteros Pro	05c3a95da8	fix(ci): add permissions block to secret-scan job Address github-advanced-security bot review comment by setting explicit minimal permissions (contents: read) for the GITHUB_TOKEN. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:48:43 -03:00
Julio Quinteros Pro	2fce35a162	test(ci): add secret scan test and CI job to prevent hardcoded credentials - Add unit test that scans Python source for Base64 Basic Auth patterns that would be flagged by secret scanners like GitGuardian/ggshield - Add secret-scan job to the linting CI workflow that runs the test on every PR and optionally runs ggshield if GITGUARDIAN_API_KEY is set Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:46:42 -03:00
Julio Quinteros Pro	db3d61f433	feat(ci): add duplicate issue detection and auto-close bot Add a Python script that detects duplicate issues using title similarity (difflib.SequenceMatcher) and closes them via the gh CLI. Two-tier system: - 0.6 threshold: informational comment via existing wow-actions step - 0.85 threshold: auto-close with comment, label, and not_planned reason Includes a workflow_dispatch workflow for one-time batch scans and integrates auto-close into the existing check_duplicate_issues workflow for newly opened issues. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 16:49:06 -03:00
Ryan Crabbe	079ff24d78	Revert duplicate issue checker to text-based matching, remove duplicate PR workflow Remove the Claude Code-powered duplicate PR detection workflow and revert the duplicate issue checker back to wow-actions/potential-duplicates with text similarity matching.	2026-02-23 15:28:13 -08:00
Krrish Dholakia	a26f83fd3c	fix: update calendly on repo	2026-02-23 06:13:59 -08:00
Ryan Crabbe	c7d3198d9a	fix: pass prompt as env var in duplicate detection workflows Fixes "Input must be provided either through stdin or as a prompt argument" error by moving the prompt to a PROMPT env variable instead of inline multiline shell string.	2026-02-21 14:29:08 -08:00
Ryan Crabbe	1d0f91010b	feat: switch duplicate detection workflows from opencode to Claude Code Route through LiteLLM proxy using LITELLM_VIRTUAL_KEY and LITELLM_BASE_URL secrets. Also adds --repo flag to all gh commands to fix missing repo context.	2026-02-20 17:51:12 -08:00
yuneng-jiang	deeaae7e10	Merge pull request #21606 from BerriAI/litellm_ai-duplicate-issue-detection feat: upgrade duplicate issue detection to be AI-powered instead of title text	2026-02-20 09:48:32 -08:00
Julio Quinteros Pro	b551b98b26	ci: further split b2/b3 to isolate single heavy files Isolate the two dominant files so they no longer block smaller tests: - proxy-unit-b2: test_proxy_server.py alone (2750 lines) - proxy-unit-b3: test_proxy_server_.py + test_proxy_setting_guardrails.py (618 lines) - proxy-unit-b4: test_proxy_utils.py alone (2339 lines) - proxy-unit-b5: test_proxy_token_counter.py (1279 lines) - proxy-unit-b6: test_[r-t].py (renamed from b4, 1988 lines) - proxy-unit-b7: test_[u-z]*.py (renamed from b5, 2394 lines) Matrix grows from 18 → 20 jobs. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 14:08:25 -03:00
Julio Quinteros Pro	c8ddbd90d1	ci: rebalance matrix groups based on actual timings Split the two slowest groups based on measured wall-clock times: - proxy-unit-b2 (was 7m15s, test_proxy_[s-z]): → proxy-unit-b2: test_proxy_s.py (server + setting_guardrails, ~3368 lines) → proxy-unit-b3: test_proxy_[t-z].py (utils + token_counter, ~3618 lines) - proxy-unit-b3 (was 4m30s, test_[r-z]): → proxy-unit-b4: test_[r-t].py (response_polling + search + skills + realtime, ~1988 lines) → proxy-unit-b5: test_[u-z].py (user_api_key_auth + zero_cost + update_spend + unit_tests, ~2394 lines) proxy-unit-a2 (6m15s) will self-resolve once PR #21679 merges (55 skip markers added to test_key_generate_prisma.py). Matrix grows from 16 → 18 jobs; all groups expected ≤ 3-4m. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 14:08:25 -03:00
Julio Quinteros Pro	1572162fdc	ci: split slow test matrix groups to reduce wall-clock time Three groups were bottlenecking CI (proxy-unit-b: 15min, other: 20+min, proxy-unit-a: 6min). Split each into smaller parallel jobs based on actual line counts of the test files. proxy-unit-a (6min) → proxy-unit-a1 + proxy-unit-a2 - a1: test_[a-j].py (jwt 1564, auth_checks 978, google_gemini 478, ...) - a2: test_[k-o].py (key_generate_prisma 4346, ...) proxy-unit-b (15min) → proxy-unit-b1 + proxy-unit-b2 + proxy-unit-b3 - b1: prisma/project/prompt + test_proxy_[c-r].py (config, custom, routes, ...) - b2: test_proxy_[s-z].py (proxy_server 2745, proxy_utils 2339, proxy_token_counter 1276) - b3: test_[r-z]*.py (response_polling 1399, user_api_key_auth 1136, ...) other (20+min) → other-1 + other-2 + other-3 - other-1: responses (5942) + caching (1723) + types (819) ≈ 8.5k lines - other-2: enterprise (3062) + google_genai (2511) + router_utils (1982) ≈ 7.6k lines - other-3: remaining 11 dirs ≈ 8.0k lines Total matrix jobs: 11 → 16. No test files are added, removed, or skipped. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 14:08:25 -03:00
Ryan Crabbe	dc13378505	feat: add AI-powered duplicate PR detection via opencode Same approach as the duplicate issue detector — uses opencode run with gh pr commands to find potentially duplicate open PRs when external contributors open new PRs. Skips core team and bots.	2026-02-19 17:51:44 -08:00

1 2 3 4 5 ...

437 Commits