litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-18 03:31:23 +00:00

Author	SHA1	Message	Date
Krish Dholakia	df49b24bc0	Azure - responses api bridge - respect `responses/` + Gemini - generate content bridge - handle kwargs + litellm params containing `stream` (#12224 ) * fix(main.py): handle router custom azure model name for responses api bridge * fix(responses/handler): ensure azure model name is stripped before sending to provider Fixes model name error * fix(google_genai/main.py): handle stream=true being set in kwargs * docs: cleanup icons from sidebar * fix(test-litellm.yml): add google-genai to test litellmyml * fix(main.py): strip 'responses/' from bridge * fix(main.py): fix linting errors * fix(types/openai.py): allow item to be none handle azure streaming response * fix(base.py): allow extra fields + handle azure item = none value in response output item added event * fix(main.py): correctly handle removing responses/ * test(test_main.py): add unit tests	2025-07-02 13:53:52 -07:00
Cole McIntosh	e91802da39	feat: add local LLM translation testing with artifact generation (#12120 ) - Move from CircleCI dependency to direct pytest execution - Add Python script to generate beautiful markdown reports - Update GitHub workflow to run tests directly - Update Makefile to use the new test runner script - Generate both JUnit XML and markdown artifacts - Group test results by provider with detailed statistics	2025-06-27 21:24:19 -07:00
Cole McIntosh	c4f3cc6de2	Enhance CircleCI integration in LLM translation testing workflow. Updated commit SHA retrieval method, improved pipeline search logic, and refined artifact downloading process. Added checks for test workflows and job statuses, with placeholder results creation if no artifacts are found. Updated artifact upload step for clarity.	2025-06-26 10:06:30 -06:00
Cole McIntosh	d5d0dfc26f	Add GitHub Actions workflow for LLM translation testing artifacts	2025-06-26 10:05:33 -06:00
Cole McIntosh	3ab1dfad08	Add GitHub Actions workflow for LLM translation testing artifacts (#11780 ) * Add GitHub Actions workflow for LLM translation testing artifacts * Update LLM translation testing workflow to fetch results from CircleCI and improve timeout settings. The job name has been changed for clarity, and the installation of dependencies has been replaced with CircleCI CLI setup. Placeholder test results are created if no CircleCI artifacts are found.	2025-06-23 09:23:27 -07:00
Krrish Dholakia	e50647627c	build(ghcr_deploy.yml): add rc to all docker images	2025-06-14 17:16:46 -07:00
Krrish Dholakia	a5157978aa	build(ghrc_deploy.yml): add 'rc' release type	2025-06-14 17:15:29 -07:00
Ishaan Jaff	08d6f3e142	Revert "Enhance proxy CLI with Rich formatting and improved user experience (#11420 )" This reverts commit `3b911ba1b2`.	2025-06-06 17:55:45 -07:00
Cole McIntosh	3b911ba1b2	Enhance proxy CLI with Rich formatting and improved user experience (#11420 ) * Enhance proxy CLI with Rich formatting and improved user experience - Integrated Rich library for better console output in `proxy_cli.py`, including version display, health check results, and test completion responses. - Updated health check and test completion methods to provide progress indicators and formatted tables. - Refactored feedback display in `proxy_server.py` to use Rich for a more visually appealing user interface. - Adjusted tests in `test_proxy_cli.py` to mock console output instead of using print statements, ensuring compatibility with Rich formatting. * fix linting error * refactor(proxy_cli.py): simplify DB setup logging - Removed progress indicators for IAM token generation and environment variable decryption to simplify the code. - Consolidated the logic for generating the database URL and setting environment variables. - Enhanced error handling for configuration loading and database setup, ensuring clearer feedback * Update test-linting workflow to include proxy-dev dependencies in Poetry installation * Enhance proxy server initialization with Rich console for improved model display. Added support for loading model parameters from environment variables and refined provider identification logic. Fallback to original print formatting if Rich is not available. * Refactor feedback handling: Moved feedback message generation and custom warning display to utils.py. Enhanced feedback box with rich formatting and fallback to ASCII for environments without rich. Cleaned up proxy_server.py by removing obsolete code. * fix linting error * Refactor model initialization display: Moved model initialization logic to a new utility function `display_model_initialization` for improved readability and maintainability. Enhanced model provider extraction with a dedicated function. Fallback to basic logging if Rich console is unavailable. * Refactor model provider extraction: Replace the `_extract_provider_from_model` function with a more robust approach using `get_llm_provider`. Implement fallback logic for provider identification and improve error handling. Ensure compatibility with Rich console for model initialization display.	2025-06-06 17:16:53 -07:00
மனோஜ்குமார் பழனிச்சாமி	0fd4ee2f94	Increase timeout (#11288 )	2025-05-31 07:31:14 -07:00
Ishaan Jaff	9a6d5c119e	feat: Allow Adding MCP Servers Through LiteLLM UI (#11208 ) * feat: MCP Servers with CRUD operations (#10699) * feat: mcp CRUD operations with authn/authz * feat: mcp server UI * mcp server page with overview, mcp tools, and settings page * Adding MCP Server flow * prisma generate before test * UI callbacks add/remove with api server refetch * test fix: poetry run prisma * feat: mcp server db and config connection * fix: MCPTool filter on description when not present * feat: mcp on UI and integrated with list tools * feat: Update mcp server endpoint * tests: Unit and integration tests for mcp management endpoints * fix: docs and ensuring global_mcp_manage up to date * ui: remove the mcp tools view * fix: ruff lint * fix: unit -> integration test area * fix(ui): remove left nav menu of previous tools --------- Co-authored-by: wagnerjt <wagnerjt@github.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> * fix: sync DB MCP tools with in memory * fix: sync DB MCP tools with in memory * fix: stop using prisma.models * fix: code qa check * fix: import MCP * fix: code QA checks * fix: code QA checks * fixes - only list tools for the specific MCP server * fix: only list MCP tools for selected server * fix linting error --------- Co-authored-by: Tyler Wagner <wagnerjt@users.noreply.github.com> Co-authored-by: wagnerjt <wagnerjt@github.com>	2025-05-28 16:29:27 -07:00
Krish Dholakia	4c82dd9b27	Ollama Chat - parse tool calls on streaming (#11171 ) * fix(user_api_key_auth.py): fix else block Fixes https://github.com/BerriAI/litellm/issues/11170 * refactor(ollama/chat): refactor to base config pattern easier to maintain fixes * fix(ollama/chat): support tool call parsing on streaming Closes https://github.com/BerriAI/litellm/issues/11104 * test: update import location * fix: cleanup unused import * fix: fix ruff check error * test: update import * test: update test on ci * ci: cleanup * fix: fix chekc * fix: fix api key check order * test: fix import * ci: fix script * test: fix imports * fix: fix tests	2025-05-27 16:14:49 -07:00
Krish Dholakia	ef42461c1e	Litellm fix GitHub action testing (#11163 ) * test: add __init__.py files * refactor: rename test folder to avoid naming conflict * test: update workflows * test: update tests * test: update imports * test: update tests * test: remove unused import * ci(test-litellm.yml): add pytest retry to github workflow * test: fix test	2025-05-26 14:41:42 -07:00
Kreato	2e0dcedac0	Proper github images (#10927 ) * feat: add seperate image URLs to distinguish types of release * feat: remove new nightly/dev image URLs, only keep stable	2025-05-23 12:38:00 -07:00
Ishaan Jaff	dd4a65b83a	Feat: add MCP to Responses API and bump openai python sdk (#11029 ) * feat: add MCP to responses API * feat: bump openai version to 1.75.0 * docs MCP + responses API * fixes: type checking * fixes: type checking * build: use latest openai 1.81.0 * fix: linting error * fix: linting error * fix: test * fix: linting errors * fix: test * fix: test * fix: linting * Revert "fix: linting" This reverts commit ebb19ff8cb1f8fcc3e224390e351676daccb33de. * fix: linting	2025-05-22 07:24:10 -07:00
Ishaan Jaff	faed9860c0	[Refactor] Move enterprise_routes within litellm_enterprise (#10860 ) * fix: move enterprise routes to litellm_enterprise * refactor: move enterprise routes to litellm_enterprise * fix: litellm_enterprise routes * fix test litellm on github workflow	2025-05-15 10:34:26 -07:00
Zoltan K	91dcc50768	Github: Increase timeout of litellm tests (#10568 )	2025-05-05 12:37:04 -07:00
Krish Dholakia	9cc39af131	Add vertex ai meta llama 4 support + handle tool call result in content for vertex ai (#10492 ) * refactor(vertex_ai/llama): handle response transformation within config Allows us to handle https://github.com/BerriAI/litellm/issues/10441#issuecomment-2844975599 * fix(vertex_ai/llama): handle tool call in content Fixes https://github.com/BerriAI/litellm/issues/10441 * fix(vertex_ai/llama): return 'tool_calls' as finish reason if tool call returned vertex ai returns stop * feat(vertex_ai/): cost tracking for vertex_ai/meta/llama-4 * ci(test-linting.yml): pin openai version * build: reorder pinning * ci(pyproject.toml): limit openai version temporary patch as new version has linting errors * ci(pyproject.toml): limit openai version temporary patch around linting errors * ci(limit-openai-version): temporary patch * fix: fix linting errors * fix: fix linting error * fix(parallel_request_limiter_v2.py): add team based multi-instance rate limiting * fix: fix linting errors * build(pyproject.toml): modify pin * ci: bump pin	2025-05-01 22:47:06 -07:00
Krrish Dholakia	4e44d7f40c	ci(test-linting.yml): pin openai version	2025-05-01 18:55:39 -07:00
Ishaan Jaff	faf54e3f29	fixes for EE image	2025-04-26 15:43:21 -07:00
Ishaan Jaff	fd3603d4e8	deploy - add build-and-push-image-ee	2025-04-26 14:40:20 -07:00
Krrish Dholakia	611afaf2ab	ci(test-linting.yml): update to run black formatting	2025-03-31 17:03:59 -07:00
Krish Dholakia	9b7ebb6a7d	build(pyproject.toml): add new dev dependencies - for type checking (#9631 ) * build(pyproject.toml): add new dev dependencies - for type checking * build: reformat files to fit black * ci: reformat to fit black * ci(test-litellm.yml): make tests run clear * build(pyproject.toml): add ruff * fix: fix ruff checks * build(mypy/): fix mypy linting errors * fix(hashicorp_secret_manager.py): fix passing cert for tls auth * build(mypy/): resolve all mypy errors * test: update test * fix: fix black formatting * build(pre-commit-config.yaml): use poetry run black * fix(proxy_server.py): fix linting error * fix: fix ruff safe representation error	2025-03-29 11:02:13 -07:00
Krish Dholakia	0865e52db3	fix(proxy_server.py): get master key from environment, if not set in … (#9617 ) * fix(proxy_server.py): get master key from environment, if not set in general settings or general settings not set at all * test: mark flaky test * test(test_proxy_server.py): mock prisma client * ci: add new github workflow for testing just the mock tests * fix: fix linting error * ci(conftest.py): add conftest.py to isolate proxy tests * build(pyproject.toml): add respx to dev dependencies * build(pyproject.toml): add prisma to dev dependencies * test: fix mock prompt management tests to use a mock anthropic key * ci(test-litellm.yml): parallelize mock testing make it run faster * build(pyproject.toml): add hypercorn as dev dep * build(pyproject.toml): separate proxy vs. core dev dependencies make it easier for non-proxy contributors to run tests locally - e.g. no need to install hypercorn * ci(test-litellm.yml): pin python version * test(test_rerank.py): move test - cannot be mocked, requires aws credentials for e2e testing * ci: add thank you message to ci * test: add mock env var to test * test: add autouse to tests * test: test mock env vars for e2e tests	2025-03-28 12:32:04 -07:00
Krrish Dholakia	24b3e80eba	ci: update github action	2025-03-25 23:11:45 -07:00
Krish Dholakia	6cd6ff801f	ci(publish-migrations.yml): add action for publishing prisma db migrations (#9537 )	2025-03-25 17:55:59 -07:00
Ishaan Jaff	165b1887bd	fix docker img deploy - deploy stable releases from main-stable	2025-03-15 20:34:32 -07:00
Ishaan Jaff	f5e9211c1b	fix ghcr build	2025-03-15 20:14:04 -07:00
Ishaan Jaff	4f898c9f48	fix ghcr deploy	2025-03-15 19:37:09 -07:00
Ishaan Jaff	df7efa17f8	fix docker img tag displayed on stable releases	2025-03-15 13:46:30 -07:00
Manuel Cañete	fb4ebf0fd4	ci: add helm unittest	2025-03-08 01:29:25 +01:00
Ishaan Jaff	4032838408	fix load tests on litellm release notes	2025-02-26 19:11:43 -08:00
Ishaan Jaff	bca6e37c24	fix _get_docker_run_command_stable_release	2025-02-25 19:11:30 -08:00
Ishaan Jaff	bfae5c4161	fix naming docker stable release	2025-02-11 20:53:52 -08:00
Ishaan Jaff	022917b7b5	fix stale issue mgmt	2025-01-27 18:56:02 -08:00
Ishaan Jaff	c1a1c052f0	fix stale issue mgmt	2025-01-27 18:53:59 -08:00
Ishaan Jaff	02edf191a3	action for stale (#8045 )	2025-01-27 18:50:58 -08:00
Krrish Dholakia	ed1e3e9dc1	ci(reset_stable.yml): fix to run on release created events	2024-12-28 19:53:18 -08:00
Krrish Dholakia	bb9171e037	ci(reset_stable.yml): modify to work with all kinds of releases	2024-12-21 12:13:26 -08:00
Krrish Dholakia	741500e089	build(reset_stable.yml): rename branch to 'litellm_stable_release_branch' use this branch to trigger load test / other workflows for stable releases	2024-12-19 17:43:37 -08:00
Krrish Dholakia	19e67b8c0e	build(reset_stable.yml): add new workflow to reset litellm_stable to latest release	2024-12-19 17:36:58 -08:00
Ishaan Jaff	c7f14e936a	(code quality) run ruff rule to ban unused imports (#7313 ) * remove unused imports * fix AmazonConverseConfig * fix test * fix import * ruff check fixes * test fixes * fix testing * fix imports	2024-12-19 12:33:42 -08:00
Krrish Dholakia	e5e49f5c49	build: fix test	2024-12-03 12:25:36 -08:00
Krrish Dholakia	7ecbc3beec	build(label-mlops.yml): fix check	2024-12-03 12:23:31 -08:00
Krrish Dholakia	445ba4de73	build(label-mlops.yml): add tag to mlops user requests	2024-12-03 12:20:48 -08:00
Krish Dholakia	66c1ee09cf	ci: remove redundant lint.yml workflow (#6622 )	2024-11-07 01:05:58 +05:30
Ishaan Jaff	45ff74ae81	fix flake8 checks	2024-11-06 10:45:58 -08:00
Krish Dholakia	cc8dd80209	allow configuring httpx hooks for AsyncHTTPHandler (#6290 ) (#6415 ) * allow configuring httpx hooks for AsyncHTTPHandler (#6290) Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> * Fixes and minor improvements for Helm Chart (#6402) * reckoner hack * fix default * add extracontainers option * revert chart * fix extracontainers * fix deployment * remove init container * update docs * add helm lint to deploy step * change name * (refactor) prometheus async_log_success_event to be under 100 LOC (#6416) * unit testig for prometheus * unit testing for success metrics * use 1 helper for _increment_token_metrics * use helper for _increment_remaining_budget_metrics * use _increment_remaining_budget_metrics * use _increment_top_level_request_and_spend_metrics * use helper for _set_latency_metrics * remove noqa violation * fix test prometheus * test prometheus * unit testing for all prometheus helper functions * fix prom unit tests * fix unit tests prometheus * fix unit test prom * (refactor) router - use static methods for client init utils (#6420) * use InitalizeOpenAISDKClient * use InitalizeOpenAISDKClient static method * fix # noqa: PLR0915 * (code cleanup) remove unused and undocumented logging integrations - litedebugger, berrispend (#6406) * code cleanup remove unused and undocumented code files * fix unused logging integrations cleanup * update chart version * add circleci tests --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Xingyao Wang <xingyao@all-hands.dev> * fix: fix linting error * fix(http_handler.py): fix linting error --------- Co-authored-by: Alejandro Rodríguez <alejorro70@gmail.com> Co-authored-by: Robert Brennan <accounts@rbren.io> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Xingyao Wang <xingyao@all-hands.dev>	2024-10-24 22:00:24 -07:00
Ishaan Jaff	5de69cb1b2	fix using Dockerfile	2024-10-08 08:45:40 +05:30
Ishaan Jaff	d742e8cb43	(clean up) move docker files from root to `docker` folder (#6109 ) * fix move docker files to docker folders * move check file length * fix docker hub deploy	2024-10-08 08:23:52 +05:30

1 2 3 4 5 ...

301 Commits