Commit Graph

5148 Commits

Author SHA1 Message Date
Ishaan Jaff cbac70a4ec MANUS docs (#18817) 2026-01-08 18:58:10 +05:30
Sameer Kankute af6883712e Add tests for kimi 2 bedrock model 2026-01-08 10:07:33 +05:30
Alexsander Hamir 98d7a428b6 Fix: Clarify database_connection_pool_limit applies per worker, not per instance (#18780) 2026-01-07 13:57:03 -08:00
Krish Dholakia c89a4115eb Update production proxy resource recommendations (#18771)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2026-01-08 00:37:12 +05:30
Ishaan Jaff 929af510fa [Feat] New provider - Add Azure BFL FLux for image edits (#18766)
* add azure_ai/flux.2-pro

* get_flux2_image_generation_url

* azure_client_params

* docs

* add Image Editing

* add azure ai image edits

* AzureFoundryFlux2ImageEditConfig

* TestAzureAIFlux2ImageEdit
2026-01-07 23:28:39 +05:30
Ishaan Jaff 3430325919 [Feat] Add Azure BFL - Flux 2 models (#18764)
* add azure_ai/flux.2-pro

* get_flux2_image_generation_url

* azure_client_params

* docs
2026-01-07 23:28:28 +05:30
Krish Dholakia 80ead21c3a Litellm improve endpoint discovery (#18762)
* docs: document all endpoints in .json and add consistency checks against docs + providers.json

* docs: add more tests + improve coverage
2026-01-07 17:35:01 +05:30
Ishaan Jaffer 3530218930 fix aupload_container_file 2026-01-07 14:19:36 +05:30
Ishaan Jaff 1f141f0dbb [Feat] Litellm new endpoint add container file upload (#18743)
* init upload_container_file

* init upload_container_file

* _prepare_multipart_file_upload

* fix upload_container_file

* aupload_container_file, upload_container_file

* register_container_file_endpoints
2026-01-07 13:36:55 +05:30
Yuta Saito e5be160ae0 docs: sets ARIZE_PROJECT_NAME 2026-01-07 14:27:16 +09:00
Goutham Karthi 2865b17988 adding signoz integration to observability docs 2026-01-06 15:29:44 -08:00
Alexsander Hamir bb4c01ffa0 Add LITELLM_DISABLE_LAZY_LOADING env var to fix VCR cassette creation issue (#18725) 2026-01-06 14:49:11 -08:00
Otavio Brito 18ca6b2add Handle not supported region for vertex ai count tokens - v1/messages/count_tokens (#18665)
* Handle not supported region for vertex ai count tokens

* add unit test
2026-01-07 00:05:00 +05:30
Lundin Matthews 762345172c Add LlamaGate as a new provider (#18673)
Adds LlamaGate (https://llamagate.dev) as an OpenAI-compatible provider with:
- Provider configuration in providers.json
- Documentation page with usage examples
- Model pricing for 17 models across categories:
  - General purpose (Llama 3.1/3.2, Mistral, Qwen, Dolphin)
  - Reasoning (DeepSeek R1, OpenThinker)
  - Code (Qwen Coder, DeepSeek Coder, CodeLlama)
  - Vision (Qwen VL, LLaVA, Gemma 3)
  - Embeddings (Nomic, Qwen3 Embedding)

Provider details:
- Base URL: https://api.llamagate.dev/v1
- Auth: Bearer token via LLAMAGATE_API_KEY
- Pricing: $0.02-$0.55 per 1M tokens
- All models are open-weights
2026-01-07 00:00:30 +05:30
Kazuki Matsumaru 26bdf7b7a8 Remove redundant comments about setting litellm.callbacks (#18711)
- Removed duplicate comment in test_router_endpoints.py
- Removed duplicate comment in logging.md
- Kept clearer comment: 'Set litellm.callbacks = [proxy_handler_instance] on the proxy'
2026-01-06 23:58:01 +05:30
Pascal Bro cac2a8d158 Fix/gcs cache docs missing for proxy mode (#13328)
* fixed issues with gcs cache to verify functionality

* restore changes

* Fix capitalization of 'S3 Bucket Cache'

---------

Co-authored-by: Nelson Alfonso <45660392+Dashing-Nelson@users.noreply.github.com>
2026-01-06 23:49:08 +05:30
Sameer Kankute 1c177a576b Add the LITELLM_REASONING_AUTO_SUMMARY in doc 2026-01-06 18:04:09 +05:30
Sameer Kankute 353769d2ac Merge pull request #18688 from BerriAI/litellm_fix_reasoning_summary_default
[Fix] Put automatic reasoning summary inclusion behind feat flag
2026-01-06 17:56:40 +05:30
Ishaan Jaff 76eda472be [Feat] New API Endpoint - Responses API (v1/responses/compact) (#18697)
* init transform_compact_response_api_request

* init acompact_responses

* init async_compact_response_api_handler in llm http handler

* init transform_compact_response_api_request for openai

* init acompact_responses

* fix acompact_responses

* add OAI Compact API

* docs responses API Compact

* code qa checks

* test_openai_compact_responses_api

* fix mypy linting
2026-01-06 16:24:04 +05:30
Sameer Kankute bb00a53786 Put reasoning summary behind feat flag 2026-01-06 11:36:20 +05:30
0717376 12f02f6c54 feat: Add GigaChat provider support (#18564)
* feat: Add GigaChat provider support

Add native support for GigaChat API (Sber AI, Russia's leading LLM).

Supported features:
- Chat completions (sync/async)
- Streaming (sync/async)
- Function calling / Tools
- Structured output via JSON schema (emulated through function calls)
- Image input (base64 and URL)
- Embeddings

Closes #18515

* fix: resolve mypy type errors in GigaChat handler

- Fix _prepare_file_data return type (use 3-tuple for cleaner type flow)
- Add type annotations for lists in _process_content_parts methods
- Add type annotations in _collapse_user_messages
- Use ChatCompletionToolCallChunk for proper tool_use typing
- Add type: ignore[override] for astreaming async generator

* refactor(gigachat): migrate to BaseConfig pattern

* fix: remove unused imports

* fix: resolve mypy type errors

* fix: mypy type errors

* refactor: address review feedback for GigaChat provider

- Remove singleton pattern, reuse litellm HTTPHandler
- Move constants/errors to transformation files, delete common_utils.py
- Add models to model_prices_and_context_window.json
- Fix ssl_verify not passed to HTTP client for embeddings

* docs: update GigaChat documentation with ssl_verify requirement
2026-01-06 10:10:02 +05:30
YutaSaito 31470032ad Merge pull request #18681 from BerriAI/litellm_feat_mcp-server-visibility-configurable
[feat] mcp server visibility configurable
2026-01-06 12:53:07 +09:00
Yuta Saito 29e658012a docs: add user_mcp_management_mode 2026-01-06 11:43:58 +09:00
Yuta Saito 375f3ed439 fix: missing mcp_allow_all_ui.png 2026-01-06 11:23:22 +09:00
YutaSaito 100f617a8f Merge pull request #18668 from BerriAI/litellm_feat_noma_mcp_guardrail
[feat] add support MCP guardrail to noma
2026-01-06 07:01:57 +09:00
Yuta Saito bdd95314a7 docs: mention MCP prompt/resources types in overview 2026-01-06 06:29:37 +09:00
Yuta Saito 4eecf59f86 feat: add support MCP guardrail to noma 2026-01-06 06:15:56 +09:00
drorIvry b6a64ff99a feature/ add qualifire guardrails (#18594)
* init guardrails

* init guardrails

* some fixes

* some fixes

* ruff

* some fixes

* some fixes

* some fixes

* some fixes

* some fixes

* some fixes

* docs
2026-01-06 01:34:43 +05:30
orgersh92 d8d10f5e25 lasso-guardrail-api docs (#18652) 2026-01-05 20:58:26 +05:30
Ishaan Jaffer f752ec3eaf fix 2026-01-05 20:25:39 +05:30
Ishaan Jaff bf1c5bef59 [UI] Add view for estimating costs across requests (#18645)
* add estimate_cost endpoint

* TestCostEstimateEndpoint

* fix estimate_cost

* add /cost/estimate to spend tracking routes

* fix code QA checks

* fixes endpoint

* v0 cost estimator

* v0 cost estimator

* formatNumberWithCommas

* fix admin view

* docs

* docs fix + export PDF/CSV

* fixes for export
2026-01-05 19:19:58 +05:30
Yuta Saito fee9553c51 feat: add allow_all_keys doc 2026-01-05 16:56:19 +09:00
amangupta-20 399579f8ea feat: Add Levo AI integration (#18529) 2026-01-05 08:49:21 +05:30
cantalupo555 9b1c5f7e36 feat(zai): Add GLM-4.7 model with reasoning support (#18476)
Add support for Z.AI GLM-4.7, latest flagship model with enhanced reasoning capabilities.

Changes:
- Add zai/glm-4.7 to model pricing with /bin/bash.60/M input, .20/M output
- Add cached input pricing (/bin/bash.11/M) for GLM-4.7
- Add supports_reasoning flag to enable thinking parameter
- Update ZAIChatConfig to support thinking parameter for models with reasoning
- Update documentation with GLM-4.7 as latest flagship model
- Add cached input column to pricing table (GLM-4.7 only)
- Add tests for GLM-4.7 reasoning support and cost calculation
- Update all examples to use GLM-4.7

Model specifications:
- Context: 200K input, 128K output
- Supports: reasoning, function calling, tool choice, prompt caching
- Pricing: Same as GLM-4.6 with cache support

See: https://docs.z.ai/guides/llm/glm-4.7
2026-01-04 00:44:19 +05:30
Krish Dholakia 969790c463 Iam roles anywhere docs (#18559)
* Add documentation for IAM Roles Anywhere

Co-authored-by: krrishdholakia <krrishdholakia@gmail.com>

* Refactor Bedrock provider docs for IAM Roles Anywhere

Co-authored-by: krrishdholakia <krrishdholakia@gmail.com>

---------

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2026-01-04 00:10:07 +05:30
Krish Dholakia 87fe62229f feat: Add adopters page and data structure (#18605)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2026-01-03 21:51:19 +05:30
Krish Dholakia 0aae5153b6 docs: Clarify Bedrock AgentCore documentation (#18603)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2026-01-03 16:06:07 +05:30
Chetan Choudhary 687adc6024 Add log_format parameter to GenericAPILogger (#18587)
Adds log_format parameter supporting json_array (default), ndjson, and single formats. NDJSON format enables webhook integrations like Sumo Logic to parse individual log records at ingest time. Defaults to json_array for backward compatibility.
2026-01-02 23:28:30 +05:30
Sameer Kankute c8bb92afdc Merge pull request #18579 from BerriAI/litellm_remove_prompt_chaching_header
Remove prompt caching headers as the support has been removed
2026-01-02 21:01:50 +05:30
YutaSaito e9365c496d Merge pull request #18580 from BerriAI/litellm_feat_support_metaurl_on_ui
feat: add UI support for configuring meta URLs
2026-01-02 17:54:26 +09:00
Lukas de Boer edc8413f1e Add Kubernetes ServiceAccount JWT authentication support (#18055)
* Allow get_nested_value dot notation to support escaping for Kubernetes JWT Support

* Add support for team and org alias fields, add docs, tests

* Fix lint issue with max statements in handle jwt logic
2026-01-02 14:02:31 +05:30
Yuta Saito 35f9a75d55 feat: add UI support for configuring meta URLs 2026-01-02 15:07:37 +09:00
Sameer Kankute ca94990307 remove prompt caching headers as the support has been removed 2026-01-02 11:08:35 +05:30
Alexsander Hamir 936aa6821f [Fix] CI/CD - litellm_security_tests (#18567) 2026-01-01 14:20:04 -08:00
Alexsander Hamir a7da4833da [Fix] CI/CD - check_code_and_doc_quality (#18560) 2026-01-01 11:18:59 -08:00
Yuta Saito b343d15157 fix: prevent LiteLLM from closing external OTEL spans 2026-01-01 08:28:48 +09:00
Prajeena Maharjan b720fba142 Fix formatting in proxy configs documentation
Got an error message:

{"error":{"message":"Invalid JSON payload: trailing comma is not allowed: line 8 column 8 (char 141)","type":"invalid_request_error","param":"request_body","code":"400"}}%
2025-12-28 23:48:43 -06:00
Matt Cowger 9f4a4fd271 Add docs as follow on to #18362 (#18388) 2025-12-25 11:08:47 +05:30
Ishaan Jaff 1123cfa928 [Feat] AI Gateway - Add support for Platform Fee / Margins (#18427)
* init cost_margin_config

* feat: add cost margin

* init types

* LITELLM_SETTINGS_SAFE_DB_OVERRIDES

* feat _apply_cost_margin

* ui endpoint

* ui provider margins

* add margin

* refactored ui

* test cost margins

* refactored ui

* provider discounts

* add cost_breakdown to spendLogs

* add CostBreakdownViewer

* fix cost breakdown

* docs fix

* doc margins

* docs margins
2025-12-25 11:07:27 +05:30
Sameer Kankute 42121ad13b Update minimax docs to be in proper format 2025-12-24 09:25:36 +05:30