Ishaan Jaff
cbac70a4ec
MANUS docs ( #18817 )
2026-01-08 18:58:10 +05:30
Sameer Kankute
af6883712e
Add tests for kimi 2 bedrock model
2026-01-08 10:07:33 +05:30
Alexsander Hamir
98d7a428b6
Fix: Clarify database_connection_pool_limit applies per worker, not per instance ( #18780 )
2026-01-07 13:57:03 -08:00
Krish Dholakia
c89a4115eb
Update production proxy resource recommendations ( #18771 )
...
Co-authored-by: Cursor Agent <cursoragent@cursor.com >
2026-01-08 00:37:12 +05:30
Ishaan Jaff
929af510fa
[Feat] New provider - Add Azure BFL FLux for image edits ( #18766 )
...
* add azure_ai/flux.2-pro
* get_flux2_image_generation_url
* azure_client_params
* docs
* add Image Editing
* add azure ai image edits
* AzureFoundryFlux2ImageEditConfig
* TestAzureAIFlux2ImageEdit
2026-01-07 23:28:39 +05:30
Ishaan Jaff
3430325919
[Feat] Add Azure BFL - Flux 2 models ( #18764 )
...
* add azure_ai/flux.2-pro
* get_flux2_image_generation_url
* azure_client_params
* docs
2026-01-07 23:28:28 +05:30
Krish Dholakia
80ead21c3a
Litellm improve endpoint discovery ( #18762 )
...
* docs: document all endpoints in .json and add consistency checks against docs + providers.json
* docs: add more tests + improve coverage
2026-01-07 17:35:01 +05:30
Ishaan Jaffer
3530218930
fix aupload_container_file
2026-01-07 14:19:36 +05:30
Ishaan Jaff
1f141f0dbb
[Feat] Litellm new endpoint add container file upload ( #18743 )
...
* init upload_container_file
* init upload_container_file
* _prepare_multipart_file_upload
* fix upload_container_file
* aupload_container_file, upload_container_file
* register_container_file_endpoints
2026-01-07 13:36:55 +05:30
Yuta Saito
e5be160ae0
docs: sets ARIZE_PROJECT_NAME
2026-01-07 14:27:16 +09:00
Goutham Karthi
2865b17988
adding signoz integration to observability docs
2026-01-06 15:29:44 -08:00
Alexsander Hamir
bb4c01ffa0
Add LITELLM_DISABLE_LAZY_LOADING env var to fix VCR cassette creation issue ( #18725 )
2026-01-06 14:49:11 -08:00
Otavio Brito
18ca6b2add
Handle not supported region for vertex ai count tokens - v1/messages/count_tokens ( #18665 )
...
* Handle not supported region for vertex ai count tokens
* add unit test
2026-01-07 00:05:00 +05:30
Lundin Matthews
762345172c
Add LlamaGate as a new provider ( #18673 )
...
Adds LlamaGate (https://llamagate.dev ) as an OpenAI-compatible provider with:
- Provider configuration in providers.json
- Documentation page with usage examples
- Model pricing for 17 models across categories:
- General purpose (Llama 3.1/3.2, Mistral, Qwen, Dolphin)
- Reasoning (DeepSeek R1, OpenThinker)
- Code (Qwen Coder, DeepSeek Coder, CodeLlama)
- Vision (Qwen VL, LLaVA, Gemma 3)
- Embeddings (Nomic, Qwen3 Embedding)
Provider details:
- Base URL: https://api.llamagate.dev/v1
- Auth: Bearer token via LLAMAGATE_API_KEY
- Pricing: $0.02-$0.55 per 1M tokens
- All models are open-weights
2026-01-07 00:00:30 +05:30
Kazuki Matsumaru
26bdf7b7a8
Remove redundant comments about setting litellm.callbacks ( #18711 )
...
- Removed duplicate comment in test_router_endpoints.py
- Removed duplicate comment in logging.md
- Kept clearer comment: 'Set litellm.callbacks = [proxy_handler_instance] on the proxy'
2026-01-06 23:58:01 +05:30
Pascal Bro
cac2a8d158
Fix/gcs cache docs missing for proxy mode ( #13328 )
...
* fixed issues with gcs cache to verify functionality
* restore changes
* Fix capitalization of 'S3 Bucket Cache'
---------
Co-authored-by: Nelson Alfonso <45660392+Dashing-Nelson@users.noreply.github.com >
2026-01-06 23:49:08 +05:30
Sameer Kankute
1c177a576b
Add the LITELLM_REASONING_AUTO_SUMMARY in doc
2026-01-06 18:04:09 +05:30
Sameer Kankute
353769d2ac
Merge pull request #18688 from BerriAI/litellm_fix_reasoning_summary_default
...
[Fix] Put automatic reasoning summary inclusion behind feat flag
2026-01-06 17:56:40 +05:30
Ishaan Jaff
76eda472be
[Feat] New API Endpoint - Responses API (v1/responses/compact) ( #18697 )
...
* init transform_compact_response_api_request
* init acompact_responses
* init async_compact_response_api_handler in llm http handler
* init transform_compact_response_api_request for openai
* init acompact_responses
* fix acompact_responses
* add OAI Compact API
* docs responses API Compact
* code qa checks
* test_openai_compact_responses_api
* fix mypy linting
2026-01-06 16:24:04 +05:30
Sameer Kankute
bb00a53786
Put reasoning summary behind feat flag
2026-01-06 11:36:20 +05:30
0717376
12f02f6c54
feat: Add GigaChat provider support ( #18564 )
...
* feat: Add GigaChat provider support
Add native support for GigaChat API (Sber AI, Russia's leading LLM).
Supported features:
- Chat completions (sync/async)
- Streaming (sync/async)
- Function calling / Tools
- Structured output via JSON schema (emulated through function calls)
- Image input (base64 and URL)
- Embeddings
Closes #18515
* fix: resolve mypy type errors in GigaChat handler
- Fix _prepare_file_data return type (use 3-tuple for cleaner type flow)
- Add type annotations for lists in _process_content_parts methods
- Add type annotations in _collapse_user_messages
- Use ChatCompletionToolCallChunk for proper tool_use typing
- Add type: ignore[override] for astreaming async generator
* refactor(gigachat): migrate to BaseConfig pattern
* fix: remove unused imports
* fix: resolve mypy type errors
* fix: mypy type errors
* refactor: address review feedback for GigaChat provider
- Remove singleton pattern, reuse litellm HTTPHandler
- Move constants/errors to transformation files, delete common_utils.py
- Add models to model_prices_and_context_window.json
- Fix ssl_verify not passed to HTTP client for embeddings
* docs: update GigaChat documentation with ssl_verify requirement
2026-01-06 10:10:02 +05:30
YutaSaito
31470032ad
Merge pull request #18681 from BerriAI/litellm_feat_mcp-server-visibility-configurable
...
[feat] mcp server visibility configurable
2026-01-06 12:53:07 +09:00
Yuta Saito
29e658012a
docs: add user_mcp_management_mode
2026-01-06 11:43:58 +09:00
Yuta Saito
375f3ed439
fix: missing mcp_allow_all_ui.png
2026-01-06 11:23:22 +09:00
YutaSaito
100f617a8f
Merge pull request #18668 from BerriAI/litellm_feat_noma_mcp_guardrail
...
[feat] add support MCP guardrail to noma
2026-01-06 07:01:57 +09:00
Yuta Saito
bdd95314a7
docs: mention MCP prompt/resources types in overview
2026-01-06 06:29:37 +09:00
Yuta Saito
4eecf59f86
feat: add support MCP guardrail to noma
2026-01-06 06:15:56 +09:00
drorIvry
b6a64ff99a
feature/ add qualifire guardrails ( #18594 )
...
* init guardrails
* init guardrails
* some fixes
* some fixes
* ruff
* some fixes
* some fixes
* some fixes
* some fixes
* some fixes
* some fixes
* docs
2026-01-06 01:34:43 +05:30
orgersh92
d8d10f5e25
lasso-guardrail-api docs ( #18652 )
2026-01-05 20:58:26 +05:30
Ishaan Jaffer
f752ec3eaf
fix
2026-01-05 20:25:39 +05:30
Ishaan Jaff
bf1c5bef59
[UI] Add view for estimating costs across requests ( #18645 )
...
* add estimate_cost endpoint
* TestCostEstimateEndpoint
* fix estimate_cost
* add /cost/estimate to spend tracking routes
* fix code QA checks
* fixes endpoint
* v0 cost estimator
* v0 cost estimator
* formatNumberWithCommas
* fix admin view
* docs
* docs fix + export PDF/CSV
* fixes for export
2026-01-05 19:19:58 +05:30
Yuta Saito
fee9553c51
feat: add allow_all_keys doc
2026-01-05 16:56:19 +09:00
amangupta-20
399579f8ea
feat: Add Levo AI integration ( #18529 )
2026-01-05 08:49:21 +05:30
cantalupo555
9b1c5f7e36
feat(zai): Add GLM-4.7 model with reasoning support ( #18476 )
...
Add support for Z.AI GLM-4.7, latest flagship model with enhanced reasoning capabilities.
Changes:
- Add zai/glm-4.7 to model pricing with /bin/bash.60/M input, .20/M output
- Add cached input pricing (/bin/bash.11/M) for GLM-4.7
- Add supports_reasoning flag to enable thinking parameter
- Update ZAIChatConfig to support thinking parameter for models with reasoning
- Update documentation with GLM-4.7 as latest flagship model
- Add cached input column to pricing table (GLM-4.7 only)
- Add tests for GLM-4.7 reasoning support and cost calculation
- Update all examples to use GLM-4.7
Model specifications:
- Context: 200K input, 128K output
- Supports: reasoning, function calling, tool choice, prompt caching
- Pricing: Same as GLM-4.6 with cache support
See: https://docs.z.ai/guides/llm/glm-4.7
2026-01-04 00:44:19 +05:30
Krish Dholakia
969790c463
Iam roles anywhere docs ( #18559 )
...
* Add documentation for IAM Roles Anywhere
Co-authored-by: krrishdholakia <krrishdholakia@gmail.com >
* Refactor Bedrock provider docs for IAM Roles Anywhere
Co-authored-by: krrishdholakia <krrishdholakia@gmail.com >
---------
Co-authored-by: Cursor Agent <cursoragent@cursor.com >
2026-01-04 00:10:07 +05:30
Krish Dholakia
87fe62229f
feat: Add adopters page and data structure ( #18605 )
...
Co-authored-by: Cursor Agent <cursoragent@cursor.com >
2026-01-03 21:51:19 +05:30
Krish Dholakia
0aae5153b6
docs: Clarify Bedrock AgentCore documentation ( #18603 )
...
Co-authored-by: Cursor Agent <cursoragent@cursor.com >
2026-01-03 16:06:07 +05:30
Chetan Choudhary
687adc6024
Add log_format parameter to GenericAPILogger ( #18587 )
...
Adds log_format parameter supporting json_array (default), ndjson, and single formats. NDJSON format enables webhook integrations like Sumo Logic to parse individual log records at ingest time. Defaults to json_array for backward compatibility.
2026-01-02 23:28:30 +05:30
Sameer Kankute
c8bb92afdc
Merge pull request #18579 from BerriAI/litellm_remove_prompt_chaching_header
...
Remove prompt caching headers as the support has been removed
2026-01-02 21:01:50 +05:30
YutaSaito
e9365c496d
Merge pull request #18580 from BerriAI/litellm_feat_support_metaurl_on_ui
...
feat: add UI support for configuring meta URLs
2026-01-02 17:54:26 +09:00
Lukas de Boer
edc8413f1e
Add Kubernetes ServiceAccount JWT authentication support ( #18055 )
...
* Allow get_nested_value dot notation to support escaping for Kubernetes JWT Support
* Add support for team and org alias fields, add docs, tests
* Fix lint issue with max statements in handle jwt logic
2026-01-02 14:02:31 +05:30
Yuta Saito
35f9a75d55
feat: add UI support for configuring meta URLs
2026-01-02 15:07:37 +09:00
Sameer Kankute
ca94990307
remove prompt caching headers as the support has been removed
2026-01-02 11:08:35 +05:30
Alexsander Hamir
936aa6821f
[Fix] CI/CD - litellm_security_tests ( #18567 )
2026-01-01 14:20:04 -08:00
Alexsander Hamir
a7da4833da
[Fix] CI/CD - check_code_and_doc_quality ( #18560 )
2026-01-01 11:18:59 -08:00
Yuta Saito
b343d15157
fix: prevent LiteLLM from closing external OTEL spans
2026-01-01 08:28:48 +09:00
Prajeena Maharjan
b720fba142
Fix formatting in proxy configs documentation
...
Got an error message:
{"error":{"message":"Invalid JSON payload: trailing comma is not allowed: line 8 column 8 (char 141)","type":"invalid_request_error","param":"request_body","code":"400"}}%
2025-12-28 23:48:43 -06:00
Matt Cowger
9f4a4fd271
Add docs as follow on to #18362 ( #18388 )
2025-12-25 11:08:47 +05:30
Ishaan Jaff
1123cfa928
[Feat] AI Gateway - Add support for Platform Fee / Margins ( #18427 )
...
* init cost_margin_config
* feat: add cost margin
* init types
* LITELLM_SETTINGS_SAFE_DB_OVERRIDES
* feat _apply_cost_margin
* ui endpoint
* ui provider margins
* add margin
* refactored ui
* test cost margins
* refactored ui
* provider discounts
* add cost_breakdown to spendLogs
* add CostBreakdownViewer
* fix cost breakdown
* docs fix
* doc margins
* docs margins
2025-12-25 11:07:27 +05:30
Sameer Kankute
42121ad13b
Update minimax docs to be in proper format
2025-12-24 09:25:36 +05:30