Commit Graph

3827 Commits

Author SHA1 Message Date
Krrish Dholakia af930f22a3 docs: add missing docs 2025-05-23 21:55:30 -07:00
Mayank 8da898c55e [feature] ConfidentAI logging enabled for proxy and sdk (#10649)
* async success implemented

* fail async event

* sync events added

* docs added

* docs added

* test added

* style

* test

* .

* lock file genrated due to tenacity change

* mypy errors

* resolved comments

* resolved comments

* resolved comments

* resolved comments

* style

* style

* resolved comments
2025-05-23 11:40:48 -07:00
Ishaan Jaff c8a0088970 [Fix] Reliability Fix - Removing code that was creating threads on errors (#11066)
* fix: only init langfuse if active

* fix: only init langfuse if active

* fix: add initialized_langfuse_clients count

* fix: add MAX_LANGFUSE_INITIALIZED_CLIENTS

* fix: use safe init langfuse

* test: init langfuse clients

* test: test_langfuse_not_initialized_returns_none_early

* docs MAX_LANGFUSE_INITIALIZED_CLIENTS

* fix: use correct langfuse callback

* fix: code qa
2025-05-22 18:04:15 -07:00
Ishaan Jaff 89daa1dbad docs add claude-4 models 2025-05-22 13:41:09 -07:00
Ishaan Jaff 2c90ca0189 docs fix example 2025-05-22 09:33:22 -07:00
Ishaan Jaff d95c3a16f3 docs fix ad hoc recognizer 2025-05-22 08:55:07 -07:00
Ishaan Jaff dd4a65b83a Feat: add MCP to Responses API and bump openai python sdk (#11029)
* feat: add MCP to responses API

* feat: bump openai version to 1.75.0

* docs MCP + responses API

* fixes: type checking

* fixes: type checking

* build: use latest openai 1.81.0

* fix: linting error

* fix: linting error

* fix: test

* fix: linting errors

* fix: test

* fix: test

* fix: linting

* Revert "fix: linting"

This reverts commit ebb19ff8cb1f8fcc3e224390e351676daccb33de.

* fix: linting
2025-05-22 07:24:10 -07:00
fzowl 73727abd84 Updating the available VoyageAI models in the docs (#11003)
* Refresh VoyageAI models and prices and context

* Refresh VoyageAI models and prices and context

* Refresh VoyageAI models and prices and context

* Updating the available VoyageAI models in the docs

* Updating the available VoyageAI models in the docs
2025-05-21 14:04:43 -07:00
Søren Mathiasen aa11ea3a9f support vertex_ai global endpoints for chat (#10658) 2025-05-21 07:58:36 -07:00
Ishaan Jaff 3a6802fef1 [Feat] - Add Support for Showing Passthrough endpoint Error Logs on LiteLLM UI (#10990)
* fix: add error logging for passthrough endpoints

* feat: add error logging for passthrough endpoints

* fix: post_call_failure_hook track errors on pt

* fix: use constant for MAXIMUM_TRACEBACK_LINES_TO_LOG

* docs MAXIMUM_TRACEBACK_LINES_TO_LOG

* test: ensure failure callback triggered

* fix: move _init_kwargs_for_pass_through_endpoint
2025-05-20 18:29:39 -07:00
Krrish Dholakia a676f69cab docs: cleanup 2025-05-19 22:10:04 -07:00
Jugal D. Bhatt c6260fe31a add cla to docs (#10963)
* add cla to docs

* cla docs clarity
2025-05-19 21:23:22 -07:00
Cole McIntosh 4381ddc963 Add new documentation files for LiteLLM (#10961)
- Created `llms-full.txt` with comprehensive details on LiteLLM features, usage, and supported models.
- Added `llms.txt` for quick access to key links and resources related to LiteLLM, including guides, release notes, and integration documentation.
2025-05-19 15:17:44 -07:00
Daniel Staiger be672b2b18 Update github.md (#10946)
Updated clarification in the use of the models form github. (Github uses the model name: <company>/<model-Name> while litellm wants github/<model-Name>

Updated the example to a model that is actually supported / available on github right now
2025-05-19 13:21:50 -07:00
Jorge Piedrahita Ortiz fd40bc6ddd update sambanova models and parameters (#10900)
* add sambanova to completion input params table

* update sambanova supported args

* update sambanova supported models

* minor changes

* fix sambanova model list

* update sambanova models

* update sambanova models

* update sambanova docs

* minor chnage sambanova url

* update type to match OpenAIGPTConfig

* minor change
2025-05-18 20:45:20 -07:00
Earl St Sauver 6ac3c6ba35 Enable structured JSON schema outputs in LM Studio (for validated responses) (#10929)
- docs/my-website/docs/providers/lm_studio.md: add Structured Output section with JSON schema and Pydantic examples
- litellm/llms/lm_studio/chat/transformation.py: extend map_openai_params to handle `response_format` mappings (`json_schema`, `json_object`) and move them to optional_params
- litellm/utils.py: include `LM_STUDIO` in `supports_response_schema` list
- tests/litellm/llms/lm_studio/test_lm_studio_chat_transformation.py: add tests for Pydantic model and dict-based JSON schema handling

Co-authored-by: Earl St Sauver <estasuver@gmail.com>
2025-05-18 20:43:49 -07:00
Krrish Dholakia 16b2158d25 docs: update release notes 2025-05-17 19:47:57 -07:00
Krrish Dholakia 6bfd05bb76 docs: update docs with key highlights 2025-05-17 19:47:57 -07:00
DarinVerheijke 54a0766ff9 Add provider Featherless AI (access to 4200+ models) (#10596)
* add featherless provider

* change docs

* clean lint

* fix provider logic

* Add fixes per review

* update parameters

* update error msg

* remove featherless

* add tests

* add mock test e2e

* update to openai mock test
2025-05-17 19:38:46 -07:00
Krrish Dholakia b53f1e7677 docs: clarify enterprise feature 2025-05-17 19:36:35 -07:00
Ishaan Jaff 8a05c57071 docs pii masking cleanup 2025-05-17 18:55:58 -07:00
Ishaan Jaff d367eb3460 [Docs] Pii masking with LiteLLM (#10924)
* doc pii masking

* remove old doc

* docs add deployment options

* docs guardrail pii masking

* docs pii masking / tracing

* add images of tracing guards litellm ui

* docs presidio pii
2025-05-17 18:35:33 -07:00
Ishaan Jaff cdf6026fdd docs apply guard (#10923) 2025-05-17 18:34:35 -07:00
Krish Dholakia b4fc703d3b LiteLLM Stable release notes (#10919)
* docs(index/v1.70.1-stable): style improvements

* style: add style improvements to docs

* docs: cleanup docs

* docs: more style improvements

* docs: style improvements

* docs(gemini/realtime): add docs on realtime api via Google AI Studio

* docs: add openai example to anthropic web search docs

* docs: add missing doc links

* docs: doc cleanup

* docs: add more doc links

* fix: cleanup

* docs: add docker information

* docs: update doc links

* docs: add demo instance details to doc

s
2025-05-17 17:54:25 -07:00
Krrish Dholakia d79cd70a1a docs(index/v1.70.1-stable): add initial release note 2025-05-17 16:13:44 -07:00
Jugal D. Bhatt b342aa9253 add docs for spend logs (#10913)
* add docs for spend logs

* change docs

* change docs

* change docs and change default run loops.

There was a calc error previously, it was not a million when run 100 times, but 100k, now default run would delete 500k records in 50s (running 500 loops of function before exiting)

* add docs changes to ui logs to metion deletion
2025-05-17 16:00:28 -07:00
Jugal D. Bhatt 7702663666 add constant to config settings.md (#10908) 2025-05-16 18:33:19 -07:00
Jugal D. Bhatt 44c100d05d [Feat] handle multi-pod deployment for SpendLogs Retention (#10895)
* handle multi-pod deployment

* fix utc and ruff errors

* add constants.py

* add lock duration acc to interval

* add lock duration on pod lock manager

* update tests to use redis

* update comments from review

* update config_Settings.md

* lint errors

* remove custom ttl setting

* add constants.py

* add constants.py

* add check for pod lock manager, and allow otherwise

* remove dup try except and move to finally

remove args
2025-05-16 18:18:51 -07:00
Ishaan Jaff bd1e0634bf [Feat] Add Lakera v2 Guardrail Support (#10880)
* feat: add types for lakera v2

* feat: add call v2 guard for lakera

* feat: add call v2 guard for lakera

* feat: add LAKERA_V2

* feat: add LAKERA_V2 params

* feat: add initialize_lakera_v2

* fix: lakera pii masking

* test: lakera pii masking

* fix: lakera pii masking with tracing

* fix: lakera pii masking with tracing

* fix: fix linting errors

* fix: lakera ai docs
2025-05-16 17:08:21 -07:00
Will Chen 0942a9d51d Fix example commands in reliability.md (#10894)
-D is incorrect and dumps the headers.
The correct flag is -d which sets it as data
2025-05-16 13:21:34 -07:00
Jason 803dda05ed fix: fix novita ai docs (#10882) 2025-05-16 10:04:55 -07:00
Krrish Dholakia ffcdf441d2 fix: fix docs 2025-05-15 22:32:02 -07:00
Damian Gleumes 384a7ba94d [Feat]: Configure LiteLLM to Parse User Headers from Open Web UI (#9802)
* add user_header_name

* docs: add per-user tracking to Open WebUI with LiteLLM doc

* docs: standardize "OpenWeb UI" spelling across openweb_ui.md

* docs: improve wording for openweb_ui guide

* fix end_user_id not being set

- move user header parsing to add_litellm_data_to_request
- also set  user_api_key_dict.end_user_id from user header
2025-05-15 22:01:12 -07:00
dependabot[bot] 1e08816489 build(deps): bump undici from 6.21.1 to 6.21.3 in /docs/my-website (#10855)
Bumps [undici](https://github.com/nodejs/undici) from 6.21.1 to 6.21.3.
- [Release notes](https://github.com/nodejs/undici/releases)
- [Commits](https://github.com/nodejs/undici/compare/v6.21.1...v6.21.3)

---
updated-dependencies:
- dependency-name: undici
  dependency-version: 6.21.3
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-05-15 08:33:12 -07:00
user202729 044f5f973b Add documentation on in-memory and disk caching (#10522) 2025-05-14 23:07:13 -07:00
Ishaan Jaff fc8d28f7df docs - DEFAULT_MAX_RECURSE_DEPTH_SENSITIVE_DATA_MASKER 2025-05-14 18:29:31 -07:00
Ishaan Jaff abe7b71e6a Revert "Revert "[Fix] Allow overriding all constants using a .env variable (#10803)""
This reverts commit cf4b2ebda4.
2025-05-14 18:27:43 -07:00
Jugal D. Bhatt a754a25828 Feat add retention config (#10815)
* add function to check config flag

* added unit tests

* convert to seconds support

* added in settings.md

* Updated config_settings.md

* remove extra point

* change config var

* resolve conflict
2025-05-14 18:16:25 -07:00
Ishaan Jaff be18923250 fix code qa check for OTEL env vars 2025-05-14 17:56:27 -07:00
Damien Pontifex 63a4287bff Supper OTLP standard environment variables for configuration (#10813)
- Environment variables outlined at https://opentelemetry.io/docs/languages/sdk-configuration/otlp-exporter/
- Default to OTLP standard, but fallback to existing so as to not break backwards compatibility
2025-05-14 17:40:28 -07:00
Ishaan Jaff cf4b2ebda4 Revert "[Fix] Allow overriding all constants using a .env variable (#10803)"
This reverts commit de901bbf24.
2025-05-13 21:27:44 -07:00
Ishaan Jaff 8142c20c98 [Feat] Allow specifying PII Entities Config when using Presidio Guardrails (#10810)
* refactor: use analyze_text, anonymize_text

* feat: allow defining pii_entities_config for presidio

* feat: use entities config for presidio analyze request

* feat: add test_presidio_pii.py

* testing: add guardrails testing job

* feat: allow blocking specific entities pii

* test: use 1 file for presidio guard tests

* fix: presidio pii tests

* test: presidio blocked entity

* clean up docs

* docs presidio pii parsing

* fix: raise_exception_if_blocked_entities_detected

* fix: linting errors
2025-05-13 19:48:56 -07:00
Ishaan Jaff 5fbcd22604 [Fix] Allow overriding all constants using a .env variable (#10803)
* fix: bump: DEFAULT_MAX_RECURSE_DEPTH

* fix: bump: DEFAULT_MAX_RECURSE_DEPTH

* test: test_vertex_ai_complex_response_schema

* fix: allow all constants to be overriden

* fix: allow all numeric constants to be overriden with env vars

* fix: remove dup DEFAULT_MAX_TOKENS in constants.py

* document all constants env vars

* docs - DEFAULT_PROMPT_INJECTION_SIMILARITY_THRESHOLD
2025-05-13 16:10:55 -07:00
Krish Dholakia 53f6514f12 Support Anthropic web search tool + Add more google finish reason mapping (#10785)
* fix(vertex_and_google_ai_studio_gemini.py): handle additional google finish reasons

Fixes https://github.com/BerriAI/litellm/issues/10768

* test: add more unit tests

* fix(anthropic/chat/transformation.py): support anthropic web search

Fixes https://github.com/BerriAI/litellm/issues/10664

* fix(anthropic/chat/transformation.py): add anthropic web search 'max uses' param support

* docs(anthropic.md): add doc for web search tool calling

Closes https://github.com/BerriAI/litellm/issues/10664

* build(model_prices_and_context_window.json): add search tool pricing for anthropic

* fix: suppress linting error

* test: update tests

* fix: fix ruff check
2025-05-12 22:45:51 -07:00
Krish Dholakia d37cc63250 Add new model provider Novita AI (#7582) (#9527)
* Add new model provider Novita AI (#7582)

* feat: add new model provider Novita AI

* feat: use deepseek r1 model for examples in Novita AI docs

* fix: fix tests

* fix: fix tests for novita

* fix: fix novita transformation

* ci: fix ci yaml

* fix: fix novita transformation and test (#10056)

---------

Co-authored-by: Jason <ggbbddjm@gmail.com>
2025-05-12 21:49:30 -07:00
Ishaan Jaff 643d2a8ccb [Feat] Option to force/always use the litellm proxy (#10559) (#10633) (#10773)
* [Feat] Option to force/always use the litellm proxy (#10559) (#10633)

* fix: add use_litellm_proxy

* fix: update LiteLLMProxyChatConfig

* fix get llm provider logic

* tests get llm provider logic

* add dynamic use_litellm_proxy

* docs forcsing litellm proxy usage

* fix: _should_use_litellm_proxy_by_default

* fixes: get_custom_llm_provider

---------

Co-authored-by: Antoine Legrand <2t.antoine@gmail.com>
2025-05-12 20:22:54 -07:00
Oscar Savolainen 75135ac884 Update Nscale model providers to point to website (#10764)
* Instead of listing models and pricing, we provide a link to our website. We also highlight our free credits.

* Add all models supported tip to top of NScale README

* add space

---------

Co-authored-by: Oscar Savolainen <oscar.savolainen@enscale.com>
2025-05-12 17:10:39 -07:00
Ishaan Jaff 2cc4a87861 [Docs] Using litellm with Google ADK (#10777)
* docs litellm ADK usage

* docs litellm google adk

* docs litellm ADK

* docs litellm with ADK usage examples

* docs litellm proxy with ADK

* cookbook litellm ADK
2025-05-12 16:41:49 -07:00
Krrish Dholakia 0d6efff312 docs: update change on docs 2025-05-10 23:59:40 -07:00
Ishaan Jaff 8bcd5b6094 docs fix order 2025-05-10 23:16:51 -07:00