3464 Commits

Author SHA1 Message Date
Ishaan Jaff cd5f0e809e docs responses API session management 2025-04-26 14:01:24 -07:00
Ishaan Jaff 3d516fee1f docs responses API GET, DELETE examples 2025-04-26 12:40:21 -07:00
Ishaan Jaff 8c8b786267 doc fix stable release add images of UI improvements 2025-04-26 12:13:59 -07:00
Krish Dholakia 93b6df96e0 Prisma Migrate - support setting custom migration dir (#10336)
* build(litellm-proxy-extras/utils.py): correctly generate baseline migration for non-empty db

* fix(litellm-proxy-extras/utils.py): Fix issue in migration, where if a migration fails during baselining, all are still marked as applied

* fix(prisma_client.py): don't pass separate schema.prisma to litellm-proxy-extras

use the one in litellm-proxy-extras

* fix(litellm-proxy-extras/utils.py): support passing custom dir for baselining db in read-only fs

Fixes https://github.com/BerriAI/litellm/issues/9885

* fix(utils.py): give helpful warning message when permission denied error raised in fs
2025-04-26 12:05:06 -07:00
Ishaan Jaff f2899cb66e [Docs] v1.67.4-stable (#10338)
* draft 1.67.4

* completed New Models / Updated Models

* docs clean up image gen

* docs azure granularities for whisper

* docs New Models / Updated Models

* docs cohere

* update cohere docs

* complete New Models / Updated Models updates

* docs responses API

* complete Logging / Guardrail Integrations

* completed Logging / Guardrail Integrations

* doc Spend Tracking Improvements

* Logging / Guardrail Integrations
2025-04-26 11:54:18 -07:00
Ishaan Jaff f7e75fa276 [Docs] UI Session Logs (#10334)
* add ui logs session doc

* docs add instructions on how to do session management

* docs session management

* docs session management

* docs responses api session management

* docs ui logs
2025-04-26 08:26:23 -07:00
Géraud Le Falher 0cc0da37f3 Update image_generation.md (#10312)
Copy parameters `size` and `quality` from [OpenAI API page](https://platform.openai.com/docs/api-reference/images/create)
2025-04-25 08:50:12 -07:00
Krrish Dholakia 6023427dae docs(gemini.md): cleanup 2025-04-23 21:54:12 -07:00
Dimitri Papadopoulos Orfanos 5e2fd49dd3 Fix typos (#10232) 2025-04-23 20:59:25 -07:00
Ishaan Jaff 36ee132514 [Feat] Add gpt-image-1 cost tracking (#10241)
* add gpt-image-1

* add gpt-image-1 example to docs
2025-04-23 12:20:55 -07:00
Ishaan Jaff 6e4fed59b6 docs agent ops logging 2025-04-22 18:32:28 -07:00
Dwij b2955a2bdd Add AgentOps Integration to LiteLLM (#9685)
* feat(sidebars): add new item for agentops integration in Logging & Observability category

* Update agentops_integration.md to enhance title formatting and remove redundant section

* Enhance AgentOps integration in documentation and codebase by removing LiteLLMCallbackHandler references, adding environment variable configurations, and updating logging initialization for AgentOps support.

* Update AgentOps integration documentation to include instructions for obtaining API keys and clarify environment variable setup.

* Add unit tests for AgentOps integration and improve error handling in token fetching

* Add unit tests for AgentOps configuration and token fetching functionality

* Corrected agentops test directory

* Linting fix

* chore: add OpenTelemetry dependencies to pyproject.toml

* chore: update OpenTelemetry dependencies and add new packages in pyproject.toml and poetry.lock
2025-04-22 10:29:01 -07:00
Ishaan Jaff ebfff975d4 docs responses routing 2025-04-21 23:05:53 -07:00
Ishaan Jaff 104e4cb1bc [Feat] Add infinity embedding support (contributor pr) (#10196)
* Feature - infinity support for #8764 (#10009)

* Added support for infinity embeddings

* Added test cases

* Fixed tests and api base

* Updated docs and tests

* Removed unused import

* Updated signature

* Updated validate params

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>

* fix InfinityEmbeddingConfig

---------

Co-authored-by: Prathamesh Saraf <pratamesh1867@gmail.com>
2025-04-21 20:01:29 -07:00
Ishaan Jaff 0c2f705417 [Feat] Add Responses API - Routing Affinity logic for sessions (#10193)
* test for test_responses_api_routing_with_previous_response_id

* test_responses_api_routing_with_previous_response_id

* add ResponsesApiDeploymentCheck

* ResponsesApiDeploymentCheck

* ResponsesApiDeploymentCheck

* fix ResponsesApiDeploymentCheck

* test_responses_api_routing_with_previous_response_id

* ResponsesApiDeploymentCheck

* test_responses_api_deployment_check.py

* docs routing affinity

* simplify ResponsesApiDeploymentCheck

* test response id

* fix code quality check
2025-04-21 20:00:27 -07:00
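The routing-affinity idea in the commit above (a follow-up request that carries a `previous_response_id` goes back to the deployment that produced that response) can be sketched roughly as follows. This is only an illustration under stated assumptions: `AffinityRouter`, `pick`, and `record` are hypothetical names, the in-memory dictionary is a stand-in for whatever the project actually uses, and nothing here is taken from `ResponsesApiDeploymentCheck` itself.

```python
import random
from typing import Dict, List, Optional


class AffinityRouter:
    """Hypothetical router that keeps a session on one deployment."""

    def __init__(self, deployments: List[str]) -> None:
        self.deployments = deployments
        # Maps a response id to the deployment that produced it (assumed storage).
        self._owner: Dict[str, str] = {}

    def record(self, response_id: str, deployment: str) -> None:
        """Remember which deployment served a given response."""
        self._owner[response_id] = deployment

    def pick(self, previous_response_id: Optional[str] = None) -> str:
        """Reuse the owning deployment when known, otherwise pick any."""
        if previous_response_id in self._owner:
            return self._owner[previous_response_id]
        return random.choice(self.deployments)
```

A caller would invoke `record` after each response and pass the id of the previous response to `pick` on the next turn; unknown or missing ids fall back to ordinary selection.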
Krrish Dholakia 1ff7625984 docs: cleanup 2025-04-20 09:26:05 -07:00
Krrish Dholakia aa55103486 docs: cleanup doc 2025-04-20 09:20:47 -07:00
Krrish Dholakia 1d9b58688b docs(sidebars.js): place scim doc in correct place 2025-04-20 09:20:10 -07:00
Krrish Dholakia bb13ac45c8 docs(index.md): cleanup 2025-04-19 19:16:10 -07:00
Ishaan Jaff 1be36be72e Litellm docs SCIM (#10174)
* docs scim

* docs SCIM stash

* docs litellm SCIM

* docs fix

* docs scim with LiteLLM
2025-04-19 18:29:09 -07:00
Krish Dholakia bbfcb1ac7e Litellm release notes 04 19 2025 (#10169)
* docs(index.md): initial draft release notes

* docs: note all pending docs

* build(model_prices_and_context_window.json): add o3, gpt-4.1, o4-mini pricing

* docs(vllm.md): update vllm doc to show file message type support

* docs(mistral.md): add mistral passthrough route doc

* docs(gemini.md): add gemini thinking to docs

* docs(vertex.md): add thinking/reasoning content for gemini models to docs

* docs(index.md): more links

* docs(index.md): add more links, images

* docs(index.md): cleanup highlights
2025-04-19 17:26:30 -07:00
Ishaan Jaff daf024bad1 Supported Responses API Parameters 2025-04-19 17:14:53 -07:00
Ishaan Jaff f39d917886 [Docs] Responses API (#10172)
* docs litellm responses api

* doc fix

* docs responses API

* add get_supported_openai_params for LiteLLMCompletionResponsesConfig

* add Supported Responses API Parameters
2025-04-19 17:10:45 -07:00
Ishaan Jaff 6de3481252 doc fix 2025-04-18 19:57:03 -07:00
Ishaan Jaff 76f00a5121 add info on litellm release 2025-04-18 19:56:29 -07:00
Ishaan Jaff 3d5022bd79 [Feat] Support for all litellm providers on Responses API (works with Codex) - Anthropic, Bedrock API, VertexAI, Ollama (#10132)
* transform request

* basic handler for LiteLLMCompletionTransformationHandler

* complete transform litellm to responses api

* fixes to test

* fix stream=True

* fix streaming iterator

* fixes for transformation

* fixes for anthropic codex support

* fix pass response_api_optional_params

* test anthropic responses api tools

* update responses types

* working codex with litellm

* add session handler

* fixes streaming iterator

* fix handler

* add litellm codex example

* fix code quality

* test fix

* docs litellm codex

* litellm codex doc

* docs openai codex with litellm

* docs litellm openai codex

* litellm codex

* linting fixes for transforming responses API

* fix import error

* fix responses api test

* add sync iterator support for responses api
2025-04-18 19:53:59 -07:00
Ishaan Jaff 19664960eb docs azure responses API 2025-04-17 18:47:44 -07:00
Krrish Dholakia 8be8022914 docs(vertex_ai.md): document new vertex passthrough route 2025-04-16 22:25:16 -07:00
dependabot[bot] 7b7b43e1a7 build(deps): bump http-proxy-middleware in /docs/my-website (#10064)
Bumps [http-proxy-middleware](https://github.com/chimurai/http-proxy-middleware) from 2.0.7 to 2.0.9.
- [Release notes](https://github.com/chimurai/http-proxy-middleware/releases)
- [Changelog](https://github.com/chimurai/http-proxy-middleware/blob/v2.0.9/CHANGELOG.md)
- [Commits](https://github.com/chimurai/http-proxy-middleware/compare/v2.0.7...v2.0.9)

---
updated-dependencies:
- dependency-name: http-proxy-middleware
  dependency-version: 2.0.9
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-16 09:55:44 -07:00
Ishaan Jaff dcc43e797a [Docs] Auto prompt caching (#10044)
* docs prompt cache controls

* doc fix auto prompt caching
2025-04-15 22:29:47 -07:00
Marc Abramowitz 837a6948d8 Fix typo: Entrata -> Entra in code (#9922)
* Fix typo: Entrata -> Entra

* Fix a few more
2025-04-15 17:31:18 -07:00
Joakim Lorentz c9cf43df5b chore(docs): Update logging.md (#10006)
Fixes a missing slash in OTEL_ENDPOINT example
2025-04-15 16:34:55 -07:00
Krrish Dholakia 8424171c2a fix(config_settings.md): cleanup 2025-04-15 13:41:22 -07:00
Krrish Dholakia aff0d1a18c docs(cohere.md): add cohere cost tracking support to docs 2025-04-14 23:46:58 -07:00
Eoous e94eb4ec70 env for litellm.modify_params (#9964) 2025-04-14 22:33:56 -07:00
Krish Dholakia 9b0f871129 Add /vllm/* and /mistral/* passthrough endpoints (adds support for Mistral OCR via passthrough)
* feat(llm_passthrough_endpoints.py): support mistral passthrough

Closes https://github.com/BerriAI/litellm/issues/9051

* feat(llm_passthrough_endpoints.py): initial commit for adding vllm passthrough route

* feat(vllm/common_utils.py): add new vllm model info route

make it possible to use vllm passthrough route via factory function

* fix(llm_passthrough_endpoints.py): add all methods to vllm passthrough route

* fix: fix linting error

* fix: fix linting error

* fix: fix ruff check

* fix(proxy/_types.py): add new passthrough routes

* docs(config_settings.md): add mistral env vars to docs
2025-04-14 22:06:33 -07:00
Krish Dholakia 8faf56922c Fix azure tenant id check from env var + response_format check on api_version 2025+ (#9993)
* fix(azure/common_utils.py): check for azure tenant id, client id, client secret in env var

Fixes https://github.com/BerriAI/litellm/issues/9598#issuecomment-2801966027

* fix(azure/gpt_transformation.py): fix passing response_format to azure when api year = 2025

Fixes https://github.com/BerriAI/litellm/issues/9703

* test: monkeypatch azure api version in test

* test: update testing

* test: fix test

* test: update test

* docs(config_settings.md): document env vars
2025-04-14 22:02:35 -07:00
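The `response_format` fix above concerns gating a feature on the Azure `api_version` string once the year rolls over to 2025. A minimal sketch of a year-aware check is given below; the function name and the 2024-08 threshold are assumptions made for illustration and are not taken from `azure/gpt_transformation.py`.

```python
def api_version_at_least(api_version: str, min_year: int = 2024, min_month: int = 8) -> bool:
    """Return True when an api_version like '2025-01-01-preview' is new enough.

    The default threshold (2024-08) is an assumed value for illustration.
    """
    parts = api_version.split("-")
    try:
        year, month = int(parts[0]), int(parts[1])
    except (IndexError, ValueError):
        # Unparseable versions are treated as unsupported.
        return False
    # Comparing (year, month) as a tuple keeps a 2025-01 version above 2024-08,
    # which a month-only comparison would get wrong.
    return (year, month) >= (min_year, min_month)
```

Comparing the pair rather than the month alone is the design point: a version dated January 2025 has a smaller month than August 2024 but is still newer.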
Marc Klingen db857c74d4 chore: ordering of logging & observability docs (#9994) 2025-04-14 16:49:04 -07:00
Ishaan Jaff 64bb89c70f docs fix 2025-04-12 21:20:54 -07:00
Krrish Dholakia 2ed63da5f8 docs: cleanup 2025-04-12 19:52:19 -07:00
Krrish Dholakia f8d52e2db9 docs: refactor order 2025-04-12 19:23:07 -07:00
Krrish Dholakia 65e18f6abe docs(index.md): update changelog with realtime api cost tracking details 2025-04-12 19:15:40 -07:00
Krrish Dholakia 44368389f4 docs(litellm_managed_files.md): cleanup doc 2025-04-12 18:24:52 -07:00
Ishaan Jaff 2394cd465e stable release note fixes (#9954)
* docs fix

* docs metrics

* docs fix release notes

* docs 1.66.0-stable
2025-04-12 17:26:38 -07:00
Ishaan Jaff c86e678809 [Docs] v1.66.0-stable fixes (#9953)
* add categories for spend tracking improvements

* xai reasoning usage

* docs tag management

* docs tag based routing

* [Beta] Routing based

* docs tag based routing

* docs tag routing

* docs enterprise web search
2025-04-12 16:57:25 -07:00
Ishaan Jaff eb998ee1c0 [v1.66.0-stable] Release notes (#9952)
* release notes

* docs release notes

* docs fix release notes

* docs clean up

* docs clean up

* release notes

* docs sso tag management
2025-04-12 15:32:52 -07:00
Krish Dholakia 25d4cf1c1d Litellm managed files docs (#9948)
* docs(files_endpoints.md): add doc on litellm managed files

* refactor: separate litellm managed file docs from `/files` docs

clearer

* docs(litellm_managed_files.md): add architecture diagram explaining managed files
2025-04-12 13:02:33 -07:00
dependabot[bot] eb19639215 build(deps): bump @babel/runtime in /docs/my-website (#9934)
Bumps [@babel/runtime](https://github.com/babel/babel/tree/HEAD/packages/babel-runtime) from 7.26.0 to 7.27.0.
- [Release notes](https://github.com/babel/babel/releases)
- [Changelog](https://github.com/babel/babel/blob/main/CHANGELOG.md)
- [Commits](https://github.com/babel/babel/commits/v7.27.0/packages/babel-runtime)

---
updated-dependencies:
- dependency-name: "@babel/runtime"
  dependency-version: 7.27.0
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-12 07:45:53 -07:00
Ishaan Jaff 57bc03b30b [Feat] Add reasoning_effort support for xai/grok-3-mini-beta model family (#9932)
* add BaseReasoningEffortTests

* BaseReasoningLLMTests

* fix test rename

* docs update thinking / reasoning content docs
2025-04-11 19:17:09 -07:00
Ishaan Jaff f9ce754817 [Feat] Add litellm.supports_reasoning() util to track if an llm supports reasoning (#9923)
* add supports_reasoning for xai models

* add "supports_reasoning": true for o1 series models

* add supports_reasoning util

* add litellm.supports_reasoning

* add supports reasoning for claude 3-7 models

* add deepseek as supports reasoning

* test_supports_reasoning

* add supports reasoning to model group info

* add supports_reasoning

* docs supports reasoning

* fix supports_reasoning test

* "supports_reasoning": false,

* fix test

* supports_reasoning
2025-04-11 17:56:04 -07:00
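The commit above adds a `"supports_reasoning"` flag to model entries and a utility that reads it. The lookup can be sketched as below, assuming a small in-memory map; the `MODEL_INFO` entries are illustrative placeholders rather than the project's real model data, and this standalone function is not the library's own implementation.

```python
from typing import Dict

# Illustrative entries only; the flag values here are assumptions for the example.
MODEL_INFO: Dict[str, Dict[str, bool]] = {
    "o1": {"supports_reasoning": True},
    "xai/grok-3-mini-beta": {"supports_reasoning": True},
    "example-chat-model": {"supports_reasoning": False},
}


def supports_reasoning(model: str, model_info: Dict[str, Dict[str, bool]] = MODEL_INFO) -> bool:
    """Return True only when the model's entry explicitly sets the flag."""
    return bool(model_info.get(model, {}).get("supports_reasoning", False))
```

Unknown models and entries without the flag both resolve to `False`, so callers can branch on the result without handling missing keys.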