Commit Graph

3793 Commits

Author SHA1 Message Date
Ishaan Jaff 7da4e2197d Benefits of using gemini-cli with LiteLLM 2025-06-28 21:59:07 -07:00
Ishaan Jaff 50dd9980dc docs - update release notes 2025-06-28 21:56:58 -07:00
Krish Dholakia f7af8902b0 /v1/messages - Remove hardcoded model name on streaming + Tags - enable setting custom header tags (#12131)
* fix(anthropic/experimental_pass_through): use given model name when returning streaming chunks

don't harcode model name on streaming

confusing for user

* fix(anthropic/streaming_iterator.py): remove scope of import

* feat(litellm_logging.py): allow admin to specify additional headers for using as spend tags

Closes https://github.com/BerriAI/litellm/issues/12129

* test(test_litellm_logging.py): add unit tests

* feat(openweb_ui.md): add custom tag tutorial to docs

* docs(cost_tracking.md): add tag based usage UI screenshot

* test: update test

* fix: fix import
2025-06-28 21:49:35 -07:00
Krrish Dholakia 123631c93d docs(index.md): update release note with cleaner table for updated models 2025-06-28 21:48:24 -07:00
Krrish Dholakia e77eb82cd6 docs(index.md): initial pre-release note 2025-06-28 13:24:19 -07:00
Guanghui Liu 27849359a9 fix example config.yaml in claude code tutorial (#12133) 2025-06-28 11:07:48 -07:00
Ishaan Jaff 8b73716a66 add docs 2025-06-28 09:26:12 -07:00
Youfu Zhang 1c68c24358 introduce new environment variable NO_REDOC to opt-out Redoc (#12092)
Signed-off-by: Youfu Zhang <zhangyoufu@gmail.com>
2025-06-27 21:26:37 -07:00
Ishaan Jaff 7465f062c0 add panw guard docs 2025-06-27 21:13:04 -07:00
Ishaan Jaff bb98f68451 docs fix 2025-06-27 21:09:33 -07:00
ohmeow 9ab7d97b85 adds model to metadata to estimated cost is calculated; corrects how to pass metadata in braintrust in the docs (#12022) 2025-06-27 18:11:07 -07:00
Ishaan Jaff ebf6395bc1 [Feat] Add Eleven Labs - Speech To Text Support on LiteLLM (#12119)
* add ELEVENLABS as a provider

* add deepgram to main.py

* add ElevenLabsException

* add ElevenLabsAudioTranscriptionConfig

* add transform_audio_transcription_response

* TestElevenLabsAudioTranscription

* add elevenlabs/scribe_v1 to model cost map

* add ElevenLabsAudioTranscriptionConfig

* add AudioTranscriptionRequestData

* add ElevenLabs transform

* use AudioTranscriptionRequestData

* refactoring fixes

* add ProcessedAudioFile util for reading audio files

* test_elevenlabs_diarize_parameter_passthrough

* docs eleven labs

* docs fixes

* fix code qa checks

* fixes - audio transcription

* ui - add ElevenLabs logo

* add elevenlabs logo

* docs - ElevenLabs

* test fix elevenlabs
2025-06-27 17:50:49 -07:00
Jason Roberts b57cfb8bff Adding Feature: Palo Alto Networks Prisma AIRS Guardrail (#12116)
* feat: Add PANW Prisma AIRS guardrail integration

- Add PANW_PRISMA_AIRS to SupportedGuardrailIntegrations enum
- Update guardrail registry and initializers
- Add complete documentation with curl examples and response formats
- Support pre_call, post_call, and during_call modes
- Include fail-safe error handling and comprehensive logging
- Integration with official PANW Prisma AIRS API

* feat: Add PANW Prisma AIRS guardrail integration

- Update to test file
- Fail-closed security on API errors

* update fail closed behavior

* fix: Update PANW Prisma AIRS guardrail integration pattern

* fix: Remove unused import

* fix: addressed MyPy error
2025-06-27 15:17:36 -07:00
Ishaan Jaff f85cd9c54a [Docs] - Show how to use fallbacks with audio transcriptions endpoints (#12115)
* add overview

* docs audio transcriptions

* docs on fallbacks with audio endpoints
2025-06-27 12:43:55 -07:00
Ishaan Jaff f1c7024e70 [Feat] Add Bridge from generateContent <> /chat/completions (#12081)
* add GenerateContentToCompletionHandler

* working - non streaming bridge

* add GoogleGenAIAdapter

* add google gen ai adapter

* working streaming bridge

* working streaming usage for adapter

* tool calling transform for generate content to openai

* fixes for accumulating tool calls

* fix code qa checks

* Best Practices for Production

* fix code qa checks

* test_streaming_partial_tool_calls_accumulation

* linting fixes

* add supported_openai_chat_completion_params

* fix translate_generate_content_to_completion

* test_google_genai_adapter.py
2025-06-27 11:08:55 -07:00
Cole McIntosh 60bf3c36bf fix(docs): Remove unused dotenv dependency from docusaurus config (#12102)
* fix(docs): Remove unused dotenv dependency from docusaurus config

The dotenv package was being required in docusaurus.config.js but was listed as
a devDependency, causing build failures. Since no environment variables are
actually used in the config, removed the unnecessary import.

* fix(docs): Remove reference to non-existent spending_monitoring doc

The sidebars.js file was referencing proxy/spending_monitoring which was deleted
in commit ba7463b9c. This was causing the documentation build to fail with missing
document errors.
2025-06-27 08:20:44 -07:00
Ishaan Jaff ba7463b9c6 docs - spend monitoring 2025-06-26 17:39:13 -07:00
Ishaan Jaff edf416c24f docs - Use LiteLLM with Gemini CLI 2025-06-26 17:31:19 -07:00
Ishaan Jaff 72ecac8962 docs - user agent 2025-06-26 16:23:59 -07:00
Ishaan Jaff 16126f0d6b docs Gemini Text-to-Speech 2025-06-26 15:59:48 -07:00
Ishaan Jaff 80acf6bc97 docs */speech to /chat/completions Bridge** 2025-06-26 15:53:25 -07:00
Ishaan Jaff fc39343599 docs update 2025-06-26 15:32:33 -07:00
Ishaan Jaff e556071544 docs - Use LiteLLM with Gemini CLI 2025-06-26 13:35:33 -07:00
tanjiro bd0a5f7387 Inkeep searchbar and chat added to the Docs (#12030)
* inkeep chat added

* add prod key
2025-06-26 10:13:42 -07:00
Ishaan Jaff 22ff3da3cf [Fix] Allow using HTTP_ Proxy settings with trust_env (#12066)
* allow using trust_env

* add docs on how to use HTTP_PROXY

* docs AIOHTTP_TRUST_ENV

* test_aiohttp_transport_trust_env_setting

* docs fix
2025-06-26 08:37:22 -07:00
Cole McIntosh 327405ffae Merge pull request #12050 from colesmcintosh/docs/elasticsearch-logging-tutorial
Fix Elasticsearch tutorial image rendering
2025-06-26 09:00:00 -06:00
Ishaan Jaff f8f9e2299b docs latency headers 2025-06-26 07:58:23 -07:00
Ishaan Jaff 329f8ec7a6 docs gemini cli 2025-06-25 22:39:50 -07:00
Ishaan Jaff bca94535da docs gemini cli 2025-06-25 22:31:47 -07:00
Cole McIntosh f9ef45474b Fix Elasticsearch tutorial image rendering
- Replace markdown image syntax with React Image component
- Import @theme/IdealImage for proper image rendering in Docusaurus
- Addresses PR feedback to fix image display issues
2025-06-25 16:27:14 -06:00
Cole McIntosh 51c1c7bd36 Add Elasticsearch Logging Tutorial (#11761)
* docs: add Elasticsearch logging tutorial and update sidebar

* docs: update Elasticsearch logging tutorial to include OpenTelemetry setup and configuration

* docs: remove sections from Elasticsearch logging tutorial

* docs: remove analytics examples from Elasticsearch logging tutorial

* Update Elasticsearch version and logging exporter configuration in the Elasticsearch logging tutorial

* Add visualization instructions for LLM telemetry data in Kibana to Elasticsearch logging tutorial

* Add Elasticsearch demo image to documentation

* Move demo image for Elasticsearch logging tutorial
2025-06-25 15:17:37 -07:00
Cole McIntosh 9111b67950 Move demo image for Elasticsearch logging tutorial 2025-06-25 16:14:27 -06:00
Cole McIntosh b3aa8c980e Add Elasticsearch demo image to documentation 2025-06-25 16:11:41 -06:00
Cole McIntosh c218b62822 Add visualization instructions for LLM telemetry data in Kibana to Elasticsearch logging tutorial 2025-06-25 16:11:32 -06:00
Cole McIntosh d7568a37f0 Merge branch 'BerriAI:main' into docs/elasticsearch-logging-tutorial 2025-06-25 15:42:18 -06:00
Krish Dholakia 24c2cd1bd9 Anthropic /v1/messages - Custom LLM Server support (#12016)
* fix(handler.py): support routing custom llm's to chat completion handler

Adds custom llm support for anthropic

* test(test_anthropic_experimental_pass_through_messages_handler.py): add unit test confirming custom llm respected

* docs(custom_llm_server.md): document anthropic custom llm translation

* test(volcengine.py): map thinking in extra body

Fixes https://github.com/BerriAI/litellm/issues/11879

* feat(main.py): support `azure/responses/<deployment-name>` model string

this allows us to route the model correctly

Closes https://github.com/BerriAI/litellm/issues/11879

* docs(azure_responses.md): document calling azure responses api models via chat completions bridge

Closes https://github.com/BerriAI/litellm/issues/11917

* fix: fix custom provider check

* test: update tests
2025-06-24 22:00:44 -07:00
Ishaan Jaff d6cc384780 [Feat] OpenAI/Azure OpenAI - Add support for creating vector stores on LiteLLM (#12021)
* add create/acreate vector store

* add azure config

* add _base_validate_azure_environment

* fix base test

* add get_base_create_vector_store_args

* use base llm for headers responses api

* add _get_base_azure_url

* fix AzureOpenAIVectorStoreConfig

* TestAzureOpenAIVectorStore

* fix azure openai vector store

* fix test comment

* fix unused imports

* test_validate_environment_azure_api_key_within_secret_str

* test_azure_transformation.py
2025-06-24 20:46:48 -07:00
Ishaan Jaff 2bb8048864 [Feat] Add OpenAI Search Vector Store Operation (#12018)
* add BaseVectorStoreTransformation

* fix BaseVectorStoreTransformation

* add OpenAIVectorStoreTransformation

* fix transform

* add search, asearch vector stores

* add skeleton for vector store searching

* fix VectorStoreSearchOptionalRequestParams

* fix VectorStoreRequestUtils

* fix litellm.asearch/litellm.search

* fix BaseVectorStoreConfig

* add vector_store_search_handler to llm http handler

* use llm http handler for searching vector stores

* fix base vector store config

* fix vector_store_search_handler

* async_vector_store_search_handler

* add conftest

* add BaseVectorStoreTest

* move litellm.integrations.vector_store_integrations

* fix working OAI OpenAIVectorStoreConfig

* add Search vector store

* add OpenAI Vector Stores
2025-06-24 15:52:43 -07:00
Krrish Dholakia 8da22be199 docs(self_serve.md): update doc 2025-06-24 13:25:17 -07:00
Krrish Dholakia c9a8198d12 docs(self_serve.md): clarify team must be created before setting as default team 2025-06-24 13:24:23 -07:00
Krrish Dholakia 330cd4f2a9 docs(self_serve.md): update self serve docs to include default team / member budget information 2025-06-24 09:31:33 -07:00
Krish Dholakia a89397a798 Litellm dev 06 23 2025 p1 (#11989)
* fix(litellm_logging.py): fix using router model id for logging calls

Fixes https://github.com/BerriAI/litellm/issues/11975#issuecomment-2995882238

* test(test_litellm_logging.py): add unit test for custom price tracking

* fix(vertex_ai/): don't send invalid format parameter to vertex

causes calls to fail

* fix(vertex_ai_context_caching.py): if cached content present and tools in message, cache tools as well

gemini throws errors if tools passed in alongside cached content

* test: add unit tests

* fix: fix linting errors

* test: test_vertex_ai_common_utils.py

update test

* fix(streaming_handler.py): unset response cost when creating model response
2025-06-23 22:33:06 -07:00
Ishaan Jaff eec0e4d2be docs - web search 2025-06-23 21:25:35 -07:00
Ishaan Jaff b5c48c8c22 [Feat] Add List Callbacks API Endpoint (#11987)
* add get_callbacks_by_type

* add list_callbacks

* fix _get_callback_string

* add callback_management_endpoints_router

* fix proxy config.yaml

* fixes list callbacks

* TestCallbackManagementEndpoints

* update docs

* docs Response Fields

* docs header format

* docs Dynamic Callback Management
2025-06-23 15:34:25 -07:00
Ishaan Jaff 8c5fb6f539 [Feat] Enterprise - Allow dynamically disabling callbacks in request headers (#11985)
* Add support for disabling callbacks via x-litellm-disable-callbacks header

* add _is_callback_disabled_via_headers

* add get_proxy_server_request_headers

* _is_callback_disabled_via_headers

* X_LITELLM_DISABLE_CALLBACKS

* add EnterpriseCallbackControls

* use EnterpriseCallbackControls

* use CustomLoggerRegistry

* use CustomLoggerRegistry

* CustomLoggerRegistry

* EnterpriseCallbackControls

* TestEnterpriseCallbackControls

* docs clean up

* docs dynamic callbacks

* doc fixes

* fix code qa checks

* fix CustomLoggerRegistry

---------

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-06-23 14:32:05 -07:00
Mukesh Sai Kumar a44ff7c486 Fix markdown table not rendering properly (#11969) 2025-06-23 09:28:51 -07:00
Ishaan Jaff 05b95eb702 docs - Recommended Machine Specifications (#11980) 2025-06-23 08:26:29 -07:00
Cole McIntosh 3c22d52d6c Merge branch 'BerriAI:main' into docs/elasticsearch-logging-tutorial 2025-06-22 12:51:26 -06:00
Ishaan Jaff b36dbb870c fix ghcr 2025-06-22 09:06:58 -07:00
Ishaan Jaff 9e5c97aabd fix pip pkg 2025-06-22 09:05:45 -07:00