Commit Graph

24590 Commits

Author SHA1 Message Date
Ishaan Jaff 1237be04a5 test_aaamodel_prices_and_context_window_json_is_valid 2025-09-04 07:58:20 -07:00
Krish Dholakia 572ac0b88f Merge pull request #14211 from zhxlp/main
fix: image_generation supports extra_body parameter
2025-09-04 07:35:53 -07:00
Krish Dholakia cb8e3936ae Merge pull request #14216 from mubashir1osmani/fix_all_docs
Fix custom callbacks doc
2025-09-04 07:35:38 -07:00
Krish Dholakia 8738f630ac Merge pull request #14232 from mubashir1osmani/litellm_docs
[docs]: added more info to load balancing & pass through endpoints
2025-09-04 07:31:54 -07:00
Ishaan Jaff 8878951f16 docs fix: disable_add_user_agent_to_request_tags 2025-09-04 07:30:18 -07:00
Krish Dholakia a8cce5fbf3 Merge pull request #14237 from yeahyung/fix/tpm_limit_bug
Fixes #14204 TPM Rate Limit Bug
2025-09-04 07:29:47 -07:00
Krish Dholakia a74feb221f Merge pull request #14241 from 22mSqRi/fix-key-reset-and-expires
fix: Key Budget not resets at expectable times
2025-09-04 07:29:06 -07:00
22mSqRi 25afa67616 fix: Key Budget not resets at expectable times 2025-09-04 09:04:01 +00:00
yeahyung 7829de2948 (#14204) add test code 2025-09-04 16:41:29 +09:00
yeahyung 9e3010daa4 (#14204) increase token usage with TTL preservation 2025-09-04 16:41:23 +09:00
mubashir1osmani b0450d2ddf docs: added more info to load balancing & passthrough endpoints 2025-09-03 23:58:48 -04:00
Ishaan Jaff 69d5a91e02 bump: version 1.76.2 → 1.76.3 2025-09-03 18:28:52 -07:00
Ishaan Jaff ab3cd5e96e fix memory_usage_in_mem_cache cache endpoint vulnerability (#14229) 2025-09-03 18:28:11 -07:00
Ishaan Jaff 23ae7170d1 [Feat] Allow using Veo Video Generation through LiteLLM Pass through routes (#14228)
* fix: add follow_redirects=True,

* test_pass_through_with_httpbin_redirect

* cook book veo video

* docs Veo Video Generation with Google AI Studio

* add veo-3.0-generate-preview cost tracking details

* track vertex_video_models
2025-09-03 18:25:43 -07:00
Ishaan Jaff be7c762882 add video_generation 2025-09-03 18:25:27 -07:00
Ishaan Jaff 19e2bab8c8 [Feat] Add Initial support for Bedrock Batches API (#14190)
* fix acreate_file with bedrock

* fix routing to bedrock batches api

* fix create_file

* working batch file upload

* fix batches API for file upload

* test: bedrock files and batches API

* add BaseBatchesConfig

* fix get_provider_batches_config

* transform bedrock batches

* fix run create batch through llm http handler

* test_async_file_and_batch

* main.batches creation

* fix: CommonBatchFilesUtils

* fix async_create_batch

* test_async_file_and_batch

* BedrockBatchesConfig

* fix ruff check

* ruff check fix

* fix docs ref
2025-09-03 17:19:58 -07:00
Ishaan Jaff beb300abae [Fix] SCIM - Bug fixes for handling SCIM Group Memberships (#14226)
* Feat: add better SCIM debugging

* fix _get_scim_member_display

* fix patch_group

* test_update_group_e2e

* test_get_scim_member_value
v1.76.2-nightly
2025-09-03 16:15:51 -07:00
Ishaan Jaff e127820bf1 fix proxy_logging_guardrails_model_info_tests 2025-09-03 16:10:30 -07:00
Ishaan Jaff 1d44a6e4a1 Feat: add better SCIM debugging (#14221) 2025-09-03 15:52:53 -07:00
Ishaan Jaff 057d6f5af6 fix: PartnerModelPrefixes 2025-09-03 13:03:04 -07:00
Ishaan Jaff c9f211f331 fix VertexAIPartnerModels 2025-09-03 13:01:47 -07:00
Ishaan Jaff 8e9352fce7 test fix 2025-09-03 11:06:09 -07:00
Ishaan Jaff 3b8409f469 bump: version 1.76.1 → 1.76.2 2025-09-03 11:00:07 -07:00
Ishaan Jaff 9eecff9d9a test_proxy_custom_logger.py::test_chat_completio 2025-09-03 10:57:10 -07:00
katsuhiro muto ca43514db4 [Feat] Support reasoning_effort in Groq (#14207)
* Support reasoning_effort in groq

* add test
2025-09-03 10:43:47 -07:00
Harshavardhan K bc84b411f4 Fix/remove deprecated cerebras gpt oss 20b (#14213)
* fix: remove deprecated cerebras/openai/gpt-oss-20b model from pricing files

- Remove cerebras/openai/gpt-oss-20b entry from model_prices_and_context_window.json
- Remove cerebras/openai/gpt-oss-20b entry from model_prices_and_context_window_backup.json
- Fixes GitHub issue #13972 where the deprecated model was still being treated as valid
- Only the 120b version is now supported by Cerebras

* fix: remove deprecated cerebras gpt-oss-20b
2025-09-03 09:46:08 -07:00
mubashir1osmani d1c376f93e fix custom callback docs 2025-09-03 12:45:42 -04:00
zhxlp 583df9ade0 fix: image_generation supports extra_body parameter 2025-09-03 23:27:28 +08:00
mubashir1osmani ecc607257b fixed prometheus metrics 2025-09-03 02:14:59 -04:00
Ishaan Jaff 63c4a30564 TestVertexAIGPTOSSTransformation 2025-09-02 23:07:52 -07:00
Krish Dholakia 35b501b8d5 Merge pull request #14176 from mjmendo/feat/alert_type_in_slack_message
feat: added alert type to alert messate to slack for easier handling
2025-09-02 23:05:41 -07:00
Krish Dholakia dc0d2a44a5 Merge pull request #14171 from kutsushitaneko/fix/14158_tool_call_id
fix(oci): Handle assistant messages with both content and tool_calls in OCI provider (#14158)
2025-09-02 23:05:04 -07:00
Krish Dholakia 6e852bcec6 Merge pull request #14169 from BerriAI/remove-filters-in-user-info
Remove table filter on user info page
2025-09-02 23:03:30 -07:00
Krish Dholakia c8b177963e Merge pull request #14191 from kayoch1n/fix-h11-model-name
Remove "/" or ":" from model name when being used as h11 header name
2025-09-02 23:02:45 -07:00
Krish Dholakia cfead05086 Merge pull request #14193 from zhxlp/main
fix: Log page parameter passing error
2025-09-02 23:01:17 -07:00
zhxlp 54cca0cc7e fix: Log page parameter passing error 2025-09-03 11:34:58 +08:00
kayoch1n 76555cad81 Format code 2025-09-03 11:20:52 +08:00
kayoch1n 1a97a80c51 Format code 2025-09-03 11:18:57 +08:00
kayoch1n ffbe5cd899 Add a testcase 2025-09-03 11:08:23 +08:00
Krish Dholakia 46655544a4 Merge pull request #14179 from zhirafovod/otel_logs_metrics 2025-09-02 19:50:40 -07:00
kayoch1n edf047b6bb Replace "/" with "-" in model name when being used as a h11 header name 2025-09-03 10:00:16 +08:00
Ishaan Jaff 212ca20edf docs fix 2025-09-02 15:19:35 -07:00
Ishaan Jaff 128d9a3488 [Feature]: Add header support for spend_logs_metadata (#14186)
* fix: allow settings spend_logs_metadata

* fix add_litellm_data_for_backend_llm_call

* fix: add add_litellm_metadata_from_request_headers

* fix add_litellm_metadata_from_request_headers

* test_add_litellm_metadata_from_request_headers

* add_litellm_metadata_from_request_headers

* docs Tracking Spend with custom metadata

* add_litellm_metadata_from_request_headers

* add_litellm_metadata_from_request_headers
2025-09-02 15:13:15 -07:00
Ishaan Jaff c821f1ddf1 [Feature]: Support GPT-OSS models on vertex ai (#14184)
* add VertexAIGPTOSSTransformation

* fix: optional_params

* fix: is_vertex_partner_model

* test_partner_models_httpx

* docs GPT oss docs

* test_vertex_ai_gpt_oss_reasoning_effort

* add vertex ai models
2025-09-02 14:15:26 -07:00
Ishaan Jaff 4b7c114c2a google-cloud-aiplatform 2025-09-02 14:15:00 -07:00
Ishaan Jaff 47c04a8a8c proxy_logging_guardrails_model_info_tests 2025-09-02 14:13:17 -07:00
Krrish Dholakia d282e7eaac fix: handle non-str case 2025-09-02 12:38:05 -07:00
Ishaan Jaff f0f84d6c5c refactor: BEDROCK_CONVERSE_MODELS 2025-09-02 12:12:21 -07:00
Sameer Kankute 4adfd18bc6 [Feat]Add support for safety_identifier parameter in chat.completions.create (#14174)
* Add support for safety_identifier parameter in chat.completions.create

* make sure param is getting actually passed to the raw api
2025-09-02 09:37:08 -07:00
Marcelo Mendoza db1b418b15 fix: update alert type formatting to handle enum names in Slack messages 2025-09-02 17:54:54 +02:00