Commit Graph

16053 Commits

Author SHA1 Message Date
Ishaan Jaff 369ddfb49e docs vertex context caching 2024-08-08 17:18:12 -07:00
Krish Dholakia e42a2a0a04 Merge pull request #5117 from BerriAI/litellm_auth_fix
fix(user_api_key_auth.py): Fix issue with key auth w/ user not in db
2024-08-08 17:14:23 -07:00
Krish Dholakia 4ef7743d71 Merge branch 'main' into litellm_auth_fix 2024-08-08 17:14:16 -07:00
Ishaan Jaff b58664232c Merge pull request #5114 from BerriAI/litellm_safe_get_file_checksum
[Fix-Bug]: Whisper is broken
2024-08-08 17:11:03 -07:00
Krrish Dholakia da0413f0a4 fix(factory.py): handle openai function message having tool call id 2024-08-08 17:06:50 -07:00
Ishaan Jaff 68fdf9de2d docs readme 2024-08-08 17:06:29 -07:00
Ishaan Jaff 84c05a57d6 docs use (LLM Gateway) in some places 2024-08-08 17:00:52 -07:00
Ishaan Jaff f179759672 docs vertex ai 2024-08-08 16:12:36 -07:00
Ishaan Jaff e671ae58e3 Merge pull request #5119 from BerriAI/litellm_add_gemini_context_caching_litellm
[Feat-Proxy] Add Support for VertexAI context caching
2024-08-08 16:08:58 -07:00
Ishaan Jaff 75769538e7 Merge pull request #5115 from BerriAI/litellm_handle_case_when_prom_svc_logger_none
fix handle case when service logger has no attribute prometheusService
2024-08-08 16:08:47 -07:00
Ishaan Jaff 1924f11b4c add default_vertex_config 2024-08-08 16:07:26 -07:00
Ishaan Jaff d78c38f8e7 docs vertex 2024-08-08 16:07:14 -07:00
Ishaan Jaff a3dd3a19fa docs cachedContent endpoint 2024-08-08 16:06:23 -07:00
Ishaan Jaff cae941f4c0 auto update the model in cachedContents request 2024-08-08 15:56:46 -07:00
Krrish Dholakia 1bf8af5cb1 fix(anthropic.py): fix translation from /v1/messages format to openai format 2024-08-08 15:55:01 -07:00
Ishaan Jaff 0d662368ac add native cachedContents endpoint 2024-08-08 15:32:07 -07:00
Krrish Dholakia 32f343b947 test: skip flaky langsmith tests 2024-08-08 13:43:44 -07:00
Krrish Dholakia b65a27035e test(test_langsmith.py): skip flaky test 2024-08-08 13:42:15 -07:00
Krrish Dholakia 5da4c27e8d fix(internal_user_endpoints.py): expose new 'internal_user_budget_duration' flag
Relevant to - https://github.com/BerriAI/litellm/issues/5106
2024-08-08 13:05:03 -07:00
Krrish Dholakia 169e8b541c test(test_langsmith.py): fix test 2024-08-08 12:53:19 -07:00
Ishaan Jaff 8ad5a40283 doc on using litellm proxy with vertex ai content caching 2024-08-08 11:45:46 -07:00
Ishaan Jaff 909977ace6 add example using litellm proxy with gemini context caching 2024-08-08 11:35:41 -07:00
Ishaan Jaff 7b6110bbb3 add tests to make sure correct vertex ai route is used 2024-08-08 11:28:19 -07:00
Ishaan Jaff d237e625ca use v1beta1 when using cached_content 2024-08-08 11:09:31 -07:00
Krrish Dholakia 5703da9b42 fix(user_api_key_auth.py): Fixes https://github.com/BerriAI/litellm/issues/5111 v1.43.3-dev1 2024-08-08 10:30:15 -07:00
Krrish Dholakia 5d664c0441 build(model_prices_and_context_window.json): Fixes https://github.com/BerriAI/litellm/issues/5113 2024-08-08 09:11:59 -07:00
Krrish Dholakia ee8d2f25b9 build: ui - update to include max budget per team 2024-08-08 09:09:23 -07:00
Ishaan Jaff 7d4e834091 fix handle case when service logger has no attribute prometheusServicesLogger 2024-08-08 08:23:29 -07:00
Ishaan Jaff 68a36600c2 fix use get_file_check_sum 2024-08-08 08:03:08 -07:00
Krrish Dholakia 2710bec02d docs(scheduler.md): cleanup docs to use /chat/completion endpoint 2024-08-07 21:49:06 -07:00
Krrish Dholakia 381fc213f8 bump: version 1.43.2 → 1.43.3 v1.43.3 2024-08-07 20:33:10 -07:00
Krrish Dholakia a03860b1cf test(test_completion.py): fix merge error 2024-08-07 20:31:38 -07:00
Krish Dholakia e1610d37b9 Merge pull request #5099 from BerriAI/litellm_personal_user_budgets
fix(user_api_key_auth.py): respect team budgets over user budget, if key belongs to team
2024-08-07 20:00:16 -07:00
Krish Dholakia 37ef63f522 Merge pull request #5101 from BerriAI/litellm_router_prioritization
feat(router.py): allows /chat/completion endpoint to work for request prioritization calls
2024-08-07 20:00:06 -07:00
Krish Dholakia baf01b47d8 Merge branch 'main' into litellm_personal_user_budgets 2024-08-07 19:59:50 -07:00
Krrish Dholakia 7e1f296981 docs(self_serve.md): cleanup docs on how to onboard new users + teams 2024-08-07 19:58:36 -07:00
Krrish Dholakia ec0b511119 fix: use more descriptive flag 2024-08-07 18:59:46 -07:00
Krrish Dholakia b7e31638fd fix(internal_user_endpoints.py): respect 'max_user_budget' for new internal user's 2024-08-07 18:50:40 -07:00
Ishaan Jaff e17e959c62 image gen catch when predictions not in json response 2024-08-07 18:50:26 -07:00
Ishaan Jaff e1a78faf98 run that ci cd again 2024-08-07 18:47:11 -07:00
Krrish Dholakia b4392d2b67 test(test_amazing_vertex_completion.py): fix test for json schema validation in openai schema v1.43.2 2024-08-07 18:39:20 -07:00
Ishaan Jaff b571f3e115 run that ci/cd again 2024-08-07 18:25:52 -07:00
Krrish Dholakia 4e02425d8b fix(vertex_ai_partner.py): pass model for llama3 param mapping 2024-08-07 18:16:41 -07:00
Ishaan Jaff 75bfe81748 fix all optional param tests 2024-08-07 18:16:41 -07:00
Ishaan Jaff 472a1df76f fix test for wildcard routing 2024-08-07 18:16:41 -07:00
Ishaan Jaff 4c375d13d6 fix test_drop_params_parallel_tool_calls 2024-08-07 18:16:41 -07:00
Krrish Dholakia a15317a377 fix(vertex_ai_partner.py): pass model for llama3 param mapping 2024-08-07 18:07:14 -07:00
Ishaan Jaff 4e90a25417 fix all optional param tests 2024-08-07 17:52:40 -07:00
Krrish Dholakia 400653992c feat(router.py): allow using .acompletion() for request prioritization
allows /chat/completion endpoint to work for request prioritization calls
2024-08-07 16:43:12 -07:00
Ishaan Jaff 6b03998e32 fix test for wildcard routing 2024-08-07 16:41:00 -07:00