Marc Abramowitz
83c242bbb3
Add commented set_verbose line to proxy_config
...
because I've wanted to do this a couple of times and couldn't remember
the exact syntax.
2024-05-16 15:59:37 -07:00
Krrish Dholakia
54587db402
fix(alerting.py): fix datetime comparison logic
2024-05-14 22:10:09 -07:00
Ishaan Jaff
9bde3ccd1d
(ci/cd) fixes
2024-05-13 20:49:02 -07:00
Krrish Dholakia
99e8f0715e
test(test_end_users.py): fix end user region routing test
2024-05-11 22:42:43 -07:00
Ishaan Jaff
9c4f1ec3e5
fix - failing test_end_user_specific_region test
2024-05-11 17:05:37 -07:00
Ishaan Jaff
a4695c3010
test - using langfuse as a failure callback
2024-05-10 17:37:32 -07:00
Krrish Dholakia
3d18897d69
feat(router.py): enable filtering model group by 'allowed_model_region'
2024-05-08 22:10:17 -07:00
Ishaan Jaff
6a06aba443
(ci/cd) use db connection limit
2024-05-06 11:15:22 -07:00
Ishaan Jaff
e8d3dd475a
fix fake endpoint used on ci/cd
2024-05-06 10:37:39 -07:00
Ishaan Jaff
56a75ee7fe
(ci/cd) fix tests
2024-05-01 13:42:54 -07:00
Krrish Dholakia
d4bca6707b
ci(proxy_server_config.yaml): use redis for usage-based-routing-v2
2024-04-22 13:34:36 -07:00
Krrish Dholakia
1507b23e30
test(test_openai_endpoints.py): make test stricter
2024-04-20 12:11:54 -07:00
Krrish Dholakia
01a1a8f731
fix(caching.py): dual cache async_batch_get_cache fix + testing
...
this fixes a bug in usage-based-routing-v2 which was caused b/c of how the result was being returned from dual cache async_batch_get_cache. it also adds unit testing for that function (and it's sync equivalent)
2024-04-19 15:03:25 -07:00
Ishaan Jaff
adae555fb1
Merge branch 'main' into litellm_fix_using_wildcard_openai_models_proxy
2024-04-15 14:35:06 -07:00
Ishaan Jaff
6df5337e65
test - wildcard openai models on proxy
2024-04-15 14:05:26 -07:00
Ishaan Jaff
ecc6aa060f
test - team based logging on proxy
2024-04-15 13:26:55 -07:00
Krrish Dholakia
ea1574c160
test(test_openai_endpoints.py): add concurrency testing for user defined rate limits on proxy
2024-04-12 18:56:13 -07:00
Krrish Dholakia
74aa230eac
fix(main.py): automatically infer mode for text completion models
2024-04-12 14:16:21 -07:00
Krrish Dholakia
3665b890f8
build(proxy_server_config.yaml): cleanup config
2024-04-11 20:20:09 -07:00
Krrish Dholakia
bdfb74f8a5
test(test_openai_endpoints.py): add local test, for proxy concurrency
2024-04-11 17:16:23 -07:00
Krrish Dholakia
695eb129ad
fix(router.py): fix client init for streaming timeouts
2024-04-05 12:30:15 -07:00
Krrish Dholakia
9fe1c26e0d
build(proxy_server_config.yaml): add store_model_in_db flag
2024-04-03 23:20:41 -07:00
Krrish Dholakia
c0204310ee
fix(main.py): fix translation to text_completions format for async text completion calls
2024-03-30 09:02:51 -07:00
Ishaan Jaff
4d22fe0947
bump: version 1.34.4 → 1.34.5
2024-03-26 18:04:05 -07:00
Ishaan Jaff
da503eab18
Merge branch 'main' into litellm_remove_litellm_telemetry
2024-03-26 11:35:02 -07:00
Ishaan Jaff
b6af44c4dc
(fix) telemetry = false
2024-03-26 11:23:23 -07:00
Krrish Dholakia
49e8cdbff9
fix(router.py): check for context window error when handling 400 status code errors
...
was causing proxy context window fallbacks to not work as expected
2024-03-26 08:08:15 -07:00
Krrish Dholakia
f588bff69b
fix(proxy_server.py): fix spend log update
2024-03-18 20:26:28 -07:00
Krish Dholakia
2c2f322d5a
Update proxy_server_config.yaml
2024-03-16 11:16:13 -07:00
Krish Dholakia
f3cf1ec71f
Update proxy_server_config.yaml
2024-03-15 22:40:39 -07:00
Krish Dholakia
24e673dd6d
Update proxy_server_config.yaml
2024-03-15 22:19:37 -07:00
Krish Dholakia
961c3548f5
Update proxy_server_config.yaml
2024-03-15 19:06:06 -07:00
ishaan-jaff
d28145f916
(fix) fake openai endpoint
2024-03-13 15:07:50 -07:00
ishaan-jaff
acd8f7b48c
(fix) example fake openai endpoint
2024-03-13 15:06:44 -07:00
ishaan-jaff
788583a737
(ci/cd) use fake-openai-endpoint in docker builds
2024-03-13 14:57:41 -07:00
Krrish Dholakia
f683acda61
build: fix default config
2024-03-11 13:56:10 -07:00
Krrish Dholakia
1369e18e85
build: fix default config.yaml
2024-03-11 13:43:50 -07:00
ishaan-jaff
ab48edca78
(fix) default config users use on startup
2024-03-11 11:39:06 -07:00
ishaan-jaff
53c67d302a
(fix) fix default dockerfile startup
2024-03-11 11:26:21 -07:00
Krrish Dholakia
b9854a99d2
test: increase time before checking budget reset - avoid deadlocking
2024-03-06 22:16:59 -08:00
Krrish Dholakia
ad55f4dbb5
feat(proxy_server.py): retry if virtual key is rate limited
...
currently for chat completions
2024-03-05 19:00:03 -08:00
Krrish Dholakia
cd419eb8a5
test(test_keys.py): add more duration for testing budget updates
2024-03-05 15:35:13 -08:00
Krrish Dholakia
37ad5efc61
fix(proxy/utils.py): fix resetting budget logic
2024-03-02 20:52:54 -08:00
Krrish Dholakia
5265d9042d
test: testing fixes
2024-02-27 20:35:42 -08:00
ishaan-jaff
d07846646c
(ui) fix build command
2024-02-21 21:02:46 -08:00
Krrish Dholakia
ef0171e063
feat(utils.py): support cost tracking for openai/azure image gen models
2024-02-03 17:09:54 -08:00
Krrish Dholakia
09ec6d6458
fix(utils.py): fix sagemaker async logging for sync streaming
...
https://github.com/BerriAI/litellm/issues/1592
2024-01-25 12:49:45 -08:00
Krrish Dholakia
30d615f442
build(proxy_server_config.yaml): add proxy budget to default yaml
2024-01-24 20:12:03 -08:00
Krrish Dholakia
09b7235b31
fix: support info level logging on pkg + proxy
2024-01-20 17:45:47 -08:00
Krrish Dholakia
f5ced089d6
test(tests/): add unit testing for proxy server endpoints
2024-01-19 14:54:29 -08:00