litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-07-04 17:08:48 +00:00

Files

T

Arseny Boykov f4318bccd3 [Performance] Use _PROXY_MaxParallelRequestsHandler_v3 by default again (#14450 )

* Use _PROXY_MaxParallelRequestsHandler_v3 by default (#14352)

(cherry picked from commit f3fa45cf8fbd5f5cce2f45a7312776d5005fb08e)
(cherry picked from commit 5b680bb4a3)

* Use random api_key for parallel requests test

* Fix off-by-one error in parallel request rate limit

The rate limiter was incorrectly rejecting requests when the limit was met, but not exceeded. The check in `is_cache_list_over_limit` was `int(counter_value) + 1 > current_limit`, which caused the first request to be rejected if the limit was 1.

This commit removes the `+ 1`, changing the logic to `int(counter_value) > current_limit`. The check now correctly allows requests up to the specified parallel limit.

* Test actual parallel requests

* Ensure rate limiting works correctly for multiple users

* Add sequential rate-limit test

* Revert random key usage

2025-09-12 17:33:55 -07:00

.litellm_cache

…

auto_router

[Feat] Backend Router - Add Auto-Router powered by semantic-router (#12955 )

2025-07-24 18:32:56 -07:00

example_config_yaml

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_configs

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_model_response_typing

…

adroit-crow-413218-bc47f303efc9.json

vertex testing use pathrise-convert-1606954137718

2025-01-05 14:00:17 -08:00

azure_fine_tune.jsonl

…

batch_job_results_furniture.jsonl

…

cache_unit_tests.py

(code refactor) - Add BaseRerankConfig. Use BaseRerankConfig for cohere/rerank and azure_ai/rerank (#7319 )

2024-12-19 17:03:34 -08:00

conftest.py

[Perf] Improvements for Async Success Handler (Logging Callbacks) - Approx +130 RPS (#13905 )

2025-08-23 13:13:23 -07:00

create_mock_standard_logging_payload.py

[Bug Fix]: Errors in LiteLLM When Using Embeddings Model with Usage-Based Routing (#7390 )

2024-12-23 17:42:24 -08:00

data_map.txt

…

eagle.wav

…

example.jsonl

VertexAI non-jsonl file storage support (#9781 )

2025-04-09 14:01:48 -07:00

gettysburg.wav

…

large_text.py

…

model_cost.json

…

openai_batch_completions_router.jsonl

…

openai_batch_completions.jsonl

…

speech_vertex.mp3

…

stream_chunk_testdata.py

…

test_acompletion_fallbacks.py

(core sdk fix) - fix fallbacks stuck in infinite loop (#7751 )

2025-01-13 19:34:34 -08:00

test_acompletion.py

Complete o3 model support (#8183 )

2025-02-02 22:36:37 -08:00

test_acooldowns_router.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_add_function_to_prompt.py

…

test_add_update_models.py

Allow team admins to add/update/delete models on UI + show api base and model id on request logs (#9572 )

2025-03-27 12:06:31 -07:00

test_aim_guardrails.py

migrate to use new aim FW API

2025-08-19 12:20:19 +03:00

test_alangfuse.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_amazing_vertex_completion.py

test fix

2025-09-03 11:06:09 -07:00

test_anthropic_prompt_caching.py

LiteLLM Minor Fixes & Improvements (01/16/2025) - p2 (#7828 )

2025-02-02 23:17:50 -08:00

test_arize_ai.py

Merge branch 'main' into litellm_arize_dynamic_logging

2025-03-18 22:13:35 -07:00

test_arize_phoenix.py

fix arize config tests

2025-05-13 20:21:14 -07:00

test_assistants.py

test_create_delete_assistants

2025-07-15 21:35:25 -07:00

test_async_fn.py

test_text_completion_stream - hf

2025-07-03 16:00:51 -07:00

test_audio_speech.py

feat(speech/): working gemini tts support via openai's /v1/speech endpoint (#11832 )

2025-06-18 10:36:25 -07:00

test_auth_utils.py

User Headers X LiteLLM Users Mapping feature (#14485 )

2025-09-12 11:49:37 -07:00

test_azure_content_safety.py

…

test_azure_openai.py

test: fixes

2025-05-31 12:42:56 -07:00

test_azure_perf.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_bad_params.py

test_completion_invalid_param_cohere

2025-04-02 06:49:11 -07:00

test_basic_python_version.py

[MCP Gateway] Litellm mcp client list fail (#13114 )

2025-07-30 15:23:19 -07:00

test_batch_completion_return_exceptions.py

…

test_batch_completions.py

test fix: gcp deprecated gemini-1.5-flash

2025-08-06 08:43:45 -07:00

test_blocked_user_list.py

…

test_braintrust.py

[Performance] Improve LiteLLM Python SDK RPS by +200 RPS (#13839 )

2025-08-20 21:46:33 -07:00

test_budget_manager.py

…

test_caching_handler.py

test_async_log_cache_hit_on_callbacks

2025-09-08 17:15:53 -07:00

test_caching_ssl.py

test: update tests

2025-05-20 13:08:47 -07:00

test_caching.py

test: remove eol bedrock model from tests

2025-09-09 19:48:35 -07:00

test_class.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_completion_cost.py

test: remove end of life model from tests

2025-09-09 21:01:45 -07:00

test_completion_with_retries.py

fix(main.py): fix retries being multiplied when using openai sdk (#7221 )

2024-12-14 11:56:55 -08:00

test_completion.py

test: remove end of life model from tests

2025-09-09 21:01:45 -07:00

test_config.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_cost_calc.py

test(test_cost_calc.py): fix test to handle llm api errors

2024-12-24 16:49:02 -08:00

test_custom_api_logger.py

…

test_custom_callback_input.py

test: remove eol bedrock model from tests

2025-09-09 19:48:35 -07:00

test_custom_llm.py

test: update test with new kwargs

2025-06-11 22:19:17 -07:00

test_custom_logger.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_disk_cache_unit_tests.py

…

test_dual_cache.py

(code refactor) - Add BaseRerankConfig. Use BaseRerankConfig for cohere/rerank and azure_ai/rerank (#7319 )

2024-12-19 17:03:34 -08:00

test_dynamic_rate_limit_handler.py

…

test_dynamodb_logs.py

…

test_embedding.py

feat(JinaAI): support multimodal embedding models (#13181 )

2025-08-05 19:21:56 -07:00

test_exceptions.py

Revert "Litellm dev 07 21 2025 p1 (#12848 )"

2025-07-22 18:28:36 -07:00

test_file_types.py

…

test_function_call_parsing.py

…

test_function_calling.py

test fix

2025-09-06 17:08:31 -07:00

test_function_setup.py

…

test_gcs_bucket.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_gcs_cache_unit_tests.py

Add GCS bucket caching support (#13122 )

2025-08-04 16:09:33 -07:00

test_get_llm_provider.py

test_default_api_base

2025-07-04 18:26:54 -07:00

test_get_model_file.py

…

test_get_model_info.py

test whitelisted models

2025-06-28 14:46:16 -07:00

test_get_optional_params_embeddings.py

…

test_get_optional_params_functions_not_supported.py

…

test_google_ai_studio_gemini.py

…

test_guardrails_ai.py

…

test_health_check.py

test: add test_audio_speech_health_check_with_another_voice

2025-05-27 10:20:00 +02:00

test_helicone_integration.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_http_parsing_utils.py

2025-07-10 18:20:41 -07:00

test_img_resize.py

fix: Support WebP image format and avoid token calculation error (#7182 )

2024-12-12 14:32:39 -08:00

test_lakera_ai_prompt_injection.py

Merge pull request #9222 from BerriAI/litellm_snowflake_pr_mar_13

2025-03-13 21:35:39 -07:00

test_langchain_ChatLiteLLM.py

…

test_langsmith.py

…

test_least_busy_routing.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_litellm_max_budget.py

…

test_literalai.py

…

test_llm_guard.py

[Refactor] Move LLM Guard, Secret Detection to Enterprise Pip packagea (#10782 )

2025-05-13 09:42:22 -07:00

test_load_test_router_s3.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_loadtest_router.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_logfire.py

…

test_logging.py

…

test_longer_context_fallback.py

…

test_lowest_cost_routing.py

test fix

2025-09-01 17:04:47 -07:00

test_lowest_latency_routing.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_lunary.py

…

test_max_tpm_rpm_limiter.py

…

test_mem_leak.py

…

test_mem_usage.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_mock_request.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_model_alias_map.py

test_model_alias_map

2025-09-01 17:59:40 -07:00

test_model_max_token_adjust.py

…

test_multiple_deployments.py

…

test_ollama_local_chat.py

…

test_ollama_local.py

…

test_ollama.py

Ensure consistent 'created' across all chunks + set tool call id for ollama streaming calls (#11528 )

2025-06-07 20:50:07 -07:00

test_openai_moderations_hook.py

…

test_opik.py

…

test_pass_through_endpoints.py

[Performance] Use _PROXY_MaxParallelRequestsHandler_v3 by default again (#14450 )

2025-09-12 17:33:55 -07:00

test_profiling_router.py

…

test_prometheus_service.py

Embedding caching fixes - handle str -> list cache, set usage tokens for cache hits, combine usage tokens on partial cache hits (#10424 )

2025-04-29 21:21:28 -07:00

test_prompt_caching.py

…

test_prompt_injection_detection.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_promptlayer_integration.py

…

test_provider_specific_config.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_pydantic_namespaces.py

…

test_pydantic.py

…

test_register_model.py

…

test_router_auto_router.py

test_router_auto_router

2025-07-26 13:33:53 -07:00

test_router_batch_completion.py

test fix

2025-09-01 17:04:47 -07:00

test_router_budget_limiter.py

test_provider_budgets_e2e_test_expect_to_fail

2025-07-19 16:00:25 -07:00

test_router_caching.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_client_init.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_cooldown_handlers.py

test

2025-09-06 16:38:43 -07:00

test_router_custom_routing.py

…

test_router_debug_logs.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_fallback_handlers.py

(Feat) - return x-litellm-attempted-fallbacks in responses from litellm proxy (#8558 )

2025-02-15 14:54:23 -08:00

test_router_fallbacks.py

ci/cd new release

2025-07-23 13:50:36 -07:00

test_router_get_deployments.py

set flaky tests as flaky

2025-06-14 13:51:52 -07:00

test_router_init.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_max_parallel_requests.py

…

test_router_pattern_matching.py

(code quality) run ruff rule to ban unused imports (#7313 )

2024-12-19 12:33:42 -08:00

test_router_policy_violation.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_retries.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_timeout.py

test: remove end of life model from tests

2025-09-09 21:01:45 -07:00

test_router_utils.py

Router - reduce p99 latency w/ redis enabled by 50% + OTEL - track pre_call hook latency (#13362 )

2025-08-09 16:09:51 -07:00

test_router_with_fallbacks.py

…

test_router.py

test: remove redundant test

2025-09-09 20:37:09 -07:00

test_rules.py

…

test_sagemaker.py

test: mock sagemaker tests

2025-03-21 16:21:18 -07:00

test_scheduler.py

…

test_secret_detect_hook.py

[Refactor] Move LLM Guard, Secret Detection to Enterprise Pip packagea (#10782 )

2025-05-13 09:42:22 -07:00

test_simple_shuffle.py

…

test_spend_calculate_endpoint.py

test fix

2025-09-01 17:04:47 -07:00

test_stream_chunk_builder.py

test_stream_chunk_builder_litellm_usage_chunks

2025-08-07 15:22:52 -07:00

test_streaming.py

test: remove end of life model from tests

2025-09-09 21:01:45 -07:00

test_supabase_integration.py

…

test_team_config.py

…

test_text_completion.py

[LLM Translation] Fix Realtime API endpoint for no intent (#13476 )

2025-08-14 16:24:14 -07:00

test_timeout.py

test: remove end of life model from tests

2025-09-09 21:01:45 -07:00

test_together_ai.py

…

test_tpm_rpm_routing_v2.py

test: update test

2025-08-13 23:09:18 -07:00

test_traceloop.py

test: skip redundant test

2025-02-10 22:13:58 -08:00

test_ui_sso_helper_utils.py

…

test_unit_test_caching.py

(Bug fix) - don't log messages in model_parameters in StandardLoggingPayload (#8932 )

2025-03-01 13:39:45 -08:00

test_update_spend.py

test_batch_update_spend

2025-04-01 07:12:29 -07:00

test_validate_environment.py

…

test_wandb.py

…

test_whisper.py

[TECH] fix TU 2

2025-07-08 16:36:11 +02:00

user_cost.json

…

vertex_ai.jsonl

…

vertex_batch_completions.jsonl

…

vertex_key.json

ci/cd update vertex acct

2025-01-05 13:43:32 -08:00

whitelisted_bedrock_models.txt

Add supports_pdf_input: true to Claude 3.7 bedrock models (#9917 )

2025-05-01 14:56:54 -07:00