litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-18 21:35:06 +00:00

Files

T

Krish Dholakia 9e35ca2010 Embedding caching fixes - handle str -> list cache, set usage tokens for cache hits, combine usage tokens on partial cache hits (#10424 )

* build(model_prices_and_context_window.json): add fireworks ai new 0-4b pricing tier

* build(model_prices_and_context_window.json): add more fireworks ai models

* test: update testing

* fix(caching_handler.py): handle str + list cache

Fixes issue on cache hits for embedding when initial cached input was str

* test(test_caching.py): add e2e test on caching with individual item and then list

* fix(caching_handler.py): set usage tokens for cache hits

enables token counting to work

* fix(caching_handler.py): combine usage between cached result and embedding response

Handles case of new input to embedding response

* fix: cleanup

* test: move to gpt-4o-new-test

* test: update test

2025-04-29 21:21:28 -07:00

.litellm_cache

…

example_config_yaml

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_configs

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_model_response_typing

LiteLLM Minor Fixes & Improvements (11/05/2024) (#6590 )

2024-11-07 04:17:05 +05:30

adroit-crow-413218-bc47f303efc9.json

vertex testing use pathrise-convert-1606954137718

2025-01-05 14:00:17 -08:00

azure_fine_tune.jsonl

…

batch_job_results_furniture.jsonl

…

cache_unit_tests.py

(code refactor) - Add BaseRerankConfig. Use BaseRerankConfig for cohere/rerank and azure_ai/rerank (#7319 )

2024-12-19 17:03:34 -08:00

conftest.py

ci(conftest.py): reset conftest.py for local_testing/ (#6657 )

2024-11-08 19:14:16 +05:30

create_mock_standard_logging_payload.py

[Bug Fix]: Errors in LiteLLM When Using Embeddings Model with Usage-Based Routing (#7390 )

2024-12-23 17:42:24 -08:00

data_map.txt

…

eagle.wav

…

example.jsonl

VertexAI non-jsonl file storage support (#9781 )

2025-04-09 14:01:48 -07:00

gettysburg.wav

…

large_text.py

…

messages_with_counts.py

…

model_cost.json

…

openai_batch_completions_router.jsonl

…

openai_batch_completions.jsonl

…

speech_vertex.mp3

…

stream_chunk_testdata.py

…

test_acompletion_fallbacks.py

(core sdk fix) - fix fallbacks stuck in infinite loop (#7751 )

2025-01-13 19:34:34 -08:00

test_acompletion.py

Complete o3 model support (#8183 )

2025-02-02 22:36:37 -08:00

test_acooldowns_router.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_add_function_to_prompt.py

LiteLLM Minor Fixes & Improvements (11/05/2024) (#6590 )

2024-11-07 04:17:05 +05:30

test_add_update_models.py

Allow team admins to add/update/delete models on UI + show api base and model id on request logs (#9572 )

2025-03-27 12:06:31 -07:00

test_aim_guardrails.py

Support post-call guards for stream and non-stream responses

2025-03-13 08:53:54 +02:00

test_alangfuse.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_amazing_vertex_completion.py

Handle more gemini tool calling edge cases + support bedrock 'stable-image-core' (#10351 )

2025-04-28 14:22:29 -07:00

test_anthropic_prompt_caching.py

LiteLLM Minor Fixes & Improvements (01/16/2025) - p2 (#7828 )

2025-02-02 23:17:50 -08:00

test_arize_ai.py

Merge branch 'main' into litellm_arize_dynamic_logging

2025-03-18 22:13:35 -07:00

test_arize_phoenix.py

Litellm contributor prs 02 24 2025 (#8775 )

2025-02-24 18:55:48 -08:00

test_assistants.py

test assistants fixes

2025-04-19 08:09:45 -07:00

test_async_fn.py

refactor(sagemaker/): separate chat + completion routes + make them b… (#7151 )

2024-12-10 19:40:05 -08:00

test_audio_speech.py

Litellm dev 2024 12 20 p1 (#7335 )

2024-12-20 21:22:31 -08:00

test_auth_utils.py

Ollama ssl verify = False + Spend Logs reliability fixes (#7931 )

2025-01-23 23:05:41 -08:00

test_azure_content_safety.py

(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 )

2024-10-14 16:34:01 +05:30

test_azure_openai.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_azure_perf.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_bad_params.py

test_completion_invalid_param_cohere

2025-04-02 06:49:11 -07:00

test_basic_python_version.py

Litellm dev 01 10 2025 p2 (#7679 )

2025-01-10 21:50:53 -08:00

test_batch_completion_return_exceptions.py

…

test_batch_completions.py

Litellm dev contributor prs 01 31 2025 (#8168 )

2025-02-01 09:05:20 -08:00

test_blocked_user_list.py

(docs) add docstrings for all /key, /user, /team, /customer endpoints (#6804 )

2024-11-18 19:44:06 -08:00

test_braintrust.py

Litellm dev 01 07 2025 p3 (#7635 )

2025-01-08 11:46:24 -08:00

test_budget_manager.py

Litellm ruff linting enforcement (#5992 )

2024-10-01 19:44:20 -04:00

test_caching_handler.py

test(test_caching_handler.py): move to in-memory cache - prevent redis flakiness from impacting ci/cd

2025-03-28 13:32:04 -07:00

test_caching_ssl.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_caching.py

Embedding caching fixes - handle str -> list cache, set usage tokens for cache hits, combine usage tokens on partial cache hits (#10424 )

2025-04-29 21:21:28 -07:00

test_class.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_completion_cost.py

Update fireworks ai pricing (#10425 )

2025-04-29 20:58:05 -07:00

test_completion_with_retries.py

fix(main.py): fix retries being multiplied when using openai sdk (#7221 )

2024-12-14 11:56:55 -08:00

test_completion.py

Embedding caching fixes - handle str -> list cache, set usage tokens for cache hits, combine usage tokens on partial cache hits (#10424 )

2025-04-29 21:21:28 -07:00

test_config.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_cost_calc.py

test(test_cost_calc.py): fix test to handle llm api errors

2024-12-24 16:49:02 -08:00

test_custom_api_logger.py

…

test_custom_callback_input.py

Update fireworks ai pricing (#10425 )

2025-04-29 20:58:05 -07:00

test_custom_callback_router.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_custom_llm.py

LiteLLM Minor Fixes & Improvements (01/10/2025) - p1 (#7670 )

2025-01-10 17:49:05 -08:00

test_custom_logger.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_disk_cache_unit_tests.py

LiteLLM Minor Fixes & Improvements (11/12/2024) (#6705 )

2024-11-12 22:50:51 +05:30

test_dual_cache.py

(code refactor) - Add BaseRerankConfig. Use BaseRerankConfig for cohere/rerank and azure_ai/rerank (#7319 )

2024-12-19 17:03:34 -08:00

test_dynamic_rate_limit_handler.py

LiteLLM Minor Fixes & Improvements (10/15/2024) (#6242 )

2024-10-16 07:32:06 -07:00

test_dynamodb_logs.py

…

test_embedding.py

test_embedding_response_ratelimit_headers

2025-04-11 17:54:54 -07:00

test_exceptions.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_file_types.py

…

test_function_call_parsing.py

…

test_function_calling.py

Litellm dev 04 05 2025 p2 (#9774 )

2025-04-07 21:02:52 -07:00

test_function_setup.py

…

test_gcs_bucket.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_get_llm_provider.py

test(test_get_llm_provider.py): cover scenario where xai not in model name

2025-03-18 11:04:59 -07:00

test_get_model_file.py

LiteLLM Minor Fixes & Improvements (10/05/2024) (#6083 )

2024-10-05 18:59:11 -04:00

test_get_model_info.py

test fix - output_cost_per_reasoning_token was added to model cost map

2025-04-19 10:02:25 -07:00

test_get_optional_params_embeddings.py

…

test_get_optional_params_functions_not_supported.py

…

test_google_ai_studio_gemini.py

…

test_guardrails_ai.py

LiteLLM Minor Fixes & Improvements (10/15/2024) (#6242 )

2024-10-16 07:32:06 -07:00

test_guardrails_config.py

feat(guardrails.py): return specific litellm params in /guardrails/list endpoint

2025-02-10 22:13:58 -08:00

test_health_check.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_helicone_integration.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_http_parsing_utils.py

(Bug fix) - reading /parsing request body when on hypercorn (#8734 )

2025-02-25 15:18:04 -08:00

test_img_resize.py

fix: Support WebP image format and avoid token calculation error (#7182 )

2024-12-12 14:32:39 -08:00

test_lakera_ai_prompt_injection.py

Merge pull request #9222 from BerriAI/litellm_snowflake_pr_mar_13

2025-03-13 21:35:39 -07:00

test_langchain_ChatLiteLLM.py

…

test_langsmith.py

Litellm dev 11 30 2024 (#6974 )

2024-12-02 21:03:33 -08:00

test_least_busy_routing.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_litellm_max_budget.py

…

test_literalai.py

Litellm Minor Fixes & Improvements (10/03/2024) (#6049 )

2024-10-03 18:02:28 -04:00

test_llm_guard.py

(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 )

2024-10-14 16:34:01 +05:30

test_load_test_router_s3.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_loadtest_router.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_logfire.py

…

test_logging.py

LiteLLM Minor Fixes & Improvements (11/05/2024) (#6590 )

2024-11-07 04:17:05 +05:30

test_longer_context_fallback.py

…

test_lowest_cost_routing.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_lowest_latency_routing.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_lunary.py

…

test_max_tpm_rpm_limiter.py

(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 )

2024-10-14 16:34:01 +05:30

test_mem_leak.py

LiteLLM Minor Fixes & Improvements (10/30/2024) (#6519 )

2024-11-02 00:44:32 +05:30

test_mem_usage.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_mock_request.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_model_alias_map.py

test: fix test

2025-04-16 07:57:10 -07:00

test_model_max_token_adjust.py

…

test_multiple_deployments.py

…

test_ollama_local_chat.py

…

test_ollama_local.py

…

test_ollama.py

Litellm dev 03 08 2025 p3 (#9089 )

2025-03-09 18:20:56 -07:00

test_openai_moderations_hook.py

(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 )

2024-10-14 16:34:01 +05:30

test_opik.py

[Feat] Observability integration - Opik by Comet (#6062 )

2024-10-10 18:27:50 +05:30

test_parallel_request_limiter.py

Add all /key/generate api params to UI + add metadata fields on team AND org add/update (#8667 )

2025-02-19 21:13:06 -08:00

test_pass_through_endpoints.py

oops

2025-03-11 08:27:36 -04:00

test_presidio_masking.py

Litellm dev 12 06 2024 (#7067 )

2024-12-06 22:44:18 -08:00

test_profiling_router.py

…

test_prometheus_service.py

Embedding caching fixes - handle str -> list cache, set usage tokens for cache hits, combine usage tokens on partial cache hits (#10424 )

2025-04-29 21:21:28 -07:00

test_prometheus.py

done

2025-01-22 20:19:31 +09:00

test_prompt_caching.py

LiteLLM Minor Fixes & Improvements (12/05/2024) (#7037 )

2024-12-05 00:02:31 -08:00

test_prompt_injection_detection.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_promptlayer_integration.py

LiteLLM Minor Fixes & Improvements (11/05/2024) (#6590 )

2024-11-07 04:17:05 +05:30

test_provider_specific_config.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_pydantic_namespaces.py

…

test_pydantic.py

…

test_register_model.py

…

test_router_batch_completion.py

(code quality) run ruff rule to ban unused imports (#7313 )

2024-12-19 12:33:42 -08:00

test_router_budget_limiter.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_caching.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_client_init.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_cooldowns.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_custom_routing.py

…

test_router_debug_logs.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_fallback_handlers.py

(Feat) - return x-litellm-attempted-fallbacks in responses from litellm proxy (#8558 )

2025-02-15 14:54:23 -08:00

test_router_fallbacks.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_get_deployments.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_init.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_max_parallel_requests.py

fix(lowest_tpm_rpm_routing.py): fix parallel rate limit check (#6577 )

2024-11-05 22:03:44 +05:30

test_router_pattern_matching.py

(code quality) run ruff rule to ban unused imports (#7313 )

2024-12-19 12:33:42 -08:00

test_router_policy_violation.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_retries.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_tag_routing.py

build: merge litellm_dev_03_01_2025_p2

2025-03-03 23:05:41 -08:00

test_router_timeout.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_utils.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_router_with_fallbacks.py

…

test_router.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_rules.py

Litellm ruff linting enforcement (#5992 )

2024-10-01 19:44:20 -04:00

test_sagemaker.py

test: mock sagemaker tests

2025-03-21 16:21:18 -07:00

test_scheduler.py

…

test_secret_detect_hook.py

test(base_llm_unit_tests.py): add test to ensure drop params is respe… (#8224 )

2025-02-03 16:04:44 -08:00

test_simple_shuffle.py

…

test_spend_calculate_endpoint.py

…

test_stream_chunk_builder.py

Add property ordering for vertex ai schema (#9828 ) + Fix combining multiple tool calls (#10040 )

2025-04-15 22:29:25 -07:00

test_streaming.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_supabase_integration.py

Litellm ruff linting enforcement (#5992 )

2024-10-01 19:44:20 -04:00

test_team_config.py

…

test_text_completion.py

Handle fireworks ai tool calling response (#10130 )

2025-04-19 09:37:45 -07:00

test_timeout.py

Update fireworks ai pricing (#10425 )

2025-04-29 20:58:05 -07:00

test_together_ai.py

…

test_token_counter.py

fix(utils.py): handle token counter error when invalid message passed in (#8670 )

2025-02-19 22:21:34 -08:00

test_tpm_rpm_routing_v2.py

test: update tests to new deployment model (#10142 )

2025-04-18 14:22:12 -07:00

test_traceloop.py

test: skip redundant test

2025-02-10 22:13:58 -08:00

test_ui_sso_helper_utils.py

LiteLLM Minor Fixes & Improvements (10/17/2024) (#6293 )

2024-10-17 22:09:11 -07:00

test_unit_test_caching.py

(Bug fix) - don't log messages in model_parameters in StandardLoggingPayload (#8932 )

2025-03-01 13:39:45 -08:00

test_update_spend.py

test_batch_update_spend

2025-04-01 07:12:29 -07:00

test_validate_environment.py

…

test_wandb.py

LiteLLM Minor Fixes & Improvements (11/05/2024) (#6590 )

2024-11-07 04:17:05 +05:30

test_whisper.py

Add OpenAI gpt-4o-transcribe support (#9517 )

2025-03-26 23:10:25 -07:00

user_cost.json

…

vertex_ai.jsonl

…

vertex_batch_completions.jsonl

(feat) add Vertex Batches API support in OpenAI format (#7032 )

2024-12-04 19:40:28 -08:00

vertex_key.json

ci/cd update vertex acct

2025-01-05 13:43:32 -08:00

whitelisted_bedrock_models.txt

Add anthropic3-7-sonnet (#8766 )

2025-02-24 12:59:00 -08:00