litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-25 05:07:03 +00:00

Author	SHA1	Message	Date
Krish Dholakia	9e35ca2010	Embedding caching fixes - handle str -> list cache, set usage tokens for cache hits, combine usage tokens on partial cache hits (#10424) * build(model_prices_and_context_window.json): add fireworks ai new 0-4b pricing tier * build(model_prices_and_context_window.json): add more fireworks ai models * test: update testing * fix(caching_handler.py): handle str + list cache Fixes issue on cache hits for embedding when initial cached input was str * test(test_caching.py): add e2e test on caching with individual item and then list * fix(caching_handler.py): set usage tokens for cache hits enables token counting to work * fix(caching_handler.py): combine usage between cached result and embedding response Handles case of new input to embedding response * fix: cleanup * test: move to gpt-4o-new-test * test: update test	2025-04-29 21:21:28 -07:00
Krish Dholakia	1ea046cc61	test: update tests to new deployment model (#10142) * test: update tests to new deployment model * test: update model name * test: skip cohere rbac issue test * test: update test - replace gpt-4o model	2025-04-18 14:22:12 -07:00
Ishaan Jaff	f05bdd4074	(fix) `PrometheusServicesLogger` `_get_metric` should return metric in Registry (#6486) * fix logging DB fails on prometheus * unit testing log to otel wrapper * unit testing for service logger + prometheus * use LATENCY buckets for service logging * fix service logging * fix _get_metric in prom services logger * add clear doc string * unit testing for prom service logger	2024-10-29 21:29:19 +05:30
Ishaan Jaff	69b1bc1f1e	(fix) Prometheus - Log Postgres DB latency, status on prometheus (#6484) * fix logging DB fails on prometheus * unit testing log to otel wrapper * unit testing for service logger + prometheus * use LATENCY buckets for service logging * fix service logging	2024-10-29 12:17:35 +05:30
Ishaan Jaff	4d1b4beb3d	(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) * use folder for caching * fix importing caching * fix clickhouse pyright * fix linting * fix correctly pass kwargs and args * fix test case for embedding * fix linting * fix embedding caching logic * fix refactor handle utils.py * fix test_embedding_caching_azure_individual_items_reordered	2024-10-14 16:34:01 +05:30
Krrish Dholakia	3560f0ef2c	refactor: move all testing to top-level of repo Closes https://github.com/BerriAI/litellm/issues/486	2024-09-28 21:08:14 -07:00

6 Commits