litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-18 00:48:01 +00:00

Author	SHA1	Message	Date
Ishaan Jaffer	01fd4d7cef	fix fireworks test	2025-11-26 18:58:32 -08:00
Ishaan Jaffer	badbadba0d	fix img URL for tests	2025-11-22 09:41:15 -08:00
Ishaan Jaff	bd7d653bae	Revert "Update perplexity cost tracking (#15743)" (#16345) This reverts commit `ad6a0f4d44`.	2025-11-06 19:00:45 -08:00
Sameer Kankute	ad6a0f4d44	Update perplexity cost tracking (#15743) * Update perplexity cost tracking * fix lint errors * fix code * fix tests in perplexity * fix test related to api call * fix exception test	2025-11-03 08:45:34 -08:00
Krish Dholakia	74ae7aed44	build: Squashed commit of the following: (#16176) commit bb0b050fb01633d83c1c2932f8e9c11432911847 Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Sat Nov 1 20:00:01 2025 -0700 test: update tests commit b2da4bdac23868e69a9452805b231f8830e49912 Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Wed Oct 22 14:58:01 2025 -0700 fix(langfuse_otel_attributes.py): log tools and other optional params commit 75bee1f2748f32b230467de0b085c55bf1d687a9 Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Wed Oct 22 14:42:05 2025 -0700 feat(langfuse_otel/): working request/response logging on spans Closes https://github.com/BerriAI/litellm/issues/13764 commit a3e4fa5b81e82f71c74fb9e7dc859c6cb40495f5 Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Wed Oct 22 14:20:39 2025 -0700 fix: initial commit fixing langfuse request/response logging with OTEL commit 09fc9deac844004104822810e42975cd9c68f0e3 Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Wed Oct 22 13:33:52 2025 -0700 fix(litellm_logging.py): for responses api - return a unified usage object for logging ensures logging integrations all pull the right usage information	2025-11-02 09:46:40 -08:00
Ishaan Jaffer	2c52791b83	test_model_function_invoke	2025-10-25 14:56:12 -07:00
Ishaan Jaffer	e6b61213ca	test_completion_azure_deployment_id	2025-10-25 12:26:06 -07:00
Ishaan Jaffer	762053a8e9	test_model_function_invoke	2025-10-25 12:26:06 -07:00
Ishaan Jaffer	214c10f6ef	test_completion_cost_databricks_embedding	2025-10-25 11:47:03 -07:00
Ishaan Jaffer	74106589d0	test_completion_azure_ai_gpt_4o_with_flexible_api_base	2025-10-25 11:30:51 -07:00
Ishaan Jaffer	0bedf1c0a7	fix tests	2025-10-25 10:19:24 -07:00
Ishaan Jaff	f55745fc5e	[Fix] Forward anthropic-beta headers to Bedrock, VertexAI (#15700) * [Fix] Forward anthropic-beta headers to Bedrock and other cross-provider scenarios (#15623) * add_provider_specific_headers_to_request * fix add_provider_specific_headers_to_request * test_provider_specific_header_multi_provider * test_provider_specific_header_in_request --------- Co-authored-by: Jack Venberg <jack.venberg@rover.com>	2025-10-18 16:26:32 -07:00
Ishaan Jaffer	6d74f33043	test azure instruct	2025-09-27 15:01:25 -07:00
Ishaan Jaffer	5077e36f1b	test_completion_azure	2025-09-27 15:01:25 -07:00
Ishaan Jaffer	0efb0e2990	test_completion_gemini	2025-09-27 14:00:01 -07:00
Ishaan Jaffer	04cc292c3f	test_completion_base64	2025-09-27 13:58:54 -07:00
Ishaan Jaffer	c27beb74b9	test fix	2025-09-27 12:40:34 -07:00
Ishaan Jaffer	02cc9133a5	test_async_chat_azure_stream	2025-09-27 12:37:36 -07:00
Ishaan Jaffer	6964b5a67a	test humanloop	2025-09-23 18:28:27 -07:00
Krrish Dholakia	d05f58721e	test: remove end of life model from tests	2025-09-09 21:01:45 -07:00
Ishaan Jaff	fd39f22e3e	test_completion_openrouter_reasoning_content	2025-08-30 09:27:37 -07:00
Ishaan Jaff	eeed03a78f	test fix: gcp deprecated gemini-1.5-flash	2025-08-06 08:43:45 -07:00
Ishaan Jaff	642cfa26b0	remove deprecated	2025-07-22 20:59:34 -07:00
Ishaan Jaff	bf300f8ca7	Revert "Litellm dev 07 21 2025 p1 (#12848)" This reverts commit `e4e10aa4ed`.	2025-07-22 18:28:36 -07:00
Krish Dholakia	e4e10aa4ed	Litellm dev 07 21 2025 p1 (#12848) * fix(main.py): fix async retryer Fixes https://github.com/BerriAI/litellm/issues/12830 * fix(forward_clientside_headers_by_model_group.py): filter out 'content-type' from forwardable headers clientside content-type != proxy content type, can cause requests to hang * test(tests/): update tests	2025-07-21 22:09:39 -07:00
Krish Dholakia	c0319d0d01	Litellm dev fix gemini web search tracking (#12288) * feat(stream_chunk_builder_utils.py): correctly return web_search_requests on stream chunk builder * fix(types/utils.py): handle prompttokendetails * fix(stream_chunk_builder_utils.py): fix ruff check error * test: try-except rate limit error * fix: fix import	2025-07-03 12:27:14 -07:00
Krrish Dholakia	a198d4a39f	test: change mistral model service tier exceeded	2025-07-02 21:11:02 -07:00
Ishaan Jaff	bc835c6044	test_lm_studio_completion	2025-06-06 20:41:00 -07:00
Krrish Dholakia	4e3c8ae94f	test: update test due to cohere ssl issues	2025-05-19 20:07:57 -07:00
Krish Dholakia	d37cc63250	Add new model provider Novita AI (#7582) (#9527) * Add new model provider Novita AI (#7582) * feat: add new model provider Novita AI * feat: use deepseek r1 model for examples in Novita AI docs * fix: fix tests * fix: fix tests for novita * fix: fix novita transformation * ci: fix ci yaml * fix: fix novita transformation and test (#10056) --------- Co-authored-by: Jason <ggbbddjm@gmail.com>	2025-05-12 21:49:30 -07:00
Ishaan Jaff	88f5f9b7f8	fix ai21 test	2025-05-07 21:45:57 -07:00
Ishaan Jaff	580e221000	fix ai21 test	2025-05-07 21:26:35 -07:00
Ishaan Jaff	de7870cb54	Add `llamafile` as a provider (#10203) (#10482) * Update docs for OpenAI compatible providers, add Llamafile docs, include Llamafile in the sidebar * Add Llamafile as an LlmProviders enum * Add llamafile as a OpenAI compatible provider (in the list of compatible providers) * Add Llamafile chat config and tests * Wire up Llamafile Co-authored-by: Peter Wilson <peter@mozilla.ai>	2025-05-01 18:36:55 -07:00
Krrish Dholakia	4ab0ee0b65	test: more testing fixes	2025-05-01 15:36:13 -07:00
Krish Dholakia	9e35ca2010	Embedding caching fixes - handle str -> list cache, set usage tokens for cache hits, combine usage tokens on partial cache hits (#10424) * build(model_prices_and_context_window.json): add fireworks ai new 0-4b pricing tier * build(model_prices_and_context_window.json): add more fireworks ai models * test: update testing * fix(caching_handler.py): handle str + list cache Fixes issue on cache hits for embedding when initial cached input was str * test(test_caching.py): add e2e test on caching with individual item and then list * fix(caching_handler.py): set usage tokens for cache hits enables token counting to work * fix(caching_handler.py): combine usage between cached result and embedding response Handles case of new input to embedding response * fix: cleanup * test: move to gpt-4o-new-test * test: update test	2025-04-29 21:21:28 -07:00
Krish Dholakia	d783190e04	Update fireworks ai pricing (#10425) * build(model_prices_and_context_window.json): add fireworks ai new 0-4b pricing tier * build(model_prices_and_context_window.json): add more fireworks ai models * test: update testing * test: testing updates * test: update test * test: update test	2025-04-29 20:58:05 -07:00
Ishaan Jaff	b9756bf006	test_completion_azure	2025-04-19 07:24:11 -07:00
Krish Dholakia	1ea046cc61	test: update tests to new deployment model (#10142) * test: update tests to new deployment model * test: update model name * test: skip cohere rbac issue test * test: update test - replace gpt-4o model	2025-04-18 14:22:12 -07:00
Krrish Dholakia	415abfc222	test: update test	2025-04-18 13:13:58 -07:00
Krrish Dholakia	f7dd688035	test: handle cohere rbac issue (verified happens on calling azure directly)	2025-04-18 08:42:12 -07:00
Ishaan Jaff	ad09d250ef	test fix azure deprecated mistral	2025-04-15 22:32:14 -07:00
Ishaan Jaff	b3f37b860d	test fix azure deprecated mistral ai	2025-04-15 21:42:40 -07:00
Krish Dholakia	ac9f03beae	Allow passing `thinking` param to litellm proxy via client sdk + Code QA Refactor on get_optional_params (get correct values) (#9386) * fix(litellm_proxy/chat/transformation.py): support 'thinking' param Fixes https://github.com/BerriAI/litellm/issues/9380 * feat(azure/gpt_transformation.py): add azure audio model support Closes https://github.com/BerriAI/litellm/issues/6305 * fix(utils.py): use provider_config in common functions * fix(utils.py): add missing provider configs to get_chat_provider_config * test: fix test * fix: fix path * feat(utils.py): make bedrock invoke nova config baseconfig compatible * fix: fix linting errors * fix(azure_ai/transformation.py): remove buggy optional param filtering for azure ai Removes incorrect check for support tool choice when calling azure ai - prevented calling models with response_format unless on litellm model cost map * fix(amazon_cohere_transformation.py): fix bedrock invoke cohere transformation to inherit from coherechatconfig * test: fix azure ai tool choice mapping * fix: fix model cost map to add 'supports_tool_choice' to cohere models * fix(get_supported_openai_params.py): check if custom llm provider in llm providers * fix(get_supported_openai_params.py): fix llm provider in list check * fix: fix ruff check errors * fix: support defs when calling bedrock nova * fix(factory.py): fix test	2025-04-07 21:04:11 -07:00
Krish Dholakia	fcf17d114f	Litellm dev 04 05 2025 p2 (#9774) * test: move test to just checking async * fix(transformation.py): handle function call with no schema * fix(utils.py): handle pydantic base model in message tool calls Fix https://github.com/BerriAI/litellm/issues/9321 * fix(vertex_and_google_ai_studio.py): handle tools=[] Fixes https://github.com/BerriAI/litellm/issues/9080 * test: remove max token restriction * test: fix basic test * fix(get_supported_openai_params.py): fix check * fix(converse_transformation.py): support fake streaming for meta.llama3-3-70b-instruct-v1:0 * fix: fix test * fix: parse out empty dictionary on dbrx streaming + tool calls * fix(handle-'strict'-param-when-calling-fireworks-ai): fireworks ai does not support 'strict' param * fix: fix ruff check ' * fix: handle no strict in function * fix: revert bedrock change - handle in separate PR	2025-04-07 21:02:52 -07:00
Krish Dholakia	34bdf36eab	Add inference providers support for Hugging Face (#8258) (#9738) (#9773) * Add inference providers support for Hugging Face (#8258) * add first version of inference providers for huggingface * temporarily skipping tests * Add documentation * Fix titles * remove max_retries from params and clean up * add suggestions * use llm http handler * update doc * add suggestions * run formatters * add tests * revert * revert * rename file * set maxsize for lru cache * fix embeddings * fix inference url * fix tests following breaking change in main * use ChatCompletionRequest * fix tests and lint * [Hugging Face] Remove outdated chat completion tests and fix embedding tests (#9749) * remove or fix tests * fix link in doc * fix(config_settings.md): document hf api key --------- Co-authored-by: célina <hanouticelina@gmail.com>	2025-04-05 10:50:15 -07:00
Ishaan Jaff	e7a8b5a809	run ci/cd again	2025-03-26 08:12:51 -07:00
Ishaan Jaff	c010cdef59	test_dynamic_azure_params	2025-03-18 17:26:23 -07:00
Krrish Dholakia	e2ae504a81	test: skip flaky tests	2025-03-11 19:43:04 -07:00
Krish Dholakia	f899b828cf	Support openrouter `reasoning_content` on streaming (#9094) * feat(convert_dict_to_response.py): support openrouter format of reasoning content * fix(transformation.py): fix openrouter streaming with reasoning content Fixes https://github.com/BerriAI/litellm/issues/8193#issuecomment-270892962 * fix: fix type error	2025-03-09 20:03:59 -07:00
Ishaan Jaff	f9cee4c46b	(Bug Fix) Using LiteLLM Python SDK with model=`litellm_proxy/` for embedding, image_generation, transcription, speech, rerank (#8815) * test_litellm_gateway_from_sdk * fix embedding check for openai * test litellm proxy provider * fix image generation openai compatible models * fix litellm.transcription * test_litellm_gateway_from_sdk_rerank * docs litellm python sdk * docs litellm python sdk with proxy * test_litellm_gateway_from_sdk_rerank * ci/cd run again * test_litellm_gateway_from_sdk_image_generation * test_litellm_gateway_from_sdk_embedding * test_litellm_gateway_from_sdk_embedding	2025-02-25 16:22:37 -08:00

117 Commits