litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-17 18:48:36 +00:00

Author	SHA1	Message	Date
Ishaan Jaffer	b0db9d6bb7	code qa check	2025-12-19 19:00:44 +05:30
Alexsander Hamir	28821427ce	[Fix] CI/CD #1 - mypy \| check_code_and_doc_quality \| guardrails_testing (#18195 )	2025-12-18 06:31:01 -08:00
Alexsander Hamir	892d7e8d70	[Fix] CI/CD - Fix Bedrock tool calling test failures with non-serializable objects and internal parameters (#17930 ) * fix(bedrock): filter non-serializable objects from request params - Enhanced filter_exceptions_from_params() to filter callable objects (functions) and Logging objects - Applied filtering in Bedrock's _prepare_request_params() before deepcopy - Applied filtering to additional_request_params before JSON serialization - Prevents TypeError during deepcopy (APIConnectionError objects) and JSON serialization (functions, Logging objects) - Fixes test_bedrock_tool_calling test failures Root cause: MCP-related functions (handle_chat_completion_with_mcp, completion_callable) and litellm_logging_obj were incorrectly added to optional_params via add_provider_specific_params_to_optional_params(), which then ended up in additional_request_params. These objects should be in litellm_params, not optional_params. * fix(bedrock): filter internal MCP parameters from API requests Filter out LiteLLM internal/MCP-related parameters (skip_mcp_handler, mcp_handler_context, _skip_mcp_handler) from additional_request_params before sending to Bedrock API to prevent 'extraneous key' errors. - Added filter_internal_params() helper function in core_helpers.py - Applied filtering in Bedrock's _prepare_request_params() method - Fixes test_bedrock_completion.py::test_bedrock_tool_calling * fix: mypy type error * fix: add filter_exceptions_from_params to recursive function ignore list - Add filter_exceptions_from_params to IGNORE_FUNCTIONS in recursive_detector.py - Function is safe: has max_depth parameter (default 20) to prevent infinite recursion	2025-12-13 12:38:07 -08:00
Sameer Kankute	b39c21d90c	fix: Add _delete_nested_value_custom to recursive function ignore list The _delete_nested_value_custom function is recursive but has bounded depth (limited by the number of path segments), preventing infinite recursion. This is necessary for nested field removal in additional_drop_params.	2025-12-09 17:56:13 +05:30
Ishaan Jaff	38ddd50628	[Bug fix] Vector Store List Endpoint Returns 404 (#17229 ) * fix vector store management * fix: add vector_store_management_router * TestVectorStoreManagementEndpointsExist * fix pass_through	2025-11-27 12:51:51 -08:00
Ishaan Jaff	5c192a23c3	[Feat] Add new RAG API on LiteLLM AI Gateway (#17109 ) * init RAG api types * add RAG endpoints * init main.py for RAG ingest API * init RecursiveCharacterTextSplitter * add BaseRAGIngestion * fix OpenAIRAGIngestion * fix img handler * init OpenAIRAGIngestion * init BedrockRAGIngestion * init BedrockRAGIngestion * init rag tests * init BedrockVectorStoreOptions * implement BedrockRAGIngestion * add BaseRAGAPI * add endpoint for RAG ingest * add ingest RAG endpoints * add test doc * add parse_rag_ingest_request * update endpoints * docs add docs for new RAG API * fix qa check * fix linting * docs ficx * docs * add max depth checks * docs anthropic	2025-11-25 17:54:29 -08:00
Ishaan Jaffer	badbadba0d	fix img URL for tests	2025-11-22 09:41:15 -08:00
yuneng-jiang	4b25398afe	[Infra] CI/CD Fixes (#16937 ) * Attempt CI/CD Fix * Adding test for coverage * Adding max depth to copilot and vertex * Fixing mypy lint and docker database * Fixing UI build issues * Update playwright test	2025-11-21 13:58:19 -08:00
Ishaan Jaffer	8e73e11fc8	SEARCH_PROVIDERS	2025-11-06 17:07:53 -08:00
Ishaan Jaffer	d32890ba55	fix _redact_base64	2025-10-28 17:38:16 -07:00
Ishaan Jaff	ad62a6d3d1	[Feat] Add DataforSEO Search API (#15817 ) * docs google PSE * add SearchProviders * add search providers * add PSE search * add SearchProviders * get_provider_search_config * add Search * init Search * add get_http_method on BaseSearch * fixes for Google PSE * TestGooglePSESearch * add DATAFORSEO * add DataForSEOSearchConfig * TestDataForSEOSearch * add DataForSEO * fix base transform * fix search * fix dataforSEO * docs fix * fix linting * fix linting	2025-10-22 16:00:40 -07:00
Ishaan Jaffer	8590646b84	fix code QA check	2025-10-07 17:49:57 -07:00
Ishaan Jaff	f8f4207994	[Security Fix] fix: don't log JWT SSO token on .info() log (#15145 ) * fix: get_redirect_response_from_openid * fix info log check * fix: forward_upstream_to_client	2025-10-02 15:07:37 -07:00
Ishaan Jaffer	81765bba17	fix fastuuid	2025-09-26 18:20:32 -07:00
Alex Shoop	b6247d0d77	code cov test script check_fastuuid_usage.py	2025-09-24 10:27:22 +09:00
Ishaan Jaffer	725cf3627d	fix: license check.ini	2025-09-18 19:28:07 -07:00
Alexsander Hamir	59409429d4	fix: reduced __inits__ overhead in 7% (#14689 ) * fix: avoid redundant __init__ calls on hot path Previously, imports on the request hot path caused __init__ to run excessively for every request. This change ensures initialization happens once, reducing cpu overhead. * fix: remove redundant __init__ import The current implementation no longer requires an import at the top of the function. * fix: placed on core utils for future reuse * test: add coverage & remove inline import A general import-checking tool across all endpoints would be a large PR. This commit focuses on a smaller, targeted fix for the discussed case. * added import check to CI	2025-09-18 17:18:05 -07:00
Ishaan Jaff	555efd60b5	[Security] Fix: Ensure .info() logs are not used for request/responses + Add code QA check for possible violations (#14386 ) * test: info log * fixes for debug * add info_log_check * fix: debug level * fix: debug level * fix: use .debug * fix info statements * fix info log check * fix: fix emit_standard_logging_payload * fix RequestBody * Potential fix for code scanning alert no. 3339: Clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * fix check * fix code qa check --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2025-09-09 13:55:56 -07:00
Ishaan Jaff	f1699d4cbf	fix IGNORE_FUNCTIONS	2025-09-08 17:33:36 -07:00
Ishaan Jaff	6400552308	fix lic check	2025-09-08 17:18:42 -07:00
Ishaan Jaff	4c7b3e10ed	[Bug Fix] Gemini Tool Calling - fix gemini empty enum property (#14155 ) * fix: _convert_schema_types * fix recursive detector * test_convert_schema_types_type_array_conversion * fix: DEFAULT_NUM_WORKERS_LITELLM_PROXY * add _fix_enum_empty_strings * test_tool_call_with_empty_enum_property * test_fix_enum_empty_strings * fix _fix_enum_empty_strings	2025-09-01 17:05:01 -07:00
Ishaan Jaff	2331fb45d5	[Bug]: Gemini 2.5 Pro – schema validation fails with OpenAI-style type arrays in tools (#14154 ) * fix: _convert_schema_types * fix recursive detector * test_convert_schema_types_type_array_conversion * fix: DEFAULT_NUM_WORKERS_LITELLM_PROXY	2025-09-01 16:53:20 -07:00
0x-fang	9f17bedb48	bump up boto3 and aioboto3 versions (#13665 )	2025-08-15 17:06:08 -07:00
Ishaan Jaff	f5a1c14929	Revert "update boto3 version to 1.37.38 (#13656 )" This reverts commit `6f2264a13d`.	2025-08-15 11:24:59 -07:00
0x-fang	6f2264a13d	update boto3 version to 1.37.38 (#13656 )	2025-08-15 11:06:54 -07:00
Ishaan Jaff	621b3dca7b	[Bug Fix] Mistral Tool Calling - Grammar error: at 3(11): failed to compile JSON schema (#13389 ) * test_claude_tool_use_with_gemini * add _remove_json_schema_refs * add _clean_tool_schema_for_mistral * fixes mistral tool calls * _remove_json_schema_refs * fix - vertex, remove hardcoded test	2025-08-07 13:50:22 -07:00
Krish Dholakia	c7e4435bdc	Fix - using managed files w/ OTEL + UI - add model group alias on UI (#13171 ) * fix(router.py): safe deep copy kwargs OTEL adds a parent_otel_span which cannot be deepcopied * fix: use safe deep copy in other places as well * test: add script to check and ban copy.deepcopy of kwargs enforce safe_deep_copy usage * build(ui/): new component for adding model group alias on UI * fix(proxy_server.py): support updating model_group_alias via /config/update allows ui component to work * fix(router.py): update model_group_alias in router settings based on db value * fix: fix code qa error	2025-07-31 21:22:04 -07:00
Ishaan Jaff	b8e404dd95	[Feat] Backend Router - Add Auto-Router powered by `semantic-router` (#12955 ) * add router.json * test_router_auto_router * async_pre_routing_hook * fixes for auto router * add async_pre_routing_hook * add LiteLLMRouterEncoder * update test auto_router_embedding_model * add auto_router_embedding_model * add AutoRouter * fix async_pre_routing_hook * update async_pre_routing_hook * fix auto router * fix router.json * working router init * working embedding encoder * working auto router * test_router_auto_router * test auto router * add semantic-router as optional for litellm * add extras * semantic_router==0.1.10 * ruff fix * use aiohttp==3.10.11 * python-dotenv==1.0.1 * test auto router * test_router_auto_router * semantic_router * test_is_auto_router_deployment * fix check * fix docker build step * add semantic_router * Revert "add semantic_router" This reverts commit 537b67288798731a119d811f643b682086377ee9.	2025-07-24 18:32:56 -07:00
Ishaan Jaff	99ad24c683	[Feat] LiteLLM x Cloudzero integration - Allow exporting spend to cloudzero (#12908 ) * add cz init * add DataAnalyzer * add CZRNGenerator -> LiteLLM resource generator * add CloudZeroStreamer * add CBFTransformer * add cloudzero DB connection * add config * add cli.py * cleanup * add CloudZeroLogger * add CloudZeroLogger * update CloudZeroLogger * add cloudzero_router to litellm proxy * add CloudZeroInitRequest * add cloudzero spend endpoints * simple dry run endpoint * refactor dir structure * add well types CBFRecord * TestCBFTransformer * TestCloudZeroStreamer * ruff fix * add polars pip + docker requirements.txt * _group_by_date * update code qa check * docs for CZ params	2025-07-23 13:00:58 -07:00
Ishaan Jaff	57b0b4edf3	[Bug fix] [Bug]: Verbose log is enabled by default (#12596 ) * test find_set_verbose_assignments * fix set verbose * test set verbose * fix unused import	2025-07-14 20:06:16 -07:00
Krish Dholakia	e7f1fa26ab	UI - Azure Content Guardrails (#12341 ) * build(model_prices_and_context_window.json): remove 'supports_tool_choice' for specific mistral models Closes https://github.com/BerriAI/litellm/issues/11750 * feat: initial commit adding cleaner ui for azure text moderation guardrails * feat(guardrail_endpoints.py): add discoverable guardrail configs and improve converting base model to dict with types * fix(guardrail_provider_fields.tsx): render from api endpoint correctly * fix(guardrail_provider_fields.tsx): cleanup * refactor(guardrail_endpoints.py): refactor to handle dictionaries with literal - allows multiselect * feat(ui/): render dictionary with known keys correctly * feat(ui/): render optional params on separate page * style(ui/): style improvements to rendering optional params on the UI * feat(azure/prompt_shield.py): add azure prompt shield back on UI * fix(add_guardrail_form.tsx): fix form to handle updated api * fix(guardrail_optional_params.tsx): ensure values are nested correctly for writing to api * fix: fix linting error * feat(text_moderation.py): handle str to int conversion * fix(guardrail_info.tsx): only render pii settings if guardrail is presidio * fix(guardrail_info.tsx): add guardrail specific fields to update settings allows updating guardrail fields (e.g. severity threshold) post-create * fix(guardrail_endpoints.py): set guardrail_id in guardrail object ensures duplicate objects not created on guardrail update * fix(guardrail_endpoints.py): allow provider specific fields to be updated on patch update * refactor(guardrail_endpoints.py): remove duplicate info endpoint * fix(guardrail_endpoints.py): mask sensitive keys on returning via guardrail `/info` Prevent leaking keys * fix(guardrail_optional_params.tsx): return numerical input when numerical component used fixes issue where output was a str * fix(guardrail_optional_params.tsx): render dict keys correctly * fix(text_moderation.py): fix severity by category check * fix(proxy/utils.py): check if guardrail should run for post call streaming hook Prevents invalid guardrails from running if not requested * test: fix import * fix: fix linting error * test: update test * fix: fix tests * fix: fix code qa errors * fix(guardrail_endpoints.py): set max depth for function * test: update recursive_detector.py * test: update list * build: merge main * fix: fix ruff check errors	2025-07-05 10:19:29 -07:00
Ishaan Jaff	b4f45e838a	add test	2025-07-04 10:06:40 -07:00
Ishaan Jaff	ab13edd155	check_for_litellm_module_deletion	2025-07-04 10:06:40 -07:00
Ishaan Jaff	0c19414b36	[⚡️ Python SDK import] - reduce python sdk import time by .3s (#12140 ) * use 1 file for KeyManagementSystem * move key management settings * fix import locs * test_proxy_types_not_imported * test the import loc * fix import item * fix imports * fix import loc * fix imports	2025-06-28 14:57:10 -07:00
Ishaan Jaff	f1c7024e70	[Feat] Add Bridge from generateContent <> /chat/completions (#12081 ) * add GenerateContentToCompletionHandler * working - non streaming bridge * add GoogleGenAIAdapter * add google gen ai adapter * working streaming bridge * working streaming usage for adapter * tool calling transform for generate content to openai * fixes for accumulating tool calls * fix code qa checks * Best Practices for Production * fix code qa checks * test_streaming_partial_tool_calls_accumulation * linting fixes * add supported_openai_chat_completion_params * fix translate_generate_content_to_completion * test_google_genai_adapter.py	2025-06-27 11:08:55 -07:00
Krrish Dholakia	164118df81	docs(litellm_logging_code_coverage.py): fix check	2025-06-18 21:36:03 -07:00
Krish Dholakia	0d09c8ec96	Litellm dev 06 18 2025 p1 (#11872 ) * fix(spend_tracking_utils.py): add user agent tags from standard logging payload, in spend logs payload * feat(litellm_logging.py): identify user agent tags as `User-Agent: ..` and allow admin to disable storing user agent as tag * fix(azure_ai/): pass content type header in azure ai request Fixes https://github.com/BerriAI/litellm/issues/11227 * test: add unit test * fix(router.py): fix passing dynamic credentials to retrieve batch Fixes batch retrieval when using router * test: add more unit tests	2025-06-18 21:24:36 -07:00
Krish Dholakia	06484f6e5a	Xai, VertexAI, Google AI Studio - live web search support in OpenAI format (#11251 ) * build(model_prices_and_context_window.json): fix 'supports_web_search' flag - openai only supports it on 2 models - gpt-4o-search-preview and gpt-4o-mini-search-preview * feat(xai/chat): add xai web search options param support * test: add max tokens to test xai output very verbose * build(xai/): add web search support for all xai models * build(model_prices_and_cost.json): add gemini-2.0 supports web search * feat(gemini/): map openai 'web_search_options' to google's 'googlesearch' tool * build(model_prices_and_context_window.json): add supports_web_search for vertex_ai/gemini-2 models * fix: fix circular reference error * fix(convert_dict_to_response.py): handle scenario where xai returns finish reason as 'stop' for tool calls * fix: reduce function size * fix: import session handling * Revert "fix: import session handling" This reverts commit `deb257dc10`. * fix: linting pin mypy * [Feat]: Guardrails - Add streaming for bedrock post guard (#11247) * feat: add streaming for bedrock post guard * fix: bedrock guardrails * fix: add clear comments * Update litellm/proxy/guardrails/guardrail_hooks/bedrock_guardrails.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update litellm/proxy/guardrails/guardrail_hooks/bedrock_guardrails.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * fix: clean up bedrock guardrails --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * [Fix] Responses API - Session management (#11254) * fix: import session handling * fix: imports for session handler * tests: tests for session handler * Update enterprise/litellm_enterprise/enterprise_callbacks/session_handler.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * bump: bump litellm enterprise * fixes: test_create_user_default_budget * fix(xai/): filter 'strict' on tool call * test: update test for new error string * fix(utils.py): default to None if not set in model cost map ensures consistent usage of 'supports_[x]' flags * fix(fireworks_ai/): support fireworks ai document inlining on pdf's sent via openai 'file' message type * test: update test * test: name filter_value_from_dict * fix(fireworks_ai/): handle cache control flag in messages * fix(xai/chat): fix check --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-05-31 14:26:16 -07:00
Krish Dholakia	4c82dd9b27	Ollama Chat - parse tool calls on streaming (#11171 ) * fix(user_api_key_auth.py): fix else block Fixes https://github.com/BerriAI/litellm/issues/11170 * refactor(ollama/chat): refactor to base config pattern easier to maintain fixes * fix(ollama/chat): support tool call parsing on streaming Closes https://github.com/BerriAI/litellm/issues/11104 * test: update import location * fix: cleanup unused import * fix: fix ruff check error * test: update import * test: update test on ci * ci: cleanup * fix: fix chekc * fix: fix api key check order * test: fix import * ci: fix script * test: fix imports * fix: fix tests	2025-05-27 16:14:49 -07:00
Ishaan Jaff	cd3e01ea4a	test fixes	2025-05-24 19:27:30 -07:00
Krish Dholakia	a44bc9f613	Remove + Check for unsafe `enterprise/` folder imports (#11107 ) * refactor(litellm/): wrap enterprise package usage in safe try-excepts prevents missing package error for OSS users Fixes https://github.com/BerriAI/litellm/issues/10795 * ci(config.yml): add script to check for unsafe imports to ci/cd Closes https://github.com/BerriAI/litellm/issues/10795 * fix: add missing docs * test: test_openai_image_edit_litellm_sdk * use n 4 for mapped tests (#11109) * Fix/background health check (#10887) * fix: improve health check logic by deep copying model list on each iteration * test: add async test for background health check reflecting model list changes * fix: validate health check interval before executing background health check * fix: specify type for health check results dictionary * fix(user_api_key_auth.py): handle user custom auth set with no custom settings * bump: version 0.1.21 → 0.2.0 * ci(config.yml): run enterprise and litellm tests separately * fix: fix linting error * docs: add missing docs * [Feat] Add content policy violation error mapping for image editd (#11113) * feat: add image edit mapping for content policy violations * test fix * Expose `/list` and `/info` endpoints for Audit Log events (#11102) * feat(audit_logging_endpoints.py): expose list endpoint to show all audit logs make it easier for user to retrieve individual endpoints * feat(enterprise/): add audit logging endpoint * feat(audit_logging_endpoints.py): expose new GET `/audit/{id}` endpoint make it easier to retrieve view individual audit logs * feat(key_management_event_hooks.py): correctly show the key of the user who initiated the change * fix(key_management_event_hooks.py): add key rotations as an audit log event ' * test(test_audit_logging_endpoints.py): add simple unit testing for audit log endpoint * fix: testing fixes * fix: fix ruff check * [Feat] Use aiohttp transport by default - 97% lower median latency (#11097) * fix: add flag for disabling use_aiohttp_transport * feat: add _create_async_transport * feat: fixes for transport * add httpx-aiohttp * feat: fixes for transport * refactor: fixes for transport * build: fix deps * fixes: test fixes * fix: ensure aiohttp does not auto set content type * test: test fixes * feat: add LiteLLMAiohttpTransport * fix: fixes for responses API handling * test: fixes for responses API handling * test: fixes for responses API handling * feat: fixes for transport * fix: base embedding handler * test: test_async_http_handler_force_ipv4 * test: fix failing deepeval test * fix: add YARL for bedrock urls * fix: issues with transport * fix: comment out linting issues * test fix * test: XAI is unstable * test: fixes for using respx * test: XAI fixes * test: XAI fixes * test: infinity testing fixes * docs(config_settings.md): document param * test: test_openai_image_edit_litellm_sdk * test: remove deprecated test * bump respx==0.22.0 * test: test_xai_message_name_filtering * test: fix anthropic test after bumping httpx * use n 4 for mapped tests (#11109) * fix: use 1 session per event loop * test: test_client_session_helper * fix: linting error * fix: resolving GET requests on httpx 0.28.1 * test fixes proxy unit tests * fix: add ssl verify settings * fix: proxy unit tests * fix: refactor * tests: basic unit tests for aiohttp transports * tests: fixes xai --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com> * test: cleanup redundant test * fix: fix import * fix: fix linting errors --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: JuHyun Bae <jhyun0408@nate.com>	2025-05-24 08:59:02 -07:00
Krish Dholakia	fdfef04d93	Gemini Multimodal Live API support (#10841 ) * fix: initial commit * refactor(gemini/realtime/transformation.py): initial instrumentation * fix: fix default api base if not set * feat(gemini/): passes initial user request to backend * feat(realtime_streaming.py): support transforming message input before sending it enables gemini realtime streaming to be sent in correct format * feat: initial working commit of setup message being sent + working * fix(gemini/): initial commit supporting realtime response transformation * feat(gemini/realtime): transform session.created event correctly * fix(realtime_streaming.py): more gemini/realtime response mapping - handle new message * test(gemini/realtime/test_): add more unit tests * feat(gemini/realtime): handles consecutive deltas * feat(gemini/realtime): support openai 'response.text.done' event * feat(gemini/realtime): add openai 'response.text.done' and 'response.content_part.done' event support * fix(gemini/realtime): add openai 'response.done' event support unified realtime api support * fix: fix linting errors * fix: fix linting error * fix: fix linting error * fix: handle infinite loop * fix: fix linting error * fix: fix recursive detector * fix: fix file * fix: fix linting error * fix: fix linting error	2025-05-15 22:39:09 -07:00
Krish Dholakia	7210b713dc	Add target model name validation (#10722 ) * fix(auth_checks.py): enforce auth checks on target model names ensures user has access to models they are trying to call * test(test_auth_utils.py): add unit tests for auth check * fix(exception_mapping_utils.py): handle mistral 429 exception * fix: fix linting error * fix(auth_checks.py): add max fallback depth	2025-05-10 14:27:06 -07:00
Ishaan Jaff	3731ee436a	[Refactor] Use pip package for enterprise/ folder (#10709 ) * init enterprise pip * init enterprise pip * init enterprise pip * test: enterprise pip * add litellm-enterprise to pip * litellm ent check * litellm ent check * fix import email router * fix setup_litellm_enterprise_pip * fix local testing with enterprise pip	2025-05-09 17:18:48 -07:00
Krish Dholakia	500e6cddf5	Fix otel - follow genai semantic conventions + support 'instructions' param for tts (#10608 ) * fix(opentelemetry.py): fix passing otel semantic conventions Fixes SpanAttributes.LLM_PROMPTS to SpanAttributes.LLM_PROMPTS.value * fix(opentelemetry.py): ensure spanattributes always pass the actual enum value * fix(main.py): support passing 'instructions' param for gpt-4o-mini-tts * test: update tests	2025-05-06 21:57:01 -07:00
Krish Dholakia	bf9382a182	Handle more gemini tool calling edge cases + support bedrock 'stable-image-core' (#10351 ) * test(test_amazing_vertex_completion.py): try to repro https://github.com/BerriAI/litellm/issues/10319 * fix(common_utils.py): handle edge case on tools Fixes https://github.com/BerriAI/litellm/issues/10319 * test: add unit testing for infinite loops * fix(amazon_stability3_transformation.py): support 'stable-image-core' transformation Fixes https://github.com/BerriAI/litellm/issues/8488 * test: add unit testing for stable image core model * test: update test	2025-04-28 14:22:29 -07:00
Krish Dholakia	fdfa1108a6	Add property ordering for vertex ai schema (#9828 ) + Fix combining multiple tool calls (#10040 ) * fix #9783: Retain schema field ordering for google gemini and vertex (#9828) * test: update test * refactor(groq.py): initial commit migrating groq to base_llm_http_handler * fix(streaming_chunk_builder_utils.py): fix how tool content is combined Fixes https://github.com/BerriAI/litellm/issues/10034 * fix(vertex_ai/common_utils.py): prevent infinite loop in helper function * fix(groq/chat/transformation.py): handle groq streaming errors correctly * fix(groq/chat/transformation.py): handle max_retries --------- Co-authored-by: Adrian Lyjak <adrian@chatmeter.com>	2025-04-15 22:29:25 -07:00
Krish Dholakia	0dbd663877	fix(cost_calculator.py): handle custom pricing at deployment level fo… (#9855 ) * fix(cost_calculator.py): handle custom pricing at deployment level for router * test: add unit tests * fix(router.py): show custom pricing on UI check correct model str * fix: fix linting error * docs(custom_pricing.md): clarify custom pricing for proxy Fixes https://github.com/BerriAI/litellm/issues/8573#issuecomment-2790420740 * test: update code qa test * fix: cleanup traceback * fix: handle litellm param custom pricing * test: update test * fix(cost_calculator.py): add router model id to list of potential model names * fix(cost_calculator.py): fix router model id check * fix: router.py - maintain older model registry approach * fix: fix ruff check * fix(router.py): router get deployment info add custom values to mapped dict * test: update test * fix(utils.py): update only if value is non-null * test: add unit test	2025-04-09 22:13:10 -07:00
Krish Dholakia	87733c8193	Fix anthropic prompt caching cost calc + trim logged message in db (#9838 ) * fix(spend_tracking_utils.py): prevent logging entire mp4 files to db Fixes https://github.com/BerriAI/litellm/issues/9732 * fix(anthropic/chat/transformation.py): Fix double counting cache creation input tokens Fixes https://github.com/BerriAI/litellm/issues/9812 * refactor(anthropic/chat/transformation.py): refactor streaming to use same usage calculation block as non-streaming reduce errors * fix(bedrock/chat/converse_transformation.py): don't increment prompt tokens with cache_creation_input_tokens * build: remove redisvl from requirements.txt (temporary) * fix(spend_tracking_utils.py): handle circular references * test: update code cov test * test: update test	2025-04-09 21:26:43 -07:00
Krish Dholakia	8e3c7b2de0	fix(vertex_ai.py): move to only passing in accepted keys by vertex ai response schema (#8992 ) * fix(vertex_ai.py): common_utils.py move to only passing in accepted keys by vertex ai prevent json schema compatible keys like $id, and $comment from causing vertex ai openapi calls to fail * fix(test_vertex.py): add testing to ensure only accepted schema params passed in * fix(common_utils.py): fix linting error * test: update test * test: accept function	2025-04-07 18:07:01 -07:00

1 2

77 Commits