litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-18 17:28:19 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	1cdfb3da8f	[Bug Fix] - Fix `litellm_enterprise` ensure imported routes exist (#17337 ) * test_enterprise_routes.py * test_enterprise_routes_all_imports_exist	2025-12-01 19:14:12 -08:00
Ishaan Jaff	38ddd50628	[Bug fix] Vector Store List Endpoint Returns 404 (#17229 ) * fix vector store management * fix: add vector_store_management_router * TestVectorStoreManagementEndpointsExist * fix pass_through	2025-11-27 12:51:51 -08:00
Krish Dholakia	1cb5fcddba	make generic api OSS + support multiple generic API's (#17152 ) * feat(generic_api_callback.py): make generic api OSS + support multiple generic API's Enables https://github.com/BerriAI/litellm/pull/17094#discussion_r2562832967 * feat(callback_utils.py): support custom generic api callbacks * feat(generic_api_callback.py): support specifying which event types to run the generic api for * fix(litellm_logging.py): log system prompt for anthropic messages * feat(generic_api_callback.py): support generic api compatible api's - e.g. rubrik agent cloud * docs(sidebars.js): document new OSS generic api * docs(generic_api.md): document new OSS Generic API * docs(custom_webhook_api.md): document custom webhook api integration tutorial * docs(custom_webhook_api.md): cleanup * docs(custom_webhook_api.md): document what get's logged to custom webhook api * Refactor: Pass callback config to GenericAPILogger Co-authored-by: krrishdholakia <krrishdholakia@gmail.com> * Fix: Handle empty messages list in logging payload Co-authored-by: krrishdholakia <krrishdholakia@gmail.com> * Checkpoint before follow-up message Co-authored-by: krrishdholakia <krrishdholakia@gmail.com> * feat: Cache GenericAPILogger instances to improve performance Co-authored-by: krrishdholakia <krrishdholakia@gmail.com> --------- Co-authored-by: Cursor Agent <cursoragent@cursor.com>	2025-11-26 18:38:38 -08:00
Ishaan Jaffer	d389b9dd66	bump litellm enterprise	2025-11-19 16:56:50 -08:00
Krish Dholakia	70e1e83102	feat(managed_files.py): support /delete for files + feat(managed_batches): support /cancel for batches (#16387 ) * feat(managed_files.py): initial commit fixing managed file delete on litellm * fix(managed_files.py): fix file delete * feat(batches_endpoints/endpoints.py): fix cancelling a batch ensures managed batches works --------- Co-authored-by: Ishaan Jaffer <ishaanjaffer0324@gmail.com>	2025-11-18 17:36:26 -08:00
Sameer Kankute	7cebc151b0	Add managed files support for responses API (#16733 ) * Fix responses api with managed files * fix litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py mypy * fix litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py mypy * fix mypy errors	2025-11-17 18:41:26 -08:00
Ishaan Jaff	83ea037bc9	[Feat] AI Gateway - Allow admins to disable, dynamic callback controls (#16750 ) * add flag to allow_dynamic_callback_disabling * fix EnterpriseCallbackControls * test controls * add docs on dynamic logging	2025-11-17 18:29:07 -08:00
Krish Dholakia	586f126395	Prometheus - make OSS (#16689 ) * refactor: remove enterprise license from prometheus metrics * refactor: complete refactoring imports * refactor: complete prometheus refactoring * bump: version 1.80.0 → 1.80.1 * bump: version 1.80.1 → 1.80.2 * build: update litellm lock file * docs(enterprise.md): cleanup enterprise docs * docs: more docs cleanup * docs: doc cleanup * fix: fix imports * fix: fix linting error	2025-11-15 11:06:34 -08:00
Krish Dholakia	06906534b3	feat(audio_transcriptions/): calculate duration of audio file for cost calculation + feat (image_generations): cost tracking accuracy improved with output_format, quality, size values fixed per openai model * feat(audio_transcriptions/): calculate duration of audio file for cost calculation Fixes https://github.com/BerriAI/litellm/issues/11846 Closes https://github.com/BerriAI/litellm/issues/14605 * fix(cost_calculator.py): correctly use base model, when set Fixes issue where azure base model was being ignored * feat(cost_calculator.py): fix default cost tracking quality param for image generation * feat(image_generations/): return output_format, quality, size aligns response to openai spec and improves cost tracking accuracy * fix(cost_calculator.py): refactor cost calculation for image generation to use image response instead of hidden params * build: update build * fix: fix cost calculation * build: update poetry lock * fix: fix ruff checks * fix: fix aembedding * fix: fix ruff errors * fix: modify to catch errors * fix: test * fix: loosen test to handle openai lib out of sync * fix: fix base models * fix: fix usage object	2025-11-08 16:24:31 -08:00
Krish Dholakia	202eaeb1a2	Revert "(feat) Audio transcription - cost tracking + (feat) image generation …" (#16409 ) This reverts commit `c96da44265`.	2025-11-08 15:38:16 -08:00
Krish Dholakia	c96da44265	(feat) Audio transcription - cost tracking + (feat) image generation - accurate cost tracking based on output_format/quality/size * feat(audio_transcriptions/): calculate duration of audio file for cost calculation Fixes https://github.com/BerriAI/litellm/issues/11846 Closes https://github.com/BerriAI/litellm/issues/14605 * fix(cost_calculator.py): correctly use base model, when set Fixes issue where azure base model was being ignored * feat(cost_calculator.py): fix default cost tracking quality param for image generation * feat(image_generations/): return output_format, quality, size aligns response to openai spec and improves cost tracking accuracy * fix(cost_calculator.py): refactor cost calculation for image generation to use image response instead of hidden params * build: update build * fix: fix cost calculation * build: update poetry lock * fix: fix ruff checks * fix: fix aembedding * fix: fix ruff errors * fix: modify to catch errors * fix: test * fix: loosen test to handle openai lib out of sync	2025-11-08 15:30:46 -08:00
Ishaan Jaffer	0a1fc0eeb2	Revert "fix: fix ruff errors" This reverts commit `eef864360e`.	2025-11-08 14:33:51 -08:00
Krrish Dholakia	eef864360e	fix: fix ruff errors	2025-11-08 14:15:12 -08:00
Ishaan Jaff	2541a54023	[Feat] Email Notifications - Ensure Users get Key Rotated Email (#16292 ) * add virtual_key_rotated * add KEY_ROTATED_EMAIL_TEMPLATE * fix base email * add _send_key_rotated_email * fix _send_key_rotated_email * key rotated email fix * fix: async_key_generated_hook ensure behavior matches regenerate * test_send_key_rotated_email * expose EMAIL_INCLUDE_API_KEY * docs mask api keys * add email regen * docs key regen * docs email * docs email * docs fix * fix code qa check	2025-11-05 15:56:49 -08:00
Sameer Kankute	bb86c94df4	Add Prometheus metric to track callback logging failures in S3 (#16209 ) * Add v1 cut of container api * fix lint errors * Add proxy support to container apis & logging support (#16049) * Add proxy support to container apis * Add logging support * Add cost tracking support for containers and documentation * Add new constant documentation * Add container cost in model map * fix failing azure tests * Update tests based on model map changes * fix model map tests * fix model map tests * Container modeshould be container * Container tests fix * Merge branch 'main' into litellm_sameer_oct_staging_2 * Add Prometheus metric to track callback logging failures in S3 (#16102) * Add proxy support to container apis * Add logging support * prometheus metric measures how often s3_v2 is failing * remove not needed files * remove not needed files * remove not needed files * fix mypy errors * Use logging_callback_manager to get all the callbacks --------- Co-authored-by: Ishaan Jaffer <ishaanjaffer0324@gmail.com>	2025-11-03 18:46:52 -08:00
Javier de la Torre	e6a7cae7e1	fix(apscheduler): prevent memory leaks from jitter and frequent job intervals (#15846 ) * fix(apscheduler): prevent memory leaks from jitter and frequent job intervals Fixes critical memory leak in APScheduler that causes 35GB+ memory allocations during proxy startup and operation. The leak was identified through Memray analysis showing massive allocations in normalize() and _apply_jitter() functions. Key changes: 1. Remove jitter parameters from all scheduled jobs - jitter was causing expensive normalize() calculations leading to memory explosion 2. Configure AsyncIOScheduler with optimized job_defaults: - misfire_grace_time: 3600s (increased from 120s) to prevent backlog calculations that trigger memory leaks - coalesce: true to collapse missed runs - max_instances: 1 to prevent concurrent job execution - replace_existing: true to avoid duplicate jobs on restart 3. Increase minimum job intervals: - PROXY_BATCH_WRITE_AT: 30s (was 10s) - add_deployment/get_credentials jobs: 30s (was 10s) 4. Use fixed intervals with small random offsets instead of jitter for job distribution across workers 5. Explicitly configure jobstores and executors to minimize overhead 6. 
Disable timezone awareness to reduce computation Memory impact: - Before: 35GB with 483M allocations during startup - After: <1GB with normal allocation patterns Performance notes: - Minimum job intervals increased from 10s to 30s (configurable via env vars) - Jobs can still be distributed across workers using random start offsets - No functional changes to job behavior, only timing and memory optimization Testing: - Added comprehensive test suite for scheduler configuration - Verified no job execution backlog on startup - Tested duplicate job prevention with replace_existing Related issue: Memory leak in production proxy servers with APScheduler 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * docs: update PROXY_BATCH_WRITE_AT default value from 10s to 30s Update documentation to reflect the new default value for PROXY_BATCH_WRITE_AT changed in PR #15846. The default was increased from 10 seconds to 30 seconds to prevent memory leaks in APScheduler. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * refactor: Move APScheduler config to constants.py Address code review feedback from ishaan-jaff: - Move scheduler configuration variables (coalesce, misfire_grace_time, max_instances, replace_existing) to litellm/constants.py - Update all references in proxy_server.py to use the constants - Improves maintainability and makes configuration values centralized Requested-by: @ishaan-jaff Related: #15846 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-28 19:30:17 -07:00
Ishaan Jaff	95dd216150	[UI] Feature - Add Apply Guardrail Testing Playground (#16030 ) * add applyGuardrail endpoints * v0 testing apply guard * fix: use tabs * move apply guardrails endpoint * fix apply_guardrail * fix applyGuardrail * fix apply guardrail for bedrock * test guard endpoints * add tooltip for enter button * refactor * add guardrail test * tests guardrails selector * TestNomaApplyGuardrail	2025-10-28 16:41:17 -07:00
Krish Dholakia	6bb1d77f45	Org level tpm/rpm limits + Team tpm/rpm validation when assigned to org (#15549 ) * fix(support-model-specific-tpm/rpm-limits): Allows setting rate limits by tpm/rpm for models by team * fix(key_management_endpoints.py): enforce guaranteed throughput with key-level model tpm/rpm limits, when team-level tpm/rpm limits are set * test: add unit testing * feat(schema.prisma): add metadata to litellm budget table * feat(proxy/utils.py): add org limits to user api key auth allows org level tpm/rpm limiting to work * feat: add org level tpm/rpm limits + inherit org id in key from team enables org level tpm/rpm limits * feat: validated working org tpm/rpm limits * feat: support updating org level, model specific tpm/rpm limits * fix: working key validation for org level tpm/rpm limits * fix: working validation for orgs when giving tpm/rpm to teams * fix(key_management_endpoints.py): fix tpm/rpm limits on orgs * fix(key_management_endpoints.py): support limits * refactor: remove duplicate var * fix: refactor to avoid ruff errors * fix: fix typign * fix: fix linting error * fix: fix testing * fix(key_management_endpoints.py): document params	2025-10-25 13:40:29 -07:00
Krish Dholakia	1543891763	Responses API - support tags in metadata * fix(ui/): fix routing for custom server root path * fix: fix eslint errors * fix(vector_store_pre_call_hook.py): Fix https://github.com/BerriAI/litellm/issues/15724 * fix(responses/main.py): have 'tags' work across metadata + litellm_metadata * fix: add unit testing	2025-10-25 12:07:54 -07:00
Krish Dholakia	f8d6a6edb9	fix(managed_files.py): don't raise error if managed object is not found + (Feat) Azure AI - Search Vector Stores + (Fix) Batches - “User default_user_id does not have access to the object” when object not in db + (fix) Vector Stores - show config.yaml vector stores on UI (#15873 ) * fix(managed_files.py): don't raise error if managed object is not found * feat(vector_stores): add azure ai search vector store support Enables direct querying a vector store on azure * fix(azure/vector_stores): working azure ai search api vector stores allows azure direct querying on vector stores * test: update env vars * docs(docs/): document new azure ai vector store search * docs(azure_ai_vector_stores.md): add table * docs: clarify support for 'create' vector stores * fix(vector_stores/endpoints.py): Fixes https://github.com/BerriAI/litellm/issues/14606 * fix: fix linting errors	2025-10-25 12:06:24 -07:00
Sameer Kankute	0f9996a4d0	Litellm sameer oct staging (#15806 ) * Addd v2/chat support for cohere * fix streaming * Use v2_transformation for logging passthrough: * Use v2_transformation for logging passthrough: * Add test for checking if document and citation_options is getting passed * Update the cohere model * Add cost tracking for vertex ai passthrough batch jobs * Add full passthrough support * refactor code according to the comments * Add passthrough handler * remove invalid params * Updated documentation * Updated documentation * Updated documentation * Correct the import * Add openai videos generation and retrieval support * add retrieval endpoint * Add docs * Add imports * remove orjson * remove double import * fix openai videos format * remove mock code * remove not required comments * Add tests * Add tests * Add other video endpoints * Fix cost calculation and transformation * Fixed mypy tests * remove not used imports * fix documentation for get batch req (#15742) * Add grounding info to responses API (#15737) * Add grounding info to responses API * fix lint errors * Use typed objects for annotations * Use typed objects for annotations * fix mypy error * Litellm fix json serialize alreting 2 (#15741) * fix json serializable error for alerts * Add test * fix mypt errors * fix mypt errors * Add Qwen3 imported model support for AWS Bedrock (#15783) * Add qwen imported model support * fix mypy errors * fix empty user message error (#15784) * fix typed dict for list * Add azure supported videos endpoint * fix mapped tests * add azure sora models to model map * Add OpenAI video generation and content retrieval support (#15745) * Add openai videos generation and retrieval support * add retrieval endpoint * Add docs * Add imports * remove orjson * remove double import * fix openai videos format * remove mock code * remove not required comments * Add tests * Add tests * Add other video endpoints * Fix cost calculation and transformation * Fixed mypy tests * 
remove not used imports * fix typed dict for list * fix mypy errors * move directory * make v2 chat default * Fix mypy tests * Fix mypy tests * Fix mypy tests * Fix mypy tests * Revert "Add Azure Video Generation Support with Sora Integration" * refactor videos repo * add test * Add azure openai videos support * Add azure openai videos support * Add router endpoint support for videos * fix mypy error * add azure models * fix mapped test * fix mypy error * Add proxy router test * Add proxy router test * remove deprecated model name from tests * fix import error * fix import error * Add gaurdrail integration in videos endpoint * Add logging support for videos endpoint * Add final documentation supporting videos integration * fix model name and document input * Update literals to avoid mypy errors * Remove unused imports and print statements * revert guardrail support for video generation and video remix * revert guardrail support for video generation and video remix * Fix failing mapped and llm translation tests	2025-10-24 12:17:22 -07:00
Ishaan Jaff	cea318330e	[Feat] Add Guardrails for /v1/messages and /v1/responses API (#15686 ) * add get_guardrails_messages_for_call_type * fix call type for /messages * add anthropic endpoints * fix bedrock guardrails * fix config.yaml * fix types * fix async_pre_call_hook * ruff fix * fix guard * fix test bedrock guardrail * fix linting * fix linting * docs guardrails * fix mypy linting	2025-10-17 18:09:00 -07:00
Ishaan Jaff	3852fc96c1	[Oct Staging Branch] (#15460 ) * Implement fix for thinking_blocks and converse API calls This fixes Claude's models via the Converse API, which should also fix Claude Code. * Add thinking literal * Fix mypy issues * Type fix for redacted thinking * Add voyage model integration in sagemaker * Add config file logic * Use already exiting voyage transformation * refactor code as per comments * fix merge error * refactor code as per comments * refactor code as per comments * UI new build * [Fix] router - regression when adding/removing models (#15451) * fix(router): update model_name_to_deployment_indices on deployment removal When a deployment is deleted, the model_name_to_deployment_indices map was not being updated, causing stale index references. This could lead to incorrect routing behavior when deployments with the same model_name were dynamically removed. Changes: - Update _update_deployment_indices_after_removal to maintain model_name_to_deployment_indices mapping - Remove deleted indices and decrement indices greater than removed index - Clean up empty entries when no deployments remain for a model name - Update test to verify proper index shifting and cleanup behavior * fix(router): remove redundant index building during initialization Remove duplicate index building operations that were causing unnecessary work during router initialization: 1. Removed redundant `_build_model_id_to_deployment_index_map` call in __init__ - `set_model_list` already builds all indices from scratch 2. Removed redundant `_build_model_name_index` call at end of `set_model_list` - the index is already built incrementally via `_create_deployment` -> `_add_model_to_list_and_index_map` Both indices (model_id_to_deployment_index_map and model_name_to_deployment_indices) are properly maintained as lookup indexes through existing helper methods. This change eliminates O(N) duplicate work during initialization without any behavioral changes. 
The indices continue to be correctly synchronized with model_list on all operations (add/remove/upsert). * fix(prometheus): Fix Prometheus metric collection in a multi-workers environment (#14929) Co-authored-by: sotazhang <sotazhang@tencent.com> * Add tiered pricing and cost calculation for xai * Use generic cost calculator * Resolve conflicts in generated HTML files * Remove penalty params as supported params for gemini preview model (#15503) * fix conversion of thinking block * add application level encryption in SQS (#15512) * docs: fix doc * docs(index.md): bump rc * [Fix] GEMINI - CLI - add google_routes to llm_api_routes (#15500) * fix: add google_routes to llm_api_routes * test: test_virtual_key_llm_api_routes_allows_google_routes * build: bump version * bump: version 1.78.0 → 1.78.1 * add application level encryption in SQS * add application level encryption in SQS --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: deepanshu <deepanshu.lulla@hq.bill.com> * [Feat] Bedrock Knowledgebase - return search_response when using /chat/completions API with LiteLLM (#15509) * docs: fix doc * docs(index.md): bump rc * [Fix] GEMINI - CLI - add google_routes to llm_api_routes (#15500) * fix: add google_routes to llm_api_routes * test: test_virtual_key_llm_api_routes_allows_google_routes * add AnthropicCitation * fix async_post_call_success_deployment_hook * fix add vector_store_custom_logger to global callbacks * test_e2e_bedrock_knowledgebase_retrieval_with_llm_api_call * async_post_call_success_deployment_hook * add async_post_call_streaming_deployment_hook * async def test_e2e_bedrock_knowledgebase_retrieval_with_llm_api_call_streaming(setup_vector_store_registry): * fix _call_post_streaming_deployment_hook * fix async_post_call_streaming_deployment_hook * test update * docs: Accessing Search Results * docs KB * fix chatUI * fix searchResults * fix onSearchResults * fix kb 
--------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com> * [Feat] Add dynamic rate limits on LiteLLM Gateway (#15518) * docs: fix doc * docs(index.md): bump rc * [Fix] GEMINI - CLI - add google_routes to llm_api_routes (#15500) * fix: add google_routes to llm_api_routes * test: test_virtual_key_llm_api_routes_allows_google_routes * build: bump version * bump: version 1.78.0 → 1.78.1 * fix: KeyRequestBase * fix rpm_limit_type * fix dynamic rate limits * fix use dynamic limits here * fix _should_enforce_rate_limit * fix _should_enforce_rate_limit * fix counter * test_dynamic_rate_limiting_v3 * use _create_rate_limit_descriptors --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com> * Add google rerank endpoint * Add docs * fix mypy error * fix mypy and lint errors * Add haiku 4.5 integration * Add haiku 4.5 integration for other regions as well * Handle citation field correctly * Fix filtering headers for signature calcs * Add haiku 4.5 integration (#15650) --------- Co-authored-by: Leslie Cheng <leslie.cheng5@gmail.com> Co-authored-by: Sameer Kankute <sameer@berri.ai> Co-authored-by: Alexsander Hamir <alexsanderhamirgomesbaptista@gmail.com> Co-authored-by: Lucas <10226902+LoadingZhang@users.noreply.github.com> Co-authored-by: sotazhang <sotazhang@tencent.com> Co-authored-by: Deepanshu Lulla <deepanshu.lulla@gmail.com> Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com> Co-authored-by: deepanshu <deepanshu.lulla@hq.bill.com>	2025-10-17 17:52:25 -07:00
Copilot	f5359ba007	Fix apply_guardrail endpoint returning raw string instead of ApplyGuardrailResponse (#15436 ) * Initial plan * Fix apply_guardrail endpoint to return ApplyGuardrailResponse Co-authored-by: ishaan-jaff <29436595+ishaan-jaff@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: ishaan-jaff <29436595+ishaan-jaff@users.noreply.github.com>	2025-10-11 17:50:37 -07:00
Krrish Dholakia	b4c4776f56	fix: fix linting errors	2025-10-04 14:47:13 -07:00
Krrish Dholakia	a1a0e99638	fix(prometheus.py): working e2e calls w/ userapikeymetadata	2025-09-30 21:23:25 -07:00
Krrish Dholakia	d6800ee706	feat(prometheus.py): initial working commit of passing team/key metadata as prometheus metrics Closes LIT-1006	2025-09-30 17:02:40 -07:00
Krrish Dholakia	6ca7752381	fix(prometheus.py): don't require metadata labels to be set for all requests add a default value if metadata label not set	2025-09-30 15:46:30 -07:00
Krrish Dholakia	c2e2b699e6	fix: fix linting error	2025-09-27 12:43:06 -07:00
Alexsander Hamir	eaa04cd8ce	fix: use fastuuid helper (#14903 ) * fix: use fastuuid helper across the codebase First batch of changes, simple drop in replacement. * second batch of changes * fixed: script mistake on helper file	2025-09-25 15:47:01 -07:00
Ishaan Jaff	b9ffa98c55	[Feat] Proxy CLI: Create a python method to login using litellm proxy (#14782 ) * fix: cli auth with SSO okta * fix: add LITTELM_CLI_SERVICE_ACCOUNT_NAME * fix: get_litellm_cli_user_api_key_auth * use existing_key CLI * fix: use existing key * test auth commands * test_cli_sso_callback_regenerate_vs_create_flow * feat: add CLI Token Utilities * fix: get_stored_api_key * move file * fix: get_valid_models * fix config.yaml * TestCLITokenUtils * TestGetValidModelsWithCLI * fix: tie user id to keys created through CLI * fix: add teams interface to CLI * add /keys/update to the list client commands * fix /sso/cli/poll to return the user_id * fix: working TeamsManagementClient * fix CLI Login command * fixes for auth * Potential fix for code scanning alert no. 3400: Clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * ruff fix --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2025-09-22 21:28:38 -07:00
Ishaan Jaffer	c656982f18	test fix	2025-09-20 16:55:23 -07:00
Krish Dholakia	270d612029	Merge branch 'main' into litellm_dev_09_10_2025_p1	2025-09-19 22:01:57 -07:00
Krrish Dholakia	565eeca92a	build(pyproject.toml): bump versions	2025-09-18 19:24:34 -07:00
Krrish Dholakia	4d87199266	fix(prometheus.py): fix spend metrics	2025-09-18 19:12:07 -07:00
Mubashir Osmani	8b804303ed	fix: ci/cd tests + lint errors (#14646 ) * fix: lint errors + tests * fixed ci tests * fixed tests --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2025-09-17 17:06:43 -07:00
Krrish Dholakia	fc2d1f2646	fix: fix import errors	2025-09-14 09:32:21 -07:00
Krrish Dholakia	459e66a9cd	fix: fix test	2025-09-13 13:38:35 -07:00
Krrish Dholakia	a9fddbf4ad	fix(prometheus.py): make prometheus work for multiple workers	2025-09-10 16:20:17 -07:00
Ishaan Jaff	555efd60b5	[Security] Fix: Ensure .info() logs are not used for request/responses + Add code QA check for possible violations (#14386 ) * test: info log * fixes for debug * add info_log_check * fix: debug level * fix: debug level * fix: use .debug * fix info statements * fix info log check * fix: fix emit_standard_logging_payload * fix RequestBody * Potential fix for code scanning alert no. 3339: Clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * fix check * fix code qa check --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2025-09-09 13:55:56 -07:00
mubashir1osmani	2626123aca	added litellm_spend_metric	2025-09-01 11:33:34 -04:00
mubashir1osmani	5649c0f0b9	Revert "feat: add persistence and management for guardrails on virtual keys" This reverts commit `6cd5afa8b1`.	2025-09-01 10:45:46 -04:00
mubashir1osmani	a6790ac158	Revert "feat: add UI support for displaying and managing guardrails on virtual keys" This reverts commit `83654c5608`.	2025-09-01 10:45:44 -04:00
mubashir1osmani	31cfe1fe0a	Revert "fix: import verbose_proxy_logger to resolve undefined name errors" This reverts commit `a8f2ebed9c`.	2025-09-01 10:45:38 -04:00
mubashir1osmani	8de65e8448	fix: added prometheus missing metrics	2025-09-01 10:36:51 -04:00
mubashir1osmani	af2f447698	added prometheus metrics	2025-09-01 10:28:48 -04:00
mubashir1osmani (aider)	a8f2ebed9c	fix: import verbose_proxy_logger to resolve undefined name errors	2025-09-01 01:17:46 -04:00
mubashir1osmani (aider)	83654c5608	feat: add UI support for displaying and managing guardrails on virtual keys	2025-09-01 01:17:08 -04:00
mubashir1osmani (aider)	6cd5afa8b1	feat: add persistence and management for guardrails on virtual keys	2025-09-01 00:56:22 -04:00
Krrish Dholakia	dba78046b9	fix(internal_user_endpoints.py): show remaining users when free SSO users is enabled	2025-08-20 22:26:01 -07:00


274 Commits