mirror of
https://github.com/tiennm99/litellm.git
synced 2026-06-18 09:32:08 +00:00
9cc39af131
* refactor(vertex_ai/llama): handle response transformation within config Allows us to handle https://github.com/BerriAI/litellm/issues/10441#issuecomment-2844975599 * fix(vertex_ai/llama): handle tool call in content Fixes https://github.com/BerriAI/litellm/issues/10441 * fix(vertex_ai/llama): return 'tool_calls' as finish reason if tool call returned vertex ai returns stop * feat(vertex_ai/): cost tracking for vertex_ai/meta/llama-4 * ci(test-linting.yml): pin openai version * build: reorder pinning * ci(pyproject.toml): limit openai version temporary patch as new version has linting errors * ci(pyproject.toml): limit openai version temporary patch around linting errors * ci(limit-openai-version): temporary patch * fix: fix linting errors * fix: fix linting error * fix(parallel_request_limiter_v2.py): add team based multi-instance rate limiting * fix: fix linting errors * build(pyproject.toml): modify pin * ci: bump pin