Mirror of https://github.com/tiennm99/litellm.git
synced 2026-06-17 22:48:35 +00:00
8f242c42a1
* fix(batch_completion): submit all model futures before waiting
* test: add batch_completion all responses concurrency regression
* fix(batch_completion): continue collecting responses on per-model failures
* fix(batch_completion): handle empty and string models in all responses
* test(batch_completion): avoid blocking wait in concurrency regression
Implementation of litellm.batch_completion, litellm.batch_completion_models, litellm.batch_completion_models_all_responses
Doc: https://docs.litellm.ai/docs/completion/batching
The LiteLLM Python SDK provides:
* litellm.batch_completion: Run a batch of litellm.completion calls against a given model.
* litellm.batch_completion_models: Send a request to multiple language models concurrently and return the response as soon as one of the models responds.
* litellm.batch_completion_models_all_responses: Send a request to multiple language models concurrently and return a list of responses from all models that respond.
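The two multi-model behaviours described above (return the first response, or collect every response) can be sketched with the standard library alone. This is a minimal illustration, not LiteLLM's actual implementation: `all_responses`, `first_response`, and `fake_call` are hypothetical names, and `fake_call` stands in for a real provider call so the example runs without network access or API keys.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from typing import Callable, List, Optional, Union

Models = Union[str, List[str], None]


def _normalize(models: Models) -> List[str]:
    """Accept a single model name, a list of names, or nothing at all."""
    if isinstance(models, str):
        return [models]
    return list(models or [])


def all_responses(models: Models, call_model: Callable[[str], str]) -> List[str]:
    """Call every model concurrently; return the responses that succeeded."""
    names = _normalize(models)
    if not names:
        return []
    responses: List[str] = []
    with ThreadPoolExecutor(max_workers=len(names)) as executor:
        # Submit every call first so that all of them run concurrently ...
        futures = [executor.submit(call_model, name) for name in names]
        # ... and only then wait, collecting results in completion order.
        for future in as_completed(futures):
            try:
                responses.append(future.result())
            except Exception:
                # One failing model must not discard the other responses.
                continue
    return responses


def first_response(models: Models, call_model: Callable[[str], str]) -> Optional[str]:
    """Call every model concurrently; return the first successful response."""
    names = _normalize(models)
    if not names:
        return None
    executor = ThreadPoolExecutor(max_workers=len(names))
    try:
        futures = [executor.submit(call_model, name) for name in names]
        for future in as_completed(futures):
            try:
                return future.result()
            except Exception:
                continue  # Try the next model that finishes.
        return None
    finally:
        # Do not block on the slower models once a response is available.
        executor.shutdown(wait=False)


def fake_call(model: str) -> str:
    """Hypothetical stand-in for a provider call."""
    if model == "broken-model":
        raise RuntimeError("simulated provider error")
    return f"reply from {model}"


if __name__ == "__main__":
    print(all_responses(["model-a", "broken-model", "model-b"], fake_call))
    print(first_response(["broken-model", "model-a"], fake_call))
```

Because results are gathered in completion order, the list returned by `all_responses` is not guaranteed to follow the order of the input models.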