tiennm99/litellm

mirror of https://github.com/tiennm99/litellm.git synced 2026-06-22 01:42:49 +00:00

Files

T

ishaan-jaff e8f48c8a72 update docs

2023-08-05 15:31:12 -07:00

936 B

Raw Blame History

Streaming Responses & Async Completion

Streaming Responses
Async Completion

LiteLLM supports streaming the model response back by passing stream=True as an argument to the completion function

Streaming Responses

Usage

response = completion(model="gpt-3.5-turbo", messages=messages, stream=True)
for chunk in response:
    print(chunk['choices'][0]['delta'])

Asynchronous Completion with LiteLLM LiteLLM provides an asynchronous version of the completion function called acompletion

Async Completion

Usage

from litellm import acompletion
import asyncio

async def test_get_response():
    user_message = "Hello, how are you?"
    messages = [{"content": user_message, "role": "user"}]
    response = await acompletion(model="gpt-3.5-turbo", messages=messages)
    return response

response = asyncio.run(test_get_response())
print(response)