docs vertex ft models

2026-07-04 07:06:26 +00:00 · 2025-03-26 13:55:38 -07:00
parent 12eb77d02d
commit 82016eba0a
1 changed files with 30 additions and 4 deletions
@@ -1385,7 +1385,7 @@ Call fine-tuned Vertex AI Gemini models through LiteLLM. If you want to use Lite
 | Supported Operations | `/chat/completions`, `/completions`, `/embeddings`, `/images` |

 <Tabs>
-<TabItem value="sdk" label="SDK">
+<TabItem value="sdk" label="LiteLLM Python SDK">

 ```python showLineNumbers
 import litellm
@@ -1402,7 +1402,7 @@ response = litellm.completion(
 ```

 </TabItem>
-<TabItem value="proxy" label="PROXY">
+<TabItem value="proxy" label="LiteLLM Proxy">

 1. Add Vertex Credentials to your env 

@@ -1412,7 +1412,7 @@ response = litellm.completion(

 2. Setup config.yaml 

-```yaml
+```yaml showLineNumbers
 - model_name: finetuned-gemini
  litellm_params:
    model: vertex_ai/gemini/<ENDPOINT_ID>
@@ -1422,7 +1422,30 @@ response = litellm.completion(

 3. Test it! 

-```bash
+<Tabs>
+<TabItem value="openai" label="OpenAI Python SDK">
+
+```python showLineNumbers
+from openai import OpenAI
+
+client = OpenAI(
+    api_key="your-litellm-key",
+    base_url="http://0.0.0.0:4000"
+)
+
+response = client.chat.completions.create(
+    model="finetuned-gemini",
+    messages=[
+        {"role": "user", "content": "hi"}
+    ]
+)
+print(response)
+```
+
+</TabItem>
+<TabItem value="curl" label="curl">
+
+```bash showLineNumbers
 curl --location 'https://0.0.0.0:4000/v1/chat/completions' \
 --header 'Content-Type: application/json' \
 --header 'Authorization: <LITELLM_KEY>' \
@@ -1432,6 +1455,9 @@ curl --location 'https://0.0.0.0:4000/v1/chat/completions' \
 </TabItem>
 </Tabs>

+</TabItem>
+</Tabs>
+


 ## Model Garden