Commit Graph

1735 Commits

Author SHA1 Message Date
Ishaan Jaff d5f93048cc docs - lowest cost routing 2024-05-07 13:15:30 -07:00
Ishaan Jaff 4c909194c7 docs - lowest - latency routing 2024-05-07 12:43:44 -07:00
Krish Dholakia 93e5fb49d3 Merge pull request #3500 from ghaemisr/main
Added support for JWT auth with PEM cert public keys
2024-05-07 11:07:30 -07:00
Ishaan Jaff ee1b1fe4f8 Merge pull request #3464 from sepiatone/main
update langchain documentation to reflect refactor
2024-05-07 08:42:23 -07:00
Ishaan Jaff 8b15507add Merge pull request #3471 from paneru-rajan/Issue-3466-update-doc
Synced the doc with the Mistral
2024-05-07 08:41:18 -07:00
Sara Ghaemi 66b2b5fab9 made audience optional and updated docs 2024-05-07 11:37:04 -04:00
sepiatone 1518a7b42d update langchain documentation to reflect refactor 2024-05-07 17:01:55 +05:30
Krish Dholakia 30003afbf8 Merge pull request #3459 from alexanderepstein/langfuse_improvements
Update support for langfuse metadata
2024-05-06 21:56:29 -07:00
Ishaan Jaff ec63a30095 docs - deploy litellm on gcp cloud run 2024-05-06 08:10:35 -07:00
Ishaan Jaff e0001a9121 docs - add using vertex embedding models 2024-05-06 07:56:17 -07:00
Rajan Paneru 2347cb2247 bring back the to be depricated mistral-medium 2024-05-07 00:19:19 +09:30
Rajan Paneru d0d063e25a Synced the model with the Mistral
* Ordered them based on previous order
* Excluded to be depricated models
* Use the model name, instead of the slug in the first column as referred by the name of the column
2024-05-06 13:16:24 +09:30
Alex Epstein b82162832a Update support for langfuse metadata
- Added ability to set trace release, version, metadata
- Added ability to update fields during a trace continuation
- Added ability to update input and output during a trace continuation
- Wrote new test for verifying metadata is set correctly
- Small improvement to setting secret boolean, prevent unnecessary literal_eval
- Small improvements to langfuse tests
2024-05-05 14:56:36 -04:00
Ishaan Jaff bbf5d79069 docs - set retry policy 2024-05-04 17:52:01 -07:00
Ishaan Jaff d968dedd77 Merge pull request #1530 from TanaroSch/main
change max_tokens type to int
2024-05-04 12:47:15 -07:00
Rajan Paneru 7d9377f18a Improve the document of Traceloop
If we follow exact step I was getting two errors, which this Pull-Request will fix:
* Fixed the yaml snippet spacing issue: it was throwing error when running copy+paste code
* Added api_key: my-fake-key as to run litellm --config config.yaml --debug it will be required, otherwise it will fail

Need for Improvements:
* The traceloop is ambigous, because it's a company that maintains OpenLLMetry, and has observability solution and the sdk name is aslo traceloop-sdk
* The doc was missing several other observability solutions
* The steps were not quite obvious, added one more steps to make things clear
2024-05-04 22:42:53 +09:30
Krrish Dholakia f7eee60943 docs(exception_mapping.md): update exception mapping docs with 'should_retry' 2024-05-03 16:51:26 -07:00
Ishaan Jaff bf048ecda4 docs - titan embeddings v2 2024-05-03 15:59:02 -07:00
Ishaan Jaff 209baaca02 docs - simplify prod docs 2024-05-03 15:40:05 -07:00
Krish Dholakia 2200900ca2 Merge pull request #3393 from Priva28/main
Add Llama3 tokenizer and allow custom tokenizers.
2024-05-02 16:32:41 -07:00
greenscale-nandesh 6009632705 Merge branch 'main' into main 2024-05-02 08:22:11 -07:00
Christian Privitelli 3449a5e446 update docs, allow use with encode/decode 2024-05-02 14:56:38 +10:00
Krrish Dholakia cdd3e1eef3 build(ui): enable adding openmeter via proxy ui 2024-05-01 21:16:23 -07:00
Krrish Dholakia 61d680143f docs(openmeter.md): add openmeter to docs 2024-05-01 18:31:45 -07:00
Krrish Dholakia b187deb787 docs(deploy.md): cleanup docker quick start docs 2024-05-01 10:00:49 -07:00
Krrish Dholakia 6a2ddc2791 docs(routing.md): add docs on lowest latency routing buffer 2024-04-30 22:41:50 -07:00
Ishaan Jaff 81df36b298 docs - slack alerting 2024-04-29 21:33:03 -07:00
Ishaan Jaff b1e888edad docs example logging to langfuse 2024-04-29 21:26:27 -07:00
Ishaan Jaff 0cad58f5c6 docs logging to langfuse on proxy 2024-04-29 21:26:15 -07:00
Ishaan Jaff 0c99ae9451 docs - fix kub.yaml config on docs 2024-04-29 21:20:29 -07:00
Krrish Dholakia b46db8b891 feat(utils.py): json logs for raw request sent by litellm
make it easier to view verbose logs in datadog
2024-04-29 19:21:19 -07:00
Krrish Dholakia bd79e8b516 docs(langfuse_integration.md): add 'existing_trace_id' to langfuse docs 2024-04-29 16:40:38 -07:00
Krrish Dholakia 77f155d158 docs(load_test.md): cleanup docs 2024-04-29 16:27:58 -07:00
Krrish Dholakia af6a21f27c docs(load_test.md): add multi-instance router load test to docs 2024-04-29 16:25:56 -07:00
Krrish Dholakia 8f830bd948 docs(load_test.md): simplify doc 2024-04-29 16:00:02 -07:00
Krrish Dholakia fcb83781ec docs(load_test.md): formatting 2024-04-29 15:58:41 -07:00
Krrish Dholakia 5fe0f38558 docs(load_test.md): load test multiple instances of the proxy w/ tpm/rpm limits on deployments 2024-04-29 15:58:14 -07:00
Krrish Dholakia cef2d95bb4 docs(routing.md): add max parallel requests to router docs 2024-04-29 15:37:48 -07:00
Ishaan Jaff 8d26030b99 docs - track cost custom callbacks 2024-04-29 13:15:08 -07:00
Krrish Dholakia f74a43aa78 docs(vllm.md): update docs to tell people to check openai-compatible endpoint docs for vllm 2024-04-28 09:48:03 -07:00
Krish Dholakia 1841b74f49 Merge branch 'main' into litellm_common_auth_params 2024-04-28 08:38:06 -07:00
Ishaan Jaff 3b8126cf51 docs - alerting 2024-04-27 15:33:21 -07:00
Ishaan Jaff 6762d07c7f Merge pull request #3330 from BerriAI/litellm_rdct_msgs
[Feat] Redact Logging Messages/Response content on Logging Providers with `litellm.turn_off_message_logging=True`
2024-04-27 11:25:09 -07:00
Ishaan Jaff f55838d185 sentry redact messages 2024-04-27 11:23:46 -07:00
Ishaan Jaff 01478c9148 docs - langfuse redact messages 2024-04-27 11:23:35 -07:00
Krrish Dholakia 48f19cf839 feat(utils.py): unify common auth params across azure/vertex_ai/bedrock/watsonx 2024-04-27 11:06:18 -07:00
Simon S. Viloria 3c248e308e (docs) fixed typo in url for watsonx colab notebook 2024-04-27 16:43:49 +02:00
Krish Dholakia 2d976cfabc Merge pull request #3270 from simonsanvil/feature/watsonx-integration
(feat) add IBM watsonx.ai as an llm provider
2024-04-27 05:48:34 -07:00
Krrish Dholakia 2aa4976b82 docs(mistral.md): update to show tool calling example 2024-04-26 17:16:25 -07:00
Ishaan Jaff 644d8c71b8 docs - setting up litellm-database 2024-04-26 10:23:15 -07:00