liteLLM - Proxy-Server mit Unterstützung für mehr als 50 LLMs

xguru · 2023-08-14T10:18:01+09:00

Die /chat/completion-API kann für verschiedene LLMs wie Azure, OpenAI, Replicate, Anthropic und Hugging Face aufgerufen werden Einheitliches Input-/Output-Format im OpenAI-Format Unterstützt Model-Fallbacks (z. B. Aufruf von llama2, wenn GPT-4 fehlschlägt) Unterstützt Logging: Supabase, Posthog, Mixpanel, Sentry, Helicone Verfolgt die Token-Nutzung Implementiert Semantic Caching Unterstützt Streaming und Asynchronität

(github.com/BerriAI)

15 Punkte von xguru 2023-08-14 | Noch keine Kommentare. | Auf WhatsApp teilen

Die /chat/completion-API kann für verschiedene LLMs wie Azure, OpenAI, Replicate, Anthropic und Hugging Face aufgerufen werden
Einheitliches Input-/Output-Format im OpenAI-Format
Unterstützt Model-Fallbacks (z. B. Aufruf von llama2, wenn GPT-4 fehlschlägt)
Unterstützt Logging: Supabase, Posthog, Mixpanel, Sentry, Helicone
Verfolgt die Token-Nutzung
Implementiert Semantic Caching
Unterstützt Streaming und Asynchronität

liteLLM - Proxy-Server mit Unterstützung für mehr als 50 LLMs

Verwandte Beiträge

Noch keine Kommentare.