Skip to content

Latest commit

 

History

History
17 lines (15 loc) · 3.14 KB

File metadata and controls

17 lines (15 loc) · 3.14 KB

Supported LLM Engines

We support a broad range of LLM engines for agents and tools in factory.py, including vllm, GPT-4o, Claude 3.5 Sonnet, Gemini 1.5 Pro, and more.

Model Family Model_string Example Supported Models Official Model List
vLLM vllm-Qwen/Qwen2.5-7B-Instruct Various vLLM-supported models (e.g., Qwen2.5-7B-Instruct, Qwen2.5-VL-3B-Instruct). Supports local checkpoint models for customization and local inference. vLLM Models
DashScope (Qwen) dashscope-qwen2.5-7b-instruct Qwen models via Alibaba Cloud DashScope API DashScope Models
OpenAI gpt-4o, o1-mini gpt-4-turbo, gpt-4o, gpt-4o-mini, gpt-4.1, gpt-4.1-mini, gpt-4.1-nano, gpt-3.5-turbo, gpt-4, o1, o1-mini, o3, o3-mini, o1-pro, o4-mini OpenAI Models
Azure OpenAI azure-gpt-4o gpt-4o, gpt-4o-mini, gpt-4.1, gpt-4.1-mini, gpt-4.1-nano, gpt-3.5-turbo, gpt-4, o1, o1-mini, o3, o3-mini, o1-pro, o4-mini Azure OpenAI Models
Anthropic claude-3-5-sonnet-20241022 claude-3-haiku-20240307, claude-3-sonnet-20240229, claude-3-opus-20240229, claude-3-5-sonnet-20240620, claude-3-5-sonnet-20241022, claude-3-5-haiku-20241022, claude-3-7-sonnet-20250219 Anthropic Models
TogetherAI together-meta-llama/Llama-3-70b-chat-hf Most models including meta-llama/Llama-4-Scout-17B-16E-Instruct, Qwen/QwQ-32B, Qwen/Qwen2-VL-72B-Instruct, meta-llama/Llama-3-70b-chat-hf, Qwen/Qwen2-72B-Instruct TogetherAI Models
DeepSeek deepseek-chat, deepseek-reasoner deepseek-chat, deepseek-reasoner DeepSeek Models
Gemini gemini-2.0-flash gemini-1.5-pro, gemini-1.5-flash-8b, gemini-1.5-flash, gemini-2.0-flash-lite, gemini-2.0-flash, gemini-2.5-pro-preview-03-25 Gemini Models
Grok grok-3, grok-2-vision grok-2-vision-1212, grok-2-vision, grok-2-vision-latest, grok-3-mini-fast-beta, grok-3-mini-fast, grok-3-mini-fast-latest, grok-3-mini-beta, grok-3-mini, grok-3-mini-latest, grok-3-fast-beta, grok-3-fast, grok-3-fast-latest, grok-3-beta, grok-3, grok-3-latest Grok Models
LiteLLM litellm-gpt-4o Any model supported by LiteLLM, including models from OpenAI, Anthropic, Google, Gemini, Mistral, Cohere, and more. LiteLLM Models
Ollama ollama-qwen2.5 Any model supported by Ollama, such as DeepSeek-R1, Qwen 3, Llama 3.3, Gemma 3, Qwen 2.5‑VL, and other models. Ollama Models