We support a broad range of LLM engines for agents and tools in `factory.py`, including vLLM, GPT-4o, Claude 3.5 Sonnet, Gemini 1.5 Pro, and more.
| Model Family | Model_string Example | Supported Models | Official Model List |
|---|---|---|---|
| vLLM | `vllm-Qwen/Qwen2.5-7B-Instruct` | Various vLLM-supported models (e.g., `Qwen2.5-7B-Instruct`, `Qwen2.5-VL-3B-Instruct`). Supports local checkpoint models for customization and local inference. | vLLM Models |
| DashScope (Qwen) | `dashscope-qwen2.5-7b-instruct` | Qwen models via the Alibaba Cloud DashScope API | DashScope Models |
| OpenAI | `gpt-4o`, `o1-mini` | `gpt-4-turbo`, `gpt-4o`, `gpt-4o-mini`, `gpt-4.1`, `gpt-4.1-mini`, `gpt-4.1-nano`, `gpt-3.5-turbo`, `gpt-4`, `o1`, `o1-mini`, `o3`, `o3-mini`, `o1-pro`, `o4-mini` | OpenAI Models |
| Azure OpenAI | `azure-gpt-4o` | `gpt-4o`, `gpt-4o-mini`, `gpt-4.1`, `gpt-4.1-mini`, `gpt-4.1-nano`, `gpt-3.5-turbo`, `gpt-4`, `o1`, `o1-mini`, `o3`, `o3-mini`, `o1-pro`, `o4-mini` | Azure OpenAI Models |
| Anthropic | `claude-3-5-sonnet-20241022` | `claude-3-haiku-20240307`, `claude-3-sonnet-20240229`, `claude-3-opus-20240229`, `claude-3-5-sonnet-20240620`, `claude-3-5-sonnet-20241022`, `claude-3-5-haiku-20241022`, `claude-3-7-sonnet-20250219` | Anthropic Models |
| TogetherAI | `together-meta-llama/Llama-3-70b-chat-hf` | Most models, including `meta-llama/Llama-4-Scout-17B-16E-Instruct`, `Qwen/QwQ-32B`, `Qwen/Qwen2-VL-72B-Instruct`, `meta-llama/Llama-3-70b-chat-hf`, `Qwen/Qwen2-72B-Instruct` | TogetherAI Models |
| DeepSeek | `deepseek-chat`, `deepseek-reasoner` | `deepseek-chat`, `deepseek-reasoner` | DeepSeek Models |
| Gemini | `gemini-2.0-flash` | `gemini-1.5-pro`, `gemini-1.5-flash-8b`, `gemini-1.5-flash`, `gemini-2.0-flash-lite`, `gemini-2.0-flash`, `gemini-2.5-pro-preview-03-25` | Gemini Models |
| Grok | `grok-3`, `grok-2-vision` | `grok-2-vision-1212`, `grok-2-vision`, `grok-2-vision-latest`, `grok-3-mini-fast-beta`, `grok-3-mini-fast`, `grok-3-mini-fast-latest`, `grok-3-mini-beta`, `grok-3-mini`, `grok-3-mini-latest`, `grok-3-fast-beta`, `grok-3-fast`, `grok-3-fast-latest`, `grok-3-beta`, `grok-3`, `grok-3-latest` | Grok Models |
| LiteLLM | `litellm-gpt-4o` | Any model supported by LiteLLM, including models from OpenAI, Anthropic, Google, Gemini, Mistral, Cohere, and more. | LiteLLM Models |
| Ollama | `ollama-qwen2.5` | Any model supported by Ollama, such as DeepSeek-R1, Qwen 3, Llama 3.3, Gemma 3, Qwen 2.5-VL, and other models. | Ollama Models |
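
The model strings in the table follow a visible naming pattern: some families are selected by a routing prefix (`vllm-`, `dashscope-`, `azure-`, `together-`, `litellm-`, `ollama-`), while others use the provider's own model name directly. The sketch below illustrates that pattern only. It is not the code in `factory.py`; the function `split_model_string`, both lookup tables, and the choice to strip the routing prefix before passing the name on are assumptions made for illustration.

```python
# Hypothetical illustration of the naming pattern in the table above.
# This is NOT the project's factory.py; all names here are assumptions.

# Families selected by a routing prefix that precedes the model name.
PREFIXED_FAMILIES = {
    "vllm-": "vLLM",
    "dashscope-": "DashScope (Qwen)",
    "azure-": "Azure OpenAI",
    "together-": "TogetherAI",
    "litellm-": "LiteLLM",
    "ollama-": "Ollama",
}

# Families recognized from the provider's own model naming.
NATIVE_FAMILIES = {
    "gpt-": "OpenAI",
    "o1": "OpenAI",
    "o3": "OpenAI",
    "o4": "OpenAI",
    "claude-": "Anthropic",
    "deepseek-": "DeepSeek",
    "gemini-": "Gemini",
    "grok-": "Grok",
}


def split_model_string(model_string: str) -> tuple[str, str]:
    """Guess (family, model_name) from a model string.

    Routing prefixes are checked first so that, for example,
    "azure-gpt-4o" is not mistaken for a plain OpenAI model.
    Stripping the prefix is an assumption, not confirmed behavior.
    """
    for prefix, family in PREFIXED_FAMILIES.items():
        if model_string.startswith(prefix):
            return family, model_string[len(prefix):]
    for prefix, family in NATIVE_FAMILIES.items():
        if model_string.startswith(prefix):
            return family, model_string
    raise ValueError(f"Unrecognized model string: {model_string!r}")


if __name__ == "__main__":
    for example in ("vllm-Qwen/Qwen2.5-7B-Instruct", "azure-gpt-4o", "o1-mini"):
        print(example, "->", split_model_string(example))
```

Checking the routing prefixes before the native names keeps the two groups from colliding; consult `factory.py` itself for the actual dispatch rules.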