vLLM 模型支持 - 支持的 LLM 列表

🦙 Llama 系列

vllm serve meta-llama/Llama-3.3-70B-Instruct
vllm serve meta-llama/Llama-3.2-3B-Instruct
vllm serve meta-llama/Llama-3.1-8B-Instruct

vllm serve mistralai/Mistral-7B-Instruct-v0.3
vllm serve mistralai/Mixtral-8x7B-Instruct-v0.1
vllm serve mistralai/Mistral-Large-Instruct-2407

vllm serve Qwen/Qwen2.5-72B-Instruct
vllm serve Qwen/Qwen2.5-7B-Instruct
vllm serve Qwen/Qwen2-VL-7B-Instruct

vllm serve google/gemma-2-27b-it
vllm serve google/gemma-2-9b-it

vllm serve microsoft/Phi-3-medium-128k-instruct
vllm serve microsoft/Phi-3-mini-128k-instruct