Qwen/Qwen3.5-2B

Qwen3.5 mini dense multimodal model (2B) — edge / low-VRAM serving with 262K context

Edge-scale Qwen3.5 dense — fits on 8 GB GPUs

dense · 2B · 262,144 ctx · vLLM 0.17.0+ · multimodal · text
Guide
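
The claim above that the model fits on 8 GB GPUs can be sanity-checked with a rough weight-memory estimate. The sketch below is only a back-of-envelope calculation, not a measured figure. It assumes "2B" means exactly 2e9 parameters, that weights are stored in 16 bits (2 bytes each), and that an 8 GB card offers about 8 GiB. None of these assumptions are stated on this page.

```python
# Back-of-envelope weight-memory estimate (assumptions, not measured values).
PARAMS = 2_000_000_000   # assumed: "2B" taken as exactly 2e9 parameters
BYTES_PER_PARAM = 2      # assumed: 16-bit weights (bf16 / fp16)
GPU_MEMORY_GIB = 8       # assumed: the 8 GB GPU treated as 8 GiB


def weight_memory_gib(params: int, bytes_per_param: int) -> float:
    """Return the memory needed for the weights alone, in GiB."""
    return params * bytes_per_param / 2**30


weights_gib = weight_memory_gib(PARAMS, BYTES_PER_PARAM)
headroom_gib = GPU_MEMORY_GIB - weights_gib

print(f"weights:  {weights_gib:.2f} GiB")
print(f"headroom: {headroom_gib:.2f} GiB")
```

Under these assumptions the weights take a little under 4 GiB, leaving roughly half the card for the KV cache, activations, and runtime overhead. The estimate says nothing about whether the full 262,144-token context fits in that headroom, since KV-cache size depends on architecture details not given here.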