Model Recipes
Qwen

Qwen/Qwen3.5-27B

Qwen3.5 dense multimodal model (27B) with gated delta networks hybrid attention, MTP, and 262K context

Qwen3.5 flagship dense — single-GPU FP8 or 2x GPU BF16

dense27B262,144 ctxvLLM 0.17.0+multimodaltext
Guide