Qwen/Qwen3.5-9B

Qwen3.5 dense multimodal model (9B parameters) with Gated Delta Networks hybrid attention, multi-token prediction (MTP), and a 262K-token context window

Single-GPU deployment of dense Qwen3.5 with MTP-accelerated decoding

dense | 9B | 262,144 ctx | vLLM 0.17.0+ | multimodal | text
Guide