Qwen/Qwen3.5-122B-A10B

Mid-size Qwen3.5 multimodal MoE (122B total / 10B active) with gated delta networks, 256 experts, and 262K context

Qwen3.5 mid-tier MoE: fits on 4x H200 in BF16 or 2x H200 in FP8

moe | 122B / 10B | 262,144 ctx | vLLM 0.17.0+ | multimodal | text
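
The hardware sizing above can be sanity-checked with rough arithmetic. The sketch below is a weight-only estimate, not a measured figure: it assumes 141 GB of memory per H200 and counts all 122B parameters, since every expert of an MoE has to be resident even though only about 10B parameters are active per token. It ignores KV cache, activations, and runtime overhead, so real headroom will be smaller. The helper names are hypothetical and used only for illustration.

```python
# Rough weight-only memory estimate for the sizing note above.
# Ignores KV cache, activations, and framework overhead.

TOTAL_PARAMS = 122e9      # total parameters (from the model card)
H200_MEMORY_GB = 141      # assumption: 141 GB of HBM per H200

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"BF16": 2, "FP8": 1}


def weight_memory_gb(total_params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the weights, in GB (1 GB = 1e9 bytes)."""
    return total_params * bytes_per_param / 1e9


def headroom_gb(num_gpus: int, precision: str) -> float:
    """Aggregate GPU memory left after loading weights, in GB."""
    weights = weight_memory_gb(TOTAL_PARAMS, BYTES_PER_PARAM[precision])
    return num_gpus * H200_MEMORY_GB - weights


if __name__ == "__main__":
    for gpus, precision in [(4, "BF16"), (2, "FP8")]:
        weights = weight_memory_gb(TOTAL_PARAMS, BYTES_PER_PARAM[precision])
        print(
            f"{gpus}x H200 {precision}: weights ~{weights:.0f} GB, "
            f"headroom ~{headroom_gb(gpus, precision):.0f} GB"
        )
```

Under these assumptions the BF16 weights take about 244 GB and the FP8 weights about 122 GB. Whatever remains on each configuration has to cover the KV cache for the 262,144-token context and any other runtime allocations.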
Guide