Model Recipes
Qwen

Qwen/Qwen3-Next-80B-A3B-Instruct

Advanced Qwen3-Next MoE model (80B total / 3B active) with hybrid attention, highly sparse experts, and multi-token prediction.

Highly sparse MoE with MTP-accelerated (multi-token prediction) decoding; runs on 4x H200, H20, A100, or A800 GPUs.
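A rough back-of-the-envelope sketch of why a 4-GPU node is a plausible fit. It assumes 16-bit (2-byte) weights and counts only the weights, ignoring the KV cache, activations, and runtime overhead; the helper names are illustrative and not part of any library. Although only about 3B parameters are active per token, all 80B parameters of an MoE model still have to be resident in GPU memory.

```python
# Sketch only: weight-memory estimate under an ASSUMED 2 bytes per parameter.
# KV cache, activations, and framework overhead are NOT included.

TOTAL_PARAMS = 80e9   # total parameters (from the model card)
ACTIVE_PARAMS = 3e9   # parameters active per token (from the model card)
NUM_GPUS = 4          # GPU count named in the card


def weight_memory_gib(total_params: float, bytes_per_param: int = 2) -> float:
    """Approximate size of the full weight set in GiB."""
    return total_params * bytes_per_param / 2**30


def per_gpu_weight_gib(total_params: float, num_gpus: int,
                       bytes_per_param: int = 2) -> float:
    """Weights per GPU if they are split evenly across num_gpus."""
    return weight_memory_gib(total_params, bytes_per_param) / num_gpus


active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
total_gib = weight_memory_gib(TOTAL_PARAMS)
per_gpu_gib = per_gpu_weight_gib(TOTAL_PARAMS, NUM_GPUS)

print(f"active fraction per token: {active_fraction:.2%}")
print(f"weights in total:          {total_gib:.1f} GiB")
print(f"weights per GPU (x{NUM_GPUS}):     {per_gpu_gib:.1f} GiB")
```

Under these assumptions the weights alone come to roughly 37 GiB per GPU, so the remaining memory on each device determines how much context can actually be served.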

moe | 80B / 3B | 262,144 ctx | vLLM 0.10.0+ | text
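As a minimal sketch, the card's values can be turned into a `vllm serve` command line. The helper `build_serve_command` is hypothetical; `--tensor-parallel-size` and `--max-model-len` are standard vLLM options. Mapping the 4 GPUs to tensor parallelism and using the full 262,144-token context are assumptions here, not settings confirmed by the card, and no MTP or speculative-decoding options are included.

```python
import shlex


def build_serve_command(model: str, tensor_parallel_size: int,
                        max_model_len: int) -> list[str]:
    """Assemble an argument list for `vllm serve` (hypothetical helper)."""
    return [
        "vllm", "serve", model,
        "--tensor-parallel-size", str(tensor_parallel_size),  # one shard per GPU (assumed)
        "--max-model-len", str(max_model_len),                # context window from the card
    ]


cmd = build_serve_command(
    model="Qwen/Qwen3-Next-80B-A3B-Instruct",
    tensor_parallel_size=4,
    max_model_len=262144,
)

# Print a shell-safe version that can be copied into a terminal.
print(shlex.join(cmd))
```

Lowering `max_model_len` is a common way to reduce KV-cache memory if the full context window is not needed.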
Guide