Qwen/Qwen3-Next-80B-A3B-Instruct
Advanced Qwen3-Next MoE model (80B total / 3B active) with hybrid attention, highly sparse experts, and multi-token prediction.
Highly sparse MoE with MTP-accelerated decoding, runs on 4x H200/H20/A100/A800
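As a rough illustration of what "80B total / 3B active" and the 4-GPU requirement imply, the sketch below computes the fraction of parameters active per token and an approximate per-GPU weight footprint. The 2-bytes-per-parameter figure (BF16/FP16 weights) and the even split across GPUs are assumptions, not details from this card, and the estimate ignores KV cache, activations, and runtime overhead.

```python
# Back-of-the-envelope sizing sketch; function names are illustrative only.
TOTAL_PARAMS = 80e9    # from the card: 80B total parameters
ACTIVE_PARAMS = 3e9    # from the card: 3B parameters active per token
NUM_GPUS = 4           # from the card: 4x GPU deployment
BYTES_PER_PARAM = 2    # ASSUMPTION: BF16/FP16 weights, no quantization


def active_fraction(total_params: float, active_params: float) -> float:
    """Share of the model's parameters used for a single token."""
    return active_params / total_params


def weight_gb_per_gpu(total_params: float, bytes_per_param: int, num_gpus: int) -> float:
    """Approximate weight memory per GPU in GB, assuming an even split."""
    return total_params * bytes_per_param / num_gpus / 1e9


if __name__ == "__main__":
    frac = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    per_gpu = weight_gb_per_gpu(TOTAL_PARAMS, BYTES_PER_PARAM, NUM_GPUS)
    print(f"Active fraction per token: {frac:.2%}")
    print(f"Approx. weights per GPU:   {per_gpu:.0f} GB")
```

Under these assumptions, only a few percent of the weights participate in each token, yet all of the weights must still be resident in memory, which is why the model is spread across several GPUs.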