Qwen/Qwen3.5-9B
Qwen3.5 dense multimodal model (9B) with gated delta networks hybrid attention, MTP, and 262K context
Single-GPU Qwen3.5 dense with MTP-accelerated decoding
Qwen3.5 dense multimodal model (9B) with gated delta networks hybrid attention, MTP, and 262K context
Single-GPU Qwen3.5 dense with MTP-accelerated decoding