Model Recipes
Qwen

Qwen/Qwen3-32B

Qwen3 32B dense model with hybrid thinking/non-thinking modes — verified on TPU v6e (Trillium).

Verified on TPU v6e (Trillium) and v7 (Ironwood) with BF16

dense32B40,960 ctxvLLM 0.8.5+text
Guide