Qwen/Qwen2.5-32B

Qwen2.5 32B is a dense base (pretrained) language model for text completion.

Verified on TPU v6e (Trillium) with BF16, TP=4 on a 2x2 slice

dense | 32B | 131,072 ctx | vLLM 0.6.2+ | text
Guide
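
As a rough illustration of the verified configuration (BF16, TP=4), the sketch below assembles a `vllm serve` command line in Python. It assumes a vLLM build with TPU support is already installed on the v6e host, and it assumes the 131,072-token context listed above maps to `--max-model-len`; the helper name `build_serve_command` is hypothetical and not part of vLLM. Treat the flag values as a starting point, not as the exact command used for verification.

```python
import shlex


def build_serve_command(
    model="Qwen/Qwen2.5-32B",
    tensor_parallel_size=4,   # TP=4, one shard per chip on the 2x2 slice
    dtype="bfloat16",         # BF16 weights and activations
    max_model_len=131072,     # assumption: the listed 131,072-token context
):
    """Return the argument list for a `vllm serve` invocation (hypothetical helper)."""
    return [
        "vllm", "serve", model,
        "--tensor-parallel-size", str(tensor_parallel_size),
        "--dtype", dtype,
        "--max-model-len", str(max_model_len),
    ]


command = build_serve_command()
# Print a shell-safe string that can be pasted into a terminal on the TPU host.
print(shlex.join(command))
```

Lowering `max_model_len` reduces the memory reserved for the KV cache, which may help if the full context length does not fit alongside the weights on the slice.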