Qwen/Qwen2.5-32B
Qwen2.5 32B dense base (pretrained) language model for text completion.
Verified on TPU v6e (Trillium) with BF16 and tensor parallelism TP=4 on a 2x2 slice.
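
As a rough illustration of what BF16 with TP=4 implies for memory, the sketch below estimates the weight footprint per chip. It assumes a nominal 32e9 parameters (taken from the "32B" in the model name, not an exact count) and an even split of weights across the four chips. It ignores KV cache, activations, and runtime overhead. The function name `weight_gib_per_chip` is hypothetical and not part of any library.

```python
# Rough weights-only estimate; not a measurement from the verified setup.
BYTES_PER_PARAM_BF16 = 2  # BF16 stores each parameter in 2 bytes


def weight_gib_per_chip(num_params: float, tp: int,
                        bytes_per_param: int = BYTES_PER_PARAM_BF16) -> float:
    """Return weight memory per chip in GiB, assuming an even split over tp chips."""
    if tp < 1:
        raise ValueError("tp must be at least 1")
    total_bytes = num_params * bytes_per_param
    return total_bytes / tp / 2**30


NUM_PARAMS = 32e9  # assumption: nominal size implied by "32B"
TP = 4             # tensor-parallel degree stated above (2x2 slice = 4 chips)

if __name__ == "__main__":
    per_chip = weight_gib_per_chip(NUM_PARAMS, TP)
    print(f"approx. {per_chip:.1f} GiB of BF16 weights per chip")
```

The real per-chip usage will be higher once the KV cache and activations are included, so this figure is only a lower bound for sizing.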