Model Recipes
GLM (Z-AI)

zai-org/GLM-4.6

GLM-4.6 MoE language model (~357B total parameters, BF16) with MTP speculative decoding, native tool calling and reasoning

Updated GLM-4.X series MoE model with native FP8 and BF16, MTP speculative decoding

moe357B / 32B202,752 ctxvLLM 0.11.0+text
Guide