Model Recipes
GLM (Z-AI)

zai-org/GLM-4.5V

GLM-4.5 vision-language MoE model (~107B parameters, BF16) with image-text-to-text capability, 64K context, expert parallelism, and native FP8

Multimodal GLM-4.5V with native FP8 and expert parallelism; deploys on 4x H100
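As a rough sanity check on the 4x H100 figure, the weight footprint can be estimated from the parameter count and the bytes per parameter. The sketch below is back-of-envelope arithmetic, not a measured result: it assumes the 80 GB H100 variant, counts weights only (no KV cache, activations, or vision-encoder overhead), and uses a hypothetical helper `weight_gb` that is not part of any library.

```python
# Back-of-envelope weight-memory estimate. Hardware figures are assumptions.
PARAMS_BILLION = 107   # from the card (~107B parameters)
NUM_GPUS = 4           # from the card (4x H100)
H100_MEMORY_GB = 80    # assumption: 80 GB H100 variant

# Bytes needed to store one parameter in each weight format.
BYTES_PER_PARAM = {"bf16": 2, "fp8": 1}


def weight_gb(params_billion: float, dtype: str) -> float:
    """Approximate weight footprint in GB (1 GB = 1e9 bytes)."""
    return params_billion * BYTES_PER_PARAM[dtype]


total_gb = NUM_GPUS * H100_MEMORY_GB

for dtype in BYTES_PER_PARAM:
    needed = weight_gb(PARAMS_BILLION, dtype)
    headroom = total_gb - needed
    print(f"{dtype}: ~{needed:.0f} GB weights, ~{headroom:.0f} GB headroom of {total_gb} GB")
```

Under these assumptions the BF16 weights fit in aggregate memory with some room left for the KV cache, and FP8 roughly halves the weight footprint. Actual headroom depends on context length, batch size, and runtime overhead.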

MoE | 107B / 12B | 65,536 ctx | vLLM 0.12.0+ | multimodal
Guide