Model Recipes
GLM (Z-AI)

zai-org/GLM-4.6V

GLM-4.6 vision-language MoE model — image-text-to-text with 128K context, native FP8 checkpoint, and expert parallelism

Updated GLM-V series with 128K context length and native FP8

moe107B / 12B131,072 ctxvLLM 0.12.0+multimodal
Guide