Model Recipes
GLM (Z-AI)

zai-org/GLM-GA

GLM-GA dense vision-language model (~10B) — image and video understanding with 128K context and dedicated Glmga video processor (fps=2, up to 640 frames)

Dense VLM based on GLM-4.6V-Flash with dedicated video processor supporting long videos up to 640 frames

dense10B131,072 ctxvLLM 0.21.0+multimodal
Guide