zai-org/GLM-GA
GLM-GA dense vision-language model (~10B) — image and video understanding with 128K context and dedicated Glmga video processor (fps=2, up to 640 frames)
Dense VLM based on GLM-4.6V-Flash with dedicated video processor supporting long videos up to 640 frames