nvidia/Cosmos3-Nano

Compact 16B omnimodal world model (Mixture-of-Transformers) for multimodal understanding, world simulation, future prediction, action reasoning, and Physical AI

16B omnimodal world model — single-GPU H200 video/audio generation

dense | 16B | 262,144 ctx | vLLM 0.21.0+ | vLLM-Omni nightly | omni
Guide