Model
Recipes
Browse
Docs
GitHub
Providers
Arcee AI
Ernie (Baidu)
Seed (ByteDance)
DeepSeek
Google
inclusionAI
InternLM
JetBrains
Jina AI
LongCat (Meituan)
Meta
Microsoft
MiniMax
Mistral AI
Moonshot AI
NVIDIA
NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16
Cosmos3-Super-Text2Image
Cosmos3-Super-Image2Video
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
Cosmos3-Nano
Cosmos3-Super
NVIDIA-Nemotron-3-Super-120B-A12B-BF16
NVIDIA-Nemotron-3-Nano-4B-BF16
NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
NVIDIA-Nemotron-Nano-12B-v2-VL-BF16
NVIDIA-Nemotron-Nano-9B-v2
OpenAI
MiniCPM (OpenBMB)
InternVL (OpenGVLab)
PaddlePaddle
Preferred Networks
Poolside
Qwen
Stability AI
StepFun
Hunyuan (Tencent)
Wan (Alibaba)
MiMo (Xiaomi)
GLM (Z-AI)
nvidia/
NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
NVIDIA Nemotron-3-Nano Mamba-hybrid MoE (30B total / ~3B active) with BF16 and FP8 variants
View on HuggingFace
View on ModelScope
moe
30B / 3B
262,144 ctx
vLLM 0.11.2+
text
Guide