Model Recipes
NVIDIA

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

NVIDIA Nemotron-3-Nano Mamba-hybrid MoE (30B total / ~3B active) with BF16 and FP8 variants

moe | 30B / 3B | 262,144 ctx | vLLM 0.11.2+ | text