Model Recipes
DeepSeek

deepseek-ai/DeepSeek-V3

DeepSeek-V3 is a 671B-parameter Mixture-of-Experts model with native FP8 weights and strong reasoning, coding, and math capabilities.

Frontier open-weights MoE with native FP8 and FP4 variants

moe · 671B / 37B · 163,840 ctx · vLLM 0.12.0+ · text
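The figures on the spec line can be turned into a rough sizing estimate. The sketch below assumes that "671B / 37B" means total parameters versus parameters activated per token, and that every weight is stored in FP8 at one byte per parameter. Both are simplifying assumptions, and the helper names are illustrative rather than part of any library.

```python
# Back-of-the-envelope sizing from the spec line.
# Assumptions (not stated on the card): "671B / 37B" is total vs. active
# parameters per token, and every weight occupies exactly 1 byte in FP8.

TOTAL_PARAMS = 671e9        # total parameters across all experts
ACTIVE_PARAMS = 37e9        # assumed: parameters activated per token
BYTES_PER_PARAM_FP8 = 1     # assumed: pure FP8 storage, no higher-precision tensors
CONTEXT_TOKENS = 163_840    # maximum context length from the spec line


def active_fraction(total: float, active: float) -> float:
    """Share of the model's parameters used for a single token."""
    return active / total


def weight_gigabytes(params: float, bytes_per_param: float) -> float:
    """Weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


frac = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
fp8_gb = weight_gigabytes(TOTAL_PARAMS, BYTES_PER_PARAM_FP8)
context_k = CONTEXT_TOKENS // 1024  # context length in units of 1024 tokens

print(f"active fraction per token: {frac:.1%}")
print(f"approx. FP8 weight storage: {fp8_gb:.0f} GB")
print(f"context window: {context_k} x 1024 tokens")
```

This estimate covers weights only. KV cache, activations, and any tensors kept at higher precision add to the real memory footprint, so treat the result as a lower bound when planning GPU capacity.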
Guide