Model Recipes
DeepSeek

deepseek-ai/DeepSeek-R1

DeepSeek-R1 is a 671B-parameter MoE reasoning model built on the DeepSeek-V3 architecture, trained with large-scale reinforcement learning for strong chain-of-thought capabilities.

Open-weights RL-trained reasoning model with native FP8 / FP4 variants

moe671B / 37B163,840 ctxvLLM 0.12.0+text
Guide