deepseek-ai/DeepSeek-R1
DeepSeek-R1 is a 671B-parameter Mixture-of-Experts (MoE) reasoning model, with about 37B parameters active per token, built on the DeepSeek-V3 architecture and trained with large-scale reinforcement learning to strengthen chain-of-thought reasoning.
Open-weights RL-trained reasoning model with native FP8 / FP4 variants