inclusionAI/Ring-1T-FP8
Ring-1T (BailingMoeV2) FP8 model with ~1T total parameters, intended for deployment on 8xH200 or 8xMI300X.
moe · 1T / 50B · 65,536 ctx · vLLM 0.11.0+ · text
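As a rough sanity check on the 8xH200 / 8xMI300X sizing, the sketch below estimates how much memory the FP8 weights would take against the aggregate memory of one 8-GPU node. It assumes one byte per FP8 weight and per-GPU capacities of 141 GB (H200) and 192 GB (MI300X); those capacities and the helper names `weight_footprint_gb` and `headroom_gb` are illustrative assumptions, not values from this card. The real footprint will differ somewhat because of quantization scales, any unquantized layers, and runtime overhead.

```python
# Back-of-the-envelope check: do ~1T FP8 parameters fit on one 8-GPU node?
# Assumptions (not from the model card): 1 byte per FP8 weight, 141 GB per
# H200, 192 GB per MI300X, decimal gigabytes throughout.

TOTAL_PARAMS = 1.0e12      # "~1T total params" from the card
BYTES_PER_PARAM = 1        # FP8 stores one byte per weight (scales ignored)
GPUS_PER_NODE = 8

# Per-GPU memory in GB (assumed vendor spec figures).
HBM_GB = {"H200": 141, "MI300X": 192}


def weight_footprint_gb(params: float, bytes_per_param: int) -> float:
    """Approximate size of the weights in decimal gigabytes."""
    return params * bytes_per_param / 1e9


def headroom_gb(gpu: str) -> float:
    """Aggregate node memory minus the weights: what is left for the
    KV cache, activations, and runtime overhead."""
    total_memory = HBM_GB[gpu] * GPUS_PER_NODE
    return total_memory - weight_footprint_gb(TOTAL_PARAMS, BYTES_PER_PARAM)


if __name__ == "__main__":
    for gpu in HBM_GB:
        print(f"8x{gpu}: ~{headroom_gb(gpu):.0f} GB left after weights")
```

Under these assumptions the weights alone take about 1,000 GB, which leaves roughly 128 GB on an 8xH200 node and 536 GB on an 8xMI300X node. That is consistent with the card listing a full 8-GPU node as the deployment target, and it suggests the H200 configuration has comparatively little room for KV cache at long context lengths.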
Guide