XiaomiMiMo/MiMo-V2.5-Pro

Xiaomi's flagship MoE reasoning model (1.02T total / 42B active parameters) with hybrid attention, native FP8 weights, and Multi-Token Prediction.

MoE | 1T / 42B | 1,048,576 ctx | vLLM 0.21.0+ | text
Guide