Model Recipes
MiMo (Xiaomi)

XiaomiMiMo/MiMo-V2-Flash

Xiaomi's MoE reasoning model (309B total / 15B active) with hybrid attention and MTP for fast agentic workflows

moe309B / 15B262,144 ctxvLLM 0.11.0+text
Guide