Model Recipes
Qwen

Qwen/Qwen3.5-0.8B

Qwen3.5 tiny dense multimodal model (0.8B) — ultra-low-VRAM / edge serving with 262K context

Tiny Qwen3.5 dense for edge / draft-model use

dense0.8B262,144 ctxvLLM 0.17.0+multimodaltext
Guide