We present Kimi-VL, an efficient open-source Mixture-of-Experts (MoE) vision-language model (VLM) that offers advanced multimodal reasoning, long-context understanding, and strong agent ...