From:Internet Info Agency 2026-05-13 17:34:00
On May 13, Xiaomi officially launched and open-sourced XiaomiOneVL, a one-step latent-space language-vision reasoning framework. This framework unifies multiple technical approaches—including Vision-Language-Action (VLA), world models, and latent-space reasoning—within a single architecture for the first time, achieving performance improvements in perception, reasoning, and planning tasks for autonomous driving. XiaomiOneVL attains state-of-the-art (SOTA) results on three major benchmarks: ROADWork, Impromptu, and Alpamayo-R1, and demonstrates strong performance on the NAVSIM benchmark. Its reasoning accuracy surpasses explicit Chain-of-Thought (CoT) methods, while its inference speed matches that of latent-space CoT approaches that predict answers directly without intermediate reasoning steps. The framework supports dual interpretability in both language and vision, enabling it to simultaneously explain decision rationales in text and visualize future scenarios through predicted images. Xiaomi has open-sourced the model weights, training and inference code for XiaomiOneVL, along with its technical report and project homepage, making them available to the broader research and industry community.

Duan Jianjun Appointed President and CEO of Volvo Cars Greater China; Yuan Xiaolin Steps Down
XPeng G9L Specs Revealed: Mid-to-Large All-Electric SUV Starts at ¥248,800
2026 BYD Seagull Launches May 11 as First A00-Class EV with Optional LiDAR
Chinese Driver Ma Qinghua Wins FIA TCR World Tour Italy Round in Geely Xingrui TCR
Mercedes-Benz, BMW, and Audi Slash Dealer Sales Targets to Ease Channel Pressure
BMW Unveils First Alpina Concept Car Since Full Acquisition on May 15
Japanese Automakers' China Sales Keep Sliding; Honda Hits Record Monthly Low