From:Internet Info Agency 2026-05-13 17:34:00
On May 13, Xiaomi officially launched and open-sourced XiaomiOneVL, a one-step latent-space language-vision reasoning framework. This framework unifies multiple technical approaches—including Vision-Language-Action (VLA), world models, and latent-space reasoning—within a single architecture for the first time, achieving performance improvements in perception, reasoning, and planning tasks for autonomous driving. XiaomiOneVL attains state-of-the-art (SOTA) results on three major benchmarks: ROADWork, Impromptu, and Alpamayo-R1, and demonstrates strong performance on the NAVSIM benchmark. Its reasoning accuracy surpasses explicit Chain-of-Thought (CoT) methods, while its inference speed matches that of latent-space CoT approaches that predict answers directly without intermediate reasoning steps. The framework supports dual interpretability in both language and vision, enabling it to simultaneously explain decision rationales in text and visualize future scenarios through predicted images. Xiaomi has open-sourced the model weights, training and inference code for XiaomiOneVL, along with its technical report and project homepage, making them available to the broader research and industry community.

China's Top 10 Passenger Car Sales in May 2026: Domestic Brands Dominate as Joint Ventures Retreat
BYD Launches New Flagship D-Class Sedan "Han" to Enter Premium Market Above RMB 300,000
AIVA Unveils First BEV Range-Extended Crossover Priced at RMB 100,000–200,000, Launching This Year
iOS 27 Brings In-Car Video Playback and Major Upgrades to CarPlay
EV Sales Hit Record High in 37 Countries as Global Markets Accelerate Shift to Electric Vehicles
Toyota Invests in Tier IV to Build Global Autonomous Driving Ecosystem
EV Catches Fire While Charging at Guangzhou Station; Skywell HiT PHEV Suspected
Xiaomi SU7 Ultra Fire in Nanchang Ruled Out as Battery Self-Ignition