From:Internet Info Agency 2026-05-13 20:44:00
Xiaomi recently unveiled XiaomiOneVL, a one-step latent-space language-vision reasoning framework that unifies, for the first time in the industry, technical approaches including Vision-Language-Action (VLA) models, world models, and latent-space reasoning. Through a dual-supervision mechanism combining "language-based reasoning" and "visual future prediction," XiaomiOneVL integrates interpretability and future-scenario prediction capabilities into the latent-space reasoning process. It surpasses explicit Chain-of-Thought (CoT) methods in reasoning accuracy while matching the inference speed of latent-space CoT approaches that directly output answers. The framework is built upon three core technologies: the model reasons using an "internal language," possesses the ability to predict future visual frames, and compresses the entire reasoning process into a single step. These innovations aim to enhance autonomous driving systems' understanding of both current scenes and future spatiotemporal causal relationships, thereby enabling higher-quality decision-making. Xiaomi has fully open-sourced the model weights, training code, and inference code of XiaomiOneVL, making them available to developers and researchers worldwide to accelerate technological iteration and advancement in large autonomous-driving models.

China's Top 10 Passenger Car Sales in May 2026: Domestic Brands Dominate as Joint Ventures Retreat
BYD Launches New Flagship D-Class Sedan "Han" to Enter Premium Market Above RMB 300,000
Dreame CEO Yu Hao Banned Across Platforms Over Controversial Marketing Remarks
AIVA Unveils First BEV Range-Extended Crossover Priced at RMB 100,000–200,000, Launching This Year
iOS 27 Brings In-Car Video Playback and Major Upgrades to CarPlay
Toyota Invests in Tier IV to Build Global Autonomous Driving Ecosystem
BYD Delays Hungary Plant Launch to Q4 2024, Halts Turkey Factory Plans
Akio Toyoda Admits His Stance on Combustion Engines Leaves Him Feeling "Lonely"