News, 29 April 2026
World Action Models: What They Are and How They Work
Vision-Language-Action (VLA) models have become the dominant approach to building AI-driven robot control systems over the past several years. Their emerging successor, the World Action Model (WAM), is a distinct architectural category: instead of mapping images directly to actions, it uses video generation as an intermediate planning mechanism. DreamZero, developed by a team at NVIDIA and described in a research paper published on arXiv in February 2026, is the first publicly described system of this class to operate in real time on a real robot. Understanding how it works matters because it points toward a plausible direction for the next generation of robotic foundation models.