Robots AtlasRobots Atlas
Gemini Robotics-ER 1.6 Logo
Preview
Apr 14, 2026
APIHosted UICloud

Gemini Robotics-ER 1.6

MultimodalRobotics FM

Vision-Language Model by Google DeepMind with advanced spatial and embodied reasoning, designed for robotics applications.

Technical specification

Context window
0K
Max output
0K
Tools
Yes
Fine-tuning
No
Weights access
Closed
Last updated: May 2, 2026

Modalities

Input
Text
Image
Audio
Video
Output
Text

Capabilities

9

Reasoning

Reasoning

Multi-step reasoning

Reasoning

Planning

Planning

Image understanding

Vision

Multimodal understanding

Multimodality

Function Calling

Planning

Structured output

Structured gen.

Video Understanding

Other

Audio understanding

Audio

Architecture and technologies

Applications