TL;DR A 7B-parameter open-source VLA trained on 970k real-world robot demonstrations with strong multi-task generalization and language grounding, supporting effective fine-tuning for new settings.

Appeared in surveys