TL;DR: A large transformer-based policy trained on 800k trajectories from the Open X-Embodiment dataset, serving as a versatile initialization that can be finetuned to new observation and action spaces.