Cross-Embodiment Robot Manipulation — Benchmarks & Datasets

Research Focus

Survey-style literature collection on benchmarks and datasets for cross-embodiment robot manipulation, with emphasis on 2024–2026 works. Context: factory/industrial manipulation, survey-style writing.

DOI List

10.48550/arXiv.1910.10897
10.1109/LRA.2020.2974707
10.48550/arXiv.2302.04659
10.48550/arXiv.2307.00595
10.48550/arXiv.2308.12952
10.48550/arXiv.2306.03310
10.48550/arXiv.2310.08864
10.48550/arXiv.2307.15818
10.48550/arXiv.2508.17449
10.48550/arXiv.2403.12945
10.48550/arXiv.2405.05941
10.48550/arXiv.2405.12213
10.48550/arXiv.2406.09246
10.48550/arXiv.2410.00425
10.48550/arXiv.2410.24164
10.48550/arXiv.2412.13877
10.48550/arXiv.2603.04356
10.48550/arXiv.2602.16710

Paper Nodes

Synthesis Matrix

論文年份貢獻類型Dataset/Benchmark# TasksCross-embodiment主要指標
Meta-World2019BenchmarkMeta-World (50 tasks)50NoSuccess rate
RLBench2019BenchmarkRLBench (100 tasks)100NoSuccess rate
ManiSkill22023BenchmarkManiSkill2 (20 envs)~20NoSuccess rate, SPL
RH20T2023DatasetRH20T (110k+ demos)~140PartialSkill diversity
BridgeData V22023DatasetBridgeData V2~71NoSuccess rate
LIBERO2023BenchmarkLIBERO (130 tasks)130NoSuccess rate
Open X-Embodiment2023DatasetOXE (22 datasets)527YesCross-robot transfer
RT-22023Model+EvalRT-2 evalPartialGeneralization score
IL Survey2025SurveyMulti-benchmarkYesComprehensive review
DROID2024DatasetDROID (76k trajs)84NoTask success
SIMPLER2024BenchmarkSIMPLER (sim eval)NoSim-to-real correlation
Octo2024Model+EvalOXE (800k trajs)YesFinetuning success
OpenVLA2024Model+EvalOXE (970k demos)YesTask success, lang grounding
ManiSkill32024BenchmarkManiSkill3 (12 domains)NoGPU sim throughput
π02024Model+EvalInternal + OXEYesZero-shot & finetuned SR
RoboMIND2024DatasetRoboMIND (479 tasks)479YesSuccess rate, 4 robot types
RoboCasa3652026BenchmarkRoboCasa365 (household)NoMulti-task, lifelong SR
EgoScale2026Dataset+MethodEgoScale (20,854h)YesSuccess rate (+54%)