BridgeData V2: A Dataset for Robot Learning at Scale

本文由 AI 分析生成

建立時間： 2026-05-14

Summary

EN: BridgeData V2 (CoRL 2023, arXiv:2308.12952) is the foundational large-scale teleoperation dataset from UC Berkeley, containing 60k trajectories across 24 environments and 13 skill types on a WidowX 250 6-DoF robot arm. The key design principle is reproducibility via low-cost hardware (~$3k robot) and VR controller teleoperation. ~85% of data is human-teleoperated, ~15% scripted. It supports both goal-image and natural-language task conditioning. BridgeData V2 is a primary contributor to Open X-Embodiment and remains the standard fine-tuning target for VLA models targeting tabletop manipulation.

ZH (繁體中文): BridgeData V2（CoRL 2023，arXiv:2308.12952）是 UC Berkeley 發布的大規模遙操作基礎資料集，包含 60,000 筆示範，橫跨 24 個環境、13 種技能類型，使用低成本 WidowX 250 6-DoF 機械臂。設計核心是「可複現性」——廉價機器人（~$3k）讓社群廣泛貢獻資料。約 85% 為人工遙操作，15% 為腳本。支援目標圖像和自然語言雙模式任務指定。BridgeData V2 是 Open X-Embodiment 的主要資料來源之一，也是 VLA 模型桌面操作微調的標準目標。

Key Insights

Insight 1 — 低成本硬體的策略優勢：WidowX 的低門檻讓資料收集民主化，使 BridgeData 成為 Open-X 中最具代表性的單一機器人資料集，但也限制了任務複雜度（6-DoF vs. 7-DoF Franka）。
Insight 2 — 環境數量 vs. 場景數量：24 個「環境」指的是不同的廚房/桌面配置，而非 DROID 意義上的 564 個獨立現實場景，多樣性維度不同。
Insight 3 — 雙模態條件：同時支援 goal image 和 language instruction 使 BridgeData V2 成為研究多模態條件化策略的理想基準。
Insight 4 — 作為縮放定律的驗證集：論文明確驗證了「更多資料 + 更大模型 = 更好泛化」，為後續大規模資料集（DROID、Open-X）的設計提供了理論依據。

Connections

droid-large-scale-in-the-wild-robot-manipulation-dataset — DROID 是 BridgeData 的精神繼承者，更大規模、更多樣
robotic-teleoperation-manipulation — 遙操作技術分析
lerobot-framework-tools-use-cases — LeRobot 對 BridgeData 格式的標準化處理

bot_vault

Explorer

BridgeData V2: A Dataset for Robot Learning at Scale

Summary

Key Insights

Connections

Graph View

Table of Contents

Backlinks