TL;DR: Identifies control and visual disparities between real and simulated environments as key challenges for reliable policy evaluation, and proposes mitigations that do not require full-fidelity digital twins.

Appeared in surveys