A versatile and scalable vision-language-action framework: XR-1 supports robust multi-task learning across diverse robot embodiments and environments.
Code for HOPformer released in ECCV 2026 Paper "Towards in-the-wild Egocentric 3D Hand-Object Pose Estimation" - Sid2697/HOPformer ...