Field
Robotics
Embodied systems, control, manipulation, and navigation.
8 papers · latest 2026-04-23
Common topics in this field
Tianle Zhang, Zhihao Yuan, Dafeng Chi et al.
Introduces JoyAI-RA, a vision-language-action foundation model that enhances robotic autonomy through improved generalization across diverse robotic embodiments and tasks.
Yupeng Zheng, Xiang Li, Songen Gu et al.
Presents a lightweight VLA model with world knowledge integration for efficient robot manipulation, enhancing spatial reasoning and task execution in compact robotic systems.
Michael Ziegltrum, Jianhao Jiao, Tianhu Peng et al.
First to apply sparsely gated MoE to quadruped parkour, enabling efficient, high-performance locomotion on extreme terrain—reduces compute by 40% vs MLPs, making complex robotics feasible on edge hardware.
Physical Intelligence, Bo Ai, Ali Amin et al.
$π_{0.7}$ delivers emergent, zero-shot robotic capabilities via a steerable foundation model, enabling complex multi-stage tasks in unseen environments—transforming how robots generalize across tasks and embodiments in real-world settings.
Yunsong Zhou, Hangxu Liu, Xuekun Jiang et al.
SIM1 builds physics-aligned real-to-sim twins for deformable manipulation, letting purely synthetic training reach real-data parity at a fraction of collection cost and making sim-scaled robotics learning much more practical.
Ruihang Xu, Dewei Zhou, Xiaolong Shen et al.
Adds 3D geometry and physical constraints to image editing, plus a new benchmark, making object manipulation edits far more reliable for world-model, simulation, and synthetic-data workflows.
Jiajun Zhai, Hao Shi, Shangwei Guo et al.
E-VLA uses event cameras—normally used in robotics—to let robots see and act in near-total darkness or blur, where normal cameras fail. This enables real-world robotic systems to operate reliably in challenging environments like smoke-filled rooms or fast-moving scenes.
Songyuan Yang, Huibin Tan, Kailun Yang et al.
Programming domestic robots is often too complex for non-experts. This system allows users to instruct robots by simply sketching on a camera feed, removing the need for coding or pre-existing maps and making home robotics accessible to everyone.