← Back to topics
Topic
World Models
Representation learning for long-horizon decision making and planning.
1 papers · latest 2026-04-14
Most active fields for this topic
Xiaomeng Hu, Yinger Zhang, Fei Huang et al.
cs.CLcs.CL
OccuBench is a 100-scenario benchmark for professional agents across 65 domains that also injects hidden environment faults, exposing how brittle frontier models still are in real work settings.