Taxonomy
Fields
Start broad. Fields are the top-level map of what we track, from NLP and multimodal systems to robotics and efficient inference.
Reasoning & Agents
Reasoning, planning, tool use, and agentic workflows.
Recent picks
UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents
FM-Agent: Scaling Formal Methods to Large Systems via LLM-Based Hoare-Style Reasoning
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
NLP
Language understanding, generation, extraction, and evaluation.
Recent picks
Relax: An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale
Retrieval as Generation: A Unified Framework with Self-Triggered Information Planning
Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo
Machine Learning
Core modeling, optimization, inference, and systems efficiency.
Recent picks
Three Roles, One Model: Role Orchestration at Inference Time to Close the Performance Gap Between Small and Large Agents
MEMENTO: Teaching LLMs to Manage Their Own Context
KV Cache Offloading for Context-Intensive Tasks
Computer Vision
Image, video, and 3D perception plus visual generation.
Recent picks
AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation
INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling
SEM-ROVER: Semantic Voxel-Guided Diffusion for Large-Scale Driving Scene Generation
Robotics
Embodied systems, control, manipulation, and navigation.
Recent picks
SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds
PhyEdit: Towards Real-World Object Manipulation via Physically-Grounded Image Editing
E-VLA: Event-Augmented Vision-Language-Action Model for Dark and Blurred Scenes