What is MANSION?
MANSION is designed around a simple idea: long-horizon tasks become much more realistic (and harder) when an agent must operate across multiple floors with persistent object state.
Key goals:
- Long-horizon: tasks that require dozens to hundreds of low-level actions
- Multi-floor: stair/elevator transitions, floor-specific visibility, and navigation costs
- Reproducible: scenes and task definitions are versioned, so results can be repeated
- Composable: a skills layer lets planners reason above the level of primitive actions
If you’re building LLM/VLM planners, hierarchical RL, or program-of-thought agents, MANSION is meant to be the substrate that makes failures interesting.
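
To make the "composable" and "persistent object state" goals concrete, here is a minimal toy sketch of a skill built from primitive actions in a multi-floor world. Every name in it (`World`, `use_stairs`, `pick_up`, `put_down`, `fetch`) is a hypothetical illustration invented for this example and is not MANSION's actual API.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# NOTE: all classes and functions below are hypothetical illustrations,
# not part of MANSION's real interface.


@dataclass
class World:
    agent_floor: int = 0
    holding: Optional[str] = None
    # Persistent object state: object name -> floor it currently rests on.
    object_floor: Dict[str, int] = field(default_factory=dict)
    # Trace of primitive actions, useful for counting horizon length.
    log: List[str] = field(default_factory=list)


def use_stairs(world: World, target_floor: int) -> None:
    """Primitive: change floors one level at a time."""
    while world.agent_floor != target_floor:
        world.agent_floor += 1 if target_floor > world.agent_floor else -1
        world.log.append(f"stairs->{world.agent_floor}")


def pick_up(world: World, obj: str) -> None:
    """Primitive: pick up an object that is on the agent's current floor."""
    if world.holding is not None:
        raise ValueError("agent is already holding something")
    if world.object_floor.get(obj) != world.agent_floor:
        raise ValueError(f"{obj} is not on floor {world.agent_floor}")
    world.object_floor.pop(obj)
    world.holding = obj
    world.log.append(f"pick_up:{obj}")


def put_down(world: World) -> None:
    """Primitive: place the held object on the agent's current floor."""
    if world.holding is None:
        raise ValueError("agent is not holding anything")
    world.object_floor[world.holding] = world.agent_floor
    world.log.append(f"put_down:{world.holding}")
    world.holding = None


def fetch(world: World, obj: str, dest_floor: int) -> None:
    """Skill: one high-level step for a planner, several primitives underneath."""
    use_stairs(world, world.object_floor[obj])
    pick_up(world, obj)
    use_stairs(world, dest_floor)
    put_down(world)


world = World(object_floor={"mug": 2})
fetch(world, "mug", dest_floor=0)
print(world.log)
```

A planner that calls `fetch` issues one decision, while the trace in `world.log` records every primitive step it expanded into. The object's new location persists in `object_floor` after the skill finishes, so later skills see the updated state.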