Reactive Task Planning

- Achieved 71.2% F1 and near-instant (0.28±0.64 steps) plan-invalidity detection for online replanning by developing EBNF-guided VLM prompting methods and an autoregressive validity-checking algorithm
- Developed benchmark system for reactive task planning with 100 interruption scenarios in AI2-THOR
