Abstract: This paper explores zero-shot Vision-and-Language Navigation (VLN), enabling agents to generalize navigation to unseen data classes. Most current approaches rely on large models, but these ...
If you make a purchase from our site, we may earn a commission. This does not affect the quality or independence of our editorial content.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results