Abstract: Vision-And-Language Navigation (VLN) suffers from the limited diversity and scale of training data, primarily constrained by the manual curation of existing simulators. To address this, we ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results