Abstract: Large Language Models significantly improve 3D scene understanding, especially in 3D Question Answering. However, reasoning about positional relationships between objects remains a challenge ...