Hefei Mei, Zirui Wang, Shen You, Minjing Dong, Chang Xu For captioning and VQA tasks, evaluation can be performed by modifying the -- eval_coco instruction in the args to eval_flicker30, eval_textvqa, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results