Prompting Large Language Models with Answer Heuristics for Knowledge-Based Visual Question Answering
Abstract: Knowledge-based visual question answering (VQA) requires external knowledge beyond the image to answer the question. Early studies retrieve required knowledge from explicit knowledge bases ...