Visual Question Answering (QA) Task

From GM-RKB
Jump to navigation Jump to search

A Visual Question Answering (QA) Task is a QA task that is also a vision-and-language task.



References

2015

  • Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C Lawrence Zitnick, and Devi Parikh. 2015. "Vqa: Visual question answering. In: Proceedings of the IEEE International Conference on computer vision, pages 2425–2433.