Vision-Language Pre-training Task

From GM-RKB
Jump to navigation Jump to search

A Vision-Language Pre-training Task is a pre-training task that aligns visual representations with textual representations through contrastive learning objectives for cross-modal understanding tasks.