YouTube-8M Dataset

From GM-RKB
(Redirected from YouTube-8M)
Jump to navigation Jump to search

A YouTube-8M Dataset is an large-scale labeled video dataset that contains YouTube videos.



References

2019

2017

  • https://research.google.com/youtube8m/
    • QUOTE: YouTube-8M is a large-scale labeled video dataset that consists of millions of YouTube video IDs, with high-quality machine-generated annotations from a diverse vocabulary of 3,800+ visual entities. It comes with precomputed audio-visual features from billions of frames and audio segments, designed to fit on a single hard disk. This makes it possible to train a strong baseline model on this dataset in less than a day on a single GPU! At the same time, the dataset's scale and diversity can enable deep exploration of complex audio-visual models that can take weeks to train even in a distributed fashion.

      Our goal is to accelerate research on large-scale video understanding, representation learning, noisy data modeling, transfer learning, and domain adaptation approaches for video. More details about the dataset and initial experiments can be found in our technical report and in last year's workshop. Some statistics from the latest version of the dataset are included below.

2016