Google Open Images Dataset v4

From GM-RKB
Jump to navigation Jump to search

A Google Open Images Dataset v4 is a Google Open Images Dataset published in 2018-05-01.



References

2018

  • https://storage.googleapis.com/openimages/web/visualizer/index.html
    • QUOTE: Open Images is a dataset of ~9 million images that have been annotated with image-level labels and object bounding boxes. The training set of V4 contains 14.6M bounding boxes for 600 object classes on 1.74M images, making it the largest existing dataset with object location annotations. The boxes have been largely manually drawn by professional annotators to ensure accuracy and consistency. The images are very diverse and often contain complex scenes with several objects (8.4 per image on average). Moreover, the dataset is annotated with image-level labels spanning thousands of classes.
    • Data organization The dataset is split into a training set (9,011,219 images), a validation set (41,620 images), and a test set (125,436 images). The images are annotated with image-level labels and bounding boxes as described below.

      Table 1 shows an overview of the image-level labels in all splits of the dataset. All images have machine generated image-level labels automatically generated by a computer vision model similar to Google Cloud Vision API. These automatically generated labels have a substantial false positive rate.

  • https://storage.googleapis.com/openimages/2018_04/bbox_labels_600_hierarchy_visualizer/circle.html
    • OpenImagesV4Hierarchy.180501.png