Pascal VOC 2012

Classic 20-class detection and segmentation benchmark — still a default for quick experiments.

LQS 83 (gold) · ⚠ Research-only · 11.5K images · 2 GB · JPG/XML · Released 2012
Source: kaggle.com · maintained by Pascal VOC Consortium (Oxford, Leeds, Microsoft)

About this dataset

Pascal VOC 2012 is the final release of the classic Visual Object Classes challenge from Oxford and partners. It contains 11,530 images with 27,450 ROI annotations across 20 object classes, plus 6,929 segmentation masks. It is smaller than modern benchmarks, but well documented and still widely used for teaching and baseline comparison.

Formats
JPG · XML
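
For readers new to the XML side of the format, here is a minimal sketch of reading bounding boxes with Python's standard library. It assumes the usual VOC annotation layout (top-level `object` elements, each with a `name` and a `bndbox` holding `xmin`, `ymin`, `xmax`, `ymax`). The sample XML string, its values, and the helper name `parse_voc_annotation` are made up for illustration and do not come from the dataset.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation shaped like a VOC XML file (values are illustrative).
SAMPLE_XML = """
<annotation>
  <filename>example.jpg</filename>
  <size><width>500</width><height>375</height><depth>3</depth></size>
  <object>
    <name>dog</name>
    <difficult>0</difficult>
    <bndbox><xmin>48</xmin><ymin>240</ymin><xmax>195</xmax><ymax>371</ymax></bndbox>
  </object>
  <object>
    <name>person</name>
    <difficult>0</difficult>
    <bndbox><xmin>8</xmin><ymin>12</ymin><xmax>352</xmax><ymax>370</ymax></bndbox>
  </object>
</annotation>
"""

def parse_voc_annotation(xml_text):
    """Return (filename, [(class_name, xmin, ymin, xmax, ymax), ...])."""
    root = ET.fromstring(xml_text.strip())
    filename = root.findtext("filename")
    boxes = []
    # Only direct children named "object" are annotated instances.
    for obj in root.findall("object"):
        name = obj.findtext("name")
        bndbox = obj.find("bndbox")
        coords = tuple(
            int(float(bndbox.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((name, *coords))
    return filename, boxes

filename, boxes = parse_voc_annotation(SAMPLE_XML)
for box in boxes:
    print(filename, box)
```

To read a real file, pass its text to the same function, for example `parse_voc_annotation(open(path).read())`, where `path` points at one annotation file in your local copy.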

LabelSets Quality Score

LQS is our 7-dimension quality score, computed from the dataset's published statistics.

83 out of 100 · gold tier

Solid dataset with some trade-offs

Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.

Completeness (95): No public completeness metric; using prior for 'expert_curated' datasets.
Uniqueness (90): Benchmark-grade splits with leakage prevention.
Validation (92): Labels produced by domain experts or trained annotators.
Size adequacy (63): 11,530 images, below the 100,000 target for Computer Vision but usable.
Format compliance (95): Industry-standard format, drop-in compatible with mainstream tooling.
Label density (61): Average 2.4 labels per item (moderate).
Class balance (75): Moderate class skew, typical of a realistic production distribution.

What it's used for

Common tasks and benchmarks where Pascal VOC 2012 is the default or competitive choice.

Sample statistics

What's actually in the dataset — from the maintainer's published stats.

11,530 images, 27,450 object bounding boxes, 6,929 segmentation masks, 20 classes.
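
As a quick sanity check, the "2.4 labels per item" figure quoted under Label density is consistent with these counts if it is read as bounding boxes divided by images. That reading is an assumption; the page does not say how the average is computed.

```python
# Published counts from the statistics above.
num_images = 11_530
num_boxes = 27_450

# Assumed definition: average labels per item = boxes / images.
boxes_per_image = num_boxes / num_images

print(f"boxes per image: {boxes_per_image:.2f}")  # 2.38, which rounds to 2.4
```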

License

Pascal VOC 2012 is distributed under Flickr Terms (permissive for research). This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.

Heads up: this dataset's license restricts commercial use. If you need computer vision data for production, check LabelSets' paid datasets below — every listing has an explicit commercial license.

Need commercial-licensed Computer Vision data?

LabelSets sellers offer paid computer vision datasets, each with an explicit commercial license.

Frequently Asked Questions

Pascal VOC 2012 is distributed under Flickr Terms (permissive for research), which restricts commercial use. For a commercially-licensed alternative in computer vision, see LabelSets' paid datasets.
Pascal VOC 2012 contains 11,530 images, with 27,450 object bounding boxes, 6,929 segmentation masks, and 20 classes.
Pascal VOC 2012 is maintained by Pascal VOC Consortium (Oxford, Leeds, Microsoft) and is available at https://www.kaggle.com/datasets/gopalbhattrai/pascal-voc-2012-dataset. LabelSets indexes and scores this dataset for discoverability but does not redistribute it.
LQS is a 7-dimension quality score (completeness, uniqueness, validation, size adequacy, format compliance, label density, class balance) computed from the dataset's published statistics. Composite scores map to tiers: platinum (≥90), gold (≥75), silver (≥60), bronze (<60).
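
The tier thresholds above can be sketched as a small lookup. The function name `lqs_tier` is hypothetical and is not part of any LabelSets API. The sketch covers only the score-to-tier mapping, not how the composite score itself is computed.

```python
def lqs_tier(score):
    """Map a composite LQS score (0-100) to its tier name."""
    if score >= 90:
        return "platinum"
    if score >= 75:
        return "gold"
    if score >= 60:
        return "silver"
    return "bronze"

# Pascal VOC 2012 has a composite score of 83.
print(lqs_tier(83))  # gold
```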