CARS-196
Overview
The Cars dataset contains 16,185 images of 196 classes of cars. The data is split into 8,144 training images and 8,041 testing images, where each class has been split roughly in a 50-50 split. Classes are typically at the level of Make, Model, Year, e.g. 2012 Tesla Model S or 2012 BMW M3 coupe. Image resolutions vary across samples. No resizing is applied by default.
Train: 8144 images
Test: 8041 images
Data Structure
When accessing an example using ds[i], you will receive a dictionary with the following keys:
Key |
Type |
Description |
|---|---|---|
|
|
H×W×3 RGB image |
|
int |
Class label (0-195) |
Usage Example
Basic Usage
from stable_datasets.images.cars196 import Cars196
# First run will download + prepare cache, then return the split as a HF Dataset
ds = Cars196(split="train")
# If you omit the split (split=None), you get a DatasetDict with all available splits
ds_all = Cars196(split=None)
sample = ds[0]
print(sample.keys()) # {"image", "label"}
# Optional: make it PyTorch-friendly
ds_torch = ds.with_format("torch")
References
Official website: https://ai.stanford.edu/~jkrause/cars/car_dataset.html
License: MIT License
Citation
@inproceedings{krause20133d,
title={3d object representations for fine-grained categorization},
author={Krause, Jonathan and Stark, Michael and Deng, Jia and Fei-Fei, Li},
booktitle={Proceedings of the IEEE international conference on computer vision workshops},
pages={554--561},
year={2013}}