v2.5.0
NABirds Dataset
Load NABirds in Python with one line of code, and stream images of 400 North American bird species while training ML models.
Visualization of the NABirds dataset on the Activeloop Platform

What is NABirds Dataset?

The NABirds dataset is a collection of 48,000 annotated photographs of the 400 bird species commonly seen across North America. Each species has more than 100 photographs, with separate annotations for males, females, and juveniles, giving 700 visual categories in total.

Download NABirds Dataset in Python

Instead of downloading the NABirds dataset manually, you can load it in Python with one line of code using our open-source package Hub.

Load NABirds Dataset Training Subset in Python

import hub
ds = hub.load("hub://activeloop/nabirds-dataset-train")

Load NABirds Dataset Validation Subset in Python

import hub
ds = hub.load("hub://activeloop/nabirds-dataset-val")
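
The two subsets differ only in the suffix of their path, so the split can be chosen by name. The following is a minimal sketch, assuming the `hub` package is installed. `nabirds_path` and `load_nabirds` are hypothetical helper names introduced for illustration and are not part of Hub.

```python
# Paths copied from the two loading snippets above.
NABIRDS_PATHS = {
    "train": "hub://activeloop/nabirds-dataset-train",
    "val": "hub://activeloop/nabirds-dataset-val",
}


def nabirds_path(split):
    """Return the Hub path for a NABirds split ("train" or "val")."""
    if split not in NABIRDS_PATHS:
        raise ValueError(
            f"unknown split {split!r}; expected one of {sorted(NABIRDS_PATHS)}"
        )
    return NABIRDS_PATHS[split]


def load_nabirds(split="train"):
    """Hypothetical convenience wrapper around hub.load for one split."""
    import hub  # imported lazily so nabirds_path works without Hub installed

    return hub.load(nabirds_path(split))
```

Calling `load_nabirds("val")` would then be equivalent to the validation snippet above.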

NABirds Dataset Structure

NABirds Data Fields

  • images: tensor containing the bird image.
  • labels: tensor containing the bird category label.
  • boxes: tensor containing the bounding box that localizes the bird in the image.
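
The boxes field can be used to crop each photograph to the bird before classification. This page does not state the coordinate convention, so the sketch below assumes each box is stored as [x, y, width, height] in pixels. Check the actual layout before relying on it. `xywh_to_xyxy` and `crop_to_box` are hypothetical helpers, not Hub functions.

```python
import numpy as np


def xywh_to_xyxy(boxes):
    """Convert [x, y, width, height] boxes to [x_min, y_min, x_max, y_max].

    ASSUMPTION: the input layout is x, y, width, height (not confirmed here).
    """
    boxes = np.asarray(boxes, dtype=np.float32).reshape(-1, 4)
    corners = boxes.copy()
    corners[:, 2] = boxes[:, 0] + boxes[:, 2]  # x_max = x + width
    corners[:, 3] = boxes[:, 1] + boxes[:, 3]  # y_max = y + height
    return corners


def crop_to_box(image, box_xyxy):
    """Crop an H x W x C image array to one corner-format box, clipped to the image."""
    height, width = image.shape[:2]
    x0, y0, x1, y1 = box_xyxy
    x0, x1 = int(max(0, x0)), int(min(width, x1))
    y0, y1 = int(max(0, y0)), int(min(height, y1))
    return image[y0:y1, x0:x1]
```

With a loaded dataset, the inputs would typically come from calls such as `ds.images[0].numpy()` and `ds.boxes[0].numpy()`.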

NABirds Data Splits

The dataset is provided as two subsets: a training subset (nabirds-dataset-train) and a validation subset (nabirds-dataset-val).

How to use NABirds Dataset with PyTorch and TensorFlow in Python

Train a model on NABirds dataset with PyTorch in Python

Let's use Hub's built-in PyTorch one-line dataloader to connect the data to the compute:
dataloader = ds.pytorch(num_workers=0, batch_size=4, shuffle=False)
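
Once the dataloader exists, a training epoch is a loop over its batches. The sketch below is framework-agnostic. It assumes each batch is a mapping keyed by tensor name ("images", "labels"). `train_step` is a hypothetical callable that you supply, for example one that runs a PyTorch forward and backward pass and returns the loss as a number.

```python
def run_epoch(dataloader, train_step):
    """Iterate once over the dataloader and return the mean loss.

    ASSUMPTIONS: each batch is a mapping with "images" and "labels" entries,
    and train_step(images, labels) is user-supplied and returns a numeric loss.
    """
    total_loss = 0.0
    num_batches = 0
    for batch in dataloader:
        loss = train_step(batch["images"], batch["labels"])
        total_loss += float(loss)
        num_batches += 1
    if num_batches == 0:
        raise ValueError("dataloader yielded no batches")
    return total_loss / num_batches
```

If the photographs differ in size, batching with batch_size greater than 1 may first require a transform that resizes every image to a common shape.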

Train a model on NABirds dataset with TensorFlow in Python

dataloader = ds.tensorflow()

Additional Information about NABirds Dataset

NABirds Dataset Description

NABirds Dataset Curators

Grant Van Horn, Steve Branson, Ryan Farrell, Scott Haber, Jessie Barry, Panos Ipeirotis, Serge Belongie

NABirds Dataset Licensing Information

Hub users may have access to a variety of publicly available datasets. We do not host or distribute these datasets, vouch for their quality or fairness, or claim that you have a license to use the datasets. It is your responsibility to determine whether you have permission to use the datasets under their license.
If you're a dataset owner and do not want your dataset to be included in this library, please get in touch through a GitHub issue. Thank you for your contribution to the ML community!

NABirds Dataset Citation Information

@inproceedings{,
  title = {Building a Bird Recognition App and Large Scale Dataset With Citizen Scientists: The Fine Print in Fine-Grained Dataset Collection},
  author = {Horn},
}

NABirds Dataset FAQs

What is the NABirds dataset for Python?

NABirds is a collection of 48,000 annotated photographs of 400 bird species commonly observed in North America. It is often used for fine-grained visual categorization experiments.

How to download the NABirds dataset in Python?

You can load the NABirds dataset with one line of code using the open-source package Activeloop Hub in Python. See the instructions above for loading the NABirds training subset or validation subset.

How can I use NABirds dataset in PyTorch or TensorFlow?

You can stream the NABirds dataset while training a model in PyTorch or TensorFlow with one line of code using the open-source package Activeloop Hub in Python. See the instructions above for training a model on NABirds with PyTorch or with TensorFlow.