v2.5.0
Datasets ⭐
EXAMPLE CODE
GTZAN Music Speech Dataset
Load the GTZAN Music Speech dataset in Python. 120 tracks, each containing 30 seconds of audio. Stream GTZAN Music Speech dataset while training ML models.
Visualisation of Gtzan Music Speech on the Activeloop Platform

GTZAN Music Speech Dataset

What is GTZAN Music Speech Dataset?

GTZAN Music Speech dataset was created for the purposes of music/speech discrimination and is similar to the GTZAN Genre dataset. The dataset consists of 120 tracks, each containing 30 seconds of audio. The tracks in the dataset are all 22050Hz Mono 16-bit audio files in .wav format. Also, each class (music/speech) in the GTZAN Music Speech dataset has 60 samples.

Download GTZAN Music Speech Dataset in Python

Instead of downloading the GTZAN Music Speech dataset in Python, you can effortlessly load it in Python via our open-source package Hub with just one line of code.

Load GTZAN Music Speech Dataset Training Subset in Python

1
import hub
2
ds = hub.load("hub://activeloop/gtzan-music-speech")
Copied!

GTZAN Music Speech Dataset Structure

GTZAN Music Speech Data Fields

  • audio: tensor contains audio file .wav format.
  • label: tensor representing the .wav files as music or speech.

GTZAN Music Speech Data Splits

How to use GTZAN Music Speech Dataset with PyTorch and TensorFlow in Python

Train a model on GTZAN Music Speech dataset with PyTorch in Python

Let's use Hub's built-in PyTorch one-line dataloader to connect the data to the compute:
1
dataloader = ds.pytorch(num_workers=0, batch_size=4, shuffle=False)
Copied!

Train a model on GTZAN Music Speech dataset with TensorFlow in Python

1
dataloader = ds.tensorflow()
Copied!

Additional Information about GTZAN Music Speech Dataset

GTZAN Music Speech Dataset Description

GTZAN Music Speech Dataset Curators

Tzanetakis, George

GTZAN Music Speech Dataset Licensing Information

Hub users may have access to a variety of publicly available datasets. We do not host or distribute these datasets, vouch for their quality or fairness, or claim that you have a license to use the datasets. It is your responsibility to determine whether you have permission to use the datasets under their license.
If you're a dataset owner and do not want your dataset to be included in this library, please get in touch through a GitHub issue. Thank you for your contribution to the ML community!

GTZAN Music Speech Dataset Citation Information

1
@ONLINE {Music Speech,
2
author = "Tzanetakis, George",
3
title = "GTZAN Music/Speech Collection",
4
year = "1999",
5
url = "http://marsyas.info/index.html"
6
}
Copied!

GTZAN Music Speech Dataset FAQs

What is the GTZAN Music Speech dataset for Python?

The GTZAN Music Speech dataset consists of 120 tracks, each containing 30 seconds of audio. The dataset was created for music/speech discrimination and is similar to the GTZAN Genre dataset. The tracks in the dataset are 22050Hz Mono 16-bit audio files and each class (music/speech) has 60 samples.

What is the GTZAN Music Speech dataset used for?

The GTZAN Music Speech dataset is often used for music-speech classification in the domain of machine learning.
How to download the GTZAN Music Speech dataset in Python?
You can load GTZAN Music Speech dataset fast with one line of code using the open-source package Activeloop Hub in Python. See detailed instructions on how to load GTZAN Music Speech dataset training subset in Python.

How can I use GTZAN Music Speech dataset in PyTorch or TensorFlow?

You can stream GTZAN Music Speech dataset while training a model in PyTorch or TensorFlow with one line of code using the open-source package Activeloop Hub in Python. See detailed instructions on how to train a model on GTZAN Music Speech dataset with PyTorch in Python or train a model on GTZAN Music Speech dataset with TensorFlow in Python.