VCTK Dataset
Load VCTK in Python with one line of code. VCTK contains about 400 sentences read by each of 109 native English speakers. Stream VCTK while training ML models. Visualize VCTK.
Visualization of the VCTK dataset on the Activeloop Platform

What is VCTK Dataset?

The VCTK dataset contains speech from 109 native speakers of English with diverse accents. Each speaker reads about 400 sentences: most were selected from a newspaper, and the rest are the Rainbow Passage and an elicitation paragraph intended to identify the speaker's accent. The Rainbow Passage and the elicitation paragraph are the same for all speakers. The newspaper texts were taken from The Herald (Glasgow), with permission from Herald & Times Group. Each speaker reads a different set of newspaper sentences, and each set was selected using a greedy algorithm.
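To make the idea of greedy sentence selection concrete, here is a minimal sketch. It is not the procedure used to build VCTK: it assumes the goal is to maximise coverage of some set of units, and it uses letter bigrams as a hypothetical stand-in for phonetic units. The names `units` and `greedy_select` are illustrative only.

```python
def units(sentence):
    """Stand-in for phonetic units: the set of letter bigrams in a sentence."""
    text = "".join(c for c in sentence.lower() if c.isalpha() or c == " ")
    return {text[i:i + 2] for i in range(len(text) - 1)}


def greedy_select(sentences, k):
    """Pick up to k sentences, each time taking the one that adds the most unseen units."""
    covered = set()
    chosen = []
    remaining = list(sentences)
    while remaining and len(chosen) < k:
        # Score every candidate by how many new units it would contribute.
        best = max(remaining, key=lambda s: len(units(s) - covered))
        if not units(best) - covered:
            break  # no remaining sentence adds new coverage
        chosen.append(best)
        covered |= units(best)
        remaining.remove(best)
    return chosen
```

Each pick depends on what has already been covered, which is why a sentence that looks useful on its own can be skipped once its units are already present.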

Download VCTK Dataset in Python

Instead of downloading the VCTK dataset manually, you can load it in Python with one line of code using our open-source package Hub.

Load VCTK Dataset Training Subset in Python

import hub
ds = hub.load("hub://activeloop/vctk")

VCTK Dataset Structure

VCTK Data Fields

  • audios: tensor containing audio file in wav format.
  • texts: tensor containing text transcript of the audio.
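A quick way to check these fields is to pull a single sample. The sketch below assumes the tensors are reachable as `ds.audios` and `ds.texts` and that `.numpy()` returns the decoded data. `load_vctk` and `sample_summary` are hypothetical helper names, and the exact array shapes depend on the dataset.

```python
def load_vctk():
    """Load the dataset lazily; requires the `hub` package and network access."""
    import hub  # imported here so sample_summary can be used without hub installed
    return hub.load("hub://activeloop/vctk")


def sample_summary(ds, index=0):
    """Summarise one sample of a dataset exposing `audios` and `texts` tensors."""
    audio = ds.audios[index].numpy()  # decoded waveform
    text = ds.texts[index].numpy()    # transcript as returned by the tensor
    return {
        "index": index,
        "audio_shape": tuple(audio.shape),
        "text": text,
    }
```

Calling `sample_summary(load_vctk())` needs the `hub` package and a network connection, since the data is streamed on access.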

VCTK Data Splits

How to use VCTK Dataset with PyTorch and TensorFlow in Python

Train a model on VCTK dataset with PyTorch in Python

Let's use Hub's built-in PyTorch one-line dataloader to connect the data to the compute:
dataloader = ds.pytorch(num_workers=0, batch_size=4, shuffle=False)
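Audio clips usually differ in length, so batching several of them may need padding. The sketch below shows one way to pad a batch. It assumes each sample is a mapping with an `audios` waveform (treated here as a flat sequence of numbers) and a `texts` transcript. `pad_collate` is a hypothetical name, and plain lists are used so the example runs without PyTorch.

```python
def pad_collate(samples, pad_value=0.0):
    """Pad every waveform in the batch to the length of the longest one."""
    waves = [list(s["audios"]) for s in samples]
    lengths = [len(w) for w in waves]  # original lengths, useful for masking later
    longest = max(lengths)
    padded = [w + [pad_value] * (longest - len(w)) for w in waves]
    return {
        "audios": padded,
        "lengths": lengths,
        "texts": [s["texts"] for s in samples],
    }
```

If `ds.pytorch` accepts a `collate_fn` argument in your Hub version, that would be the place to plug in a function like this, after converting the padded lists to tensors.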

Train a model on VCTK dataset with TensorFlow in Python

dataloader = ds.tensorflow()

Additional Information about VCTK Dataset

VCTK Dataset Description

  • Paper: Yamagishi, Junichi; Veaux, Christophe; MacDonald, Kirsten. CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit (version 0.92)
  • Point of Contact: N/A

VCTK Dataset Curators

Yamagishi, Junichi and Veaux, Christophe and MacDonald, Kirsten

VCTK Dataset Licensing Information

Hub users may have access to a variety of publicly available datasets. We do not host or distribute these datasets, vouch for their quality or fairness, or claim that you have a license to use the datasets. It is your responsibility to determine whether you have permission to use the datasets under their license.
If you're a dataset owner and do not want your dataset to be included in this library, please get in touch through a GitHub issue. Thank you for your contribution to the ML community!

VCTK Dataset Citation Information

@misc{yamagishi2019vctk,
  author={Yamagishi, Junichi and Veaux, Christophe and MacDonald, Kirsten},
  title={{CSTR VCTK Corpus}: English Multi-speaker Corpus for {CSTR} Voice Cloning Toolkit (version 0.92)},
  publisher={University of Edinburgh. The Centre for Speech Technology Research (CSTR)},
  year=2019,
  doi={10.7488/ds/2645},
}

VCTK Dataset FAQs

What is the VCTK dataset for Python?

The VCTK dataset is an audio dataset. It was created for building HMM-based text-to-speech synthesis systems, especially speaker-adaptive HMM-based speech synthesis that uses average voice models trained on multiple speakers together with speaker adaptation technologies.

How to download the VCTK dataset in Python?

You can load the VCTK dataset with one line of code using the open-source package Activeloop Hub in Python. See the section above on loading the VCTK dataset training subset in Python.

How can I use VCTK dataset in PyTorch or TensorFlow?

You can stream the VCTK dataset while training a model in PyTorch or TensorFlow with one line of code using the open-source package Activeloop Hub in Python. See the sections above on training a model on the VCTK dataset with PyTorch and with TensorFlow.