The VCTK dataset contains speech recorded by 109 native speakers of English with diverse accents. Each speaker reads about 400 sentences: most are drawn from a newspaper, and the rest come from the Rainbow Passage and an elicitation paragraph used to identify the speaker's accent. The Rainbow Passage and the elicitation paragraph are the same for all speakers. The newspaper texts were taken from The Herald (Glasgow), with permission from Herald & Times Group. Each speaker reads a different set of newspaper sentences, and each set was selected using a greedy algorithm.
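
The description does not say what the greedy algorithm optimizes, so the following is only a minimal sketch of how greedy sentence selection can work in general, not the procedure used to build VCTK. It assumes the goal is to cover as many distinct units as possible, and it uses character bigrams as a hypothetical stand-in for phonetic context. The names `units` and `greedy_select` are illustrative.

```python
def units(sentence: str) -> set[str]:
    """Hypothetical coverage units: character bigrams of the lowercased text."""
    text = "".join(ch for ch in sentence.lower() if ch.isalpha() or ch == " ")
    return {text[i:i + 2] for i in range(len(text) - 1)}


def greedy_select(pool: list[str], k: int) -> list[str]:
    """Pick up to k sentences, each time taking the one that adds the most new units."""
    covered: set[str] = set()
    chosen: list[str] = []
    remaining = list(pool)
    while remaining and len(chosen) < k:
        # Score each candidate by how many not-yet-covered units it contributes.
        best = max(remaining, key=lambda s: len(units(s) - covered))
        if not units(best) - covered:
            break  # No remaining sentence adds anything new.
        chosen.append(best)
        covered |= units(best)
        remaining.remove(best)
    return chosen
```

Under the same assumptions, one way to give each speaker a different set would be to remove the chosen sentences from `pool` before running the selection again for the next speaker.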