The LIAR dataset contains 12.8 thousand short, manually labeled statements collected through the politifact.com API, each checked for its authenticity by the politifact.com editors. Repeated labels were found and merged. The truthfulness ratings use six fine-grained labels: pants-fire, false, barely-true, half-true, mostly-true, and true. A second-stage verification was required to balance the distribution of the pants-fire label. For this, a randomly sampled subset of the analysis reports was checked against the reporters’ analysis, and the rate of agreement was measured with Cohen's kappa. Metadata such as party affiliation, current job, home state, and credit history is also included for each speaker in the LIAR dataset. The credit history consists of the historical counts of inaccurate statements for each speaker. Broad topic coverage is ensured by including a wide variety of subjects discussed by the speakers; the top-10 most discussed subjects are also included.
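
The agreement measure mentioned above, Cohen's kappa, compares the observed agreement p_o between two annotators with the agreement p_e expected by chance, as kappa = (p_o - p_e) / (1 - p_e). The following is a minimal sketch of that computation, not the procedure used to build LIAR. The function name `cohen_kappa` and the toy label lists are hypothetical and chosen only for illustration.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters who labeled the same items."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(ratings_a)

    # Observed agreement: fraction of items given the same label by both raters.
    p_o = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement: for each label, the product of the two raters'
    # marginal frequencies, summed over all labels.
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    p_e = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)

    if p_e == 1.0:
        # Both raters used one identical label for every item.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


# Hypothetical toy labels (not taken from LIAR): the raters disagree on one item.
editor = ["true", "false", "half-true", "false", "pants-fire", "true"]
checker = ["true", "false", "half-true", "true", "pants-fire", "true"]
kappa = cohen_kappa(editor, checker)  # 10/13, roughly 0.77
```

Unlike raw percent agreement (5 of 6 items here), kappa discounts the matches that two raters would produce by chance given how often each of them uses each label.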