There is a newer version of the record available.

Published 2019 | Version v2
Open dataset

State of the State

Description

We conducted a text analysis of all 50 governors' 2019 state of the state speeches to see which issues were discussed most often and whether Democratic and Republican governors focused on different issues.

index.csv lists the 50 speeches, one per state, along with the name and party of each state's governor and a link to an official source for the speech.

words.csv contains every one-word phrase that was mentioned in at least 10 speeches and every two- or three-word phrase that was mentioned in at least five speeches. Before counting, a list of stop-words was removed and the word "healthcare" was replaced with "health care" so that the two spellings were not counted as distinct phrases. The file also contains the results of a chi^2 test of statistical significance for each phrase, reported as the test statistic and its associated p-value.
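The counting rules described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build words.csv: the stop-word list, the tokenizer, and the choice to drop stop-words before forming phrases are all hypothetical, since the record does not specify them. Only the thresholds (10 speeches for one-word phrases, five for two- and three-word phrases) and the "healthcare" replacement come from the description.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list; the list actually used is not given.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "for", "is", "we", "our"}

# Minimum number of speeches per phrase length, as stated in the description.
MIN_SPEECHES = {1: 10, 2: 5, 3: 5}


def tokenize(text):
    """Lowercase, rewrite 'healthcare' as 'health care', then drop stop-words."""
    text = text.lower().replace("healthcare", "health care")
    words = re.findall(r"[a-z']+", text)  # assumed tokenizer: runs of letters
    return [w for w in words if w not in STOP_WORDS]


def ngrams_in_speech(text, max_n=3):
    """Return the set of distinct 1- to max_n-word phrases in one speech."""
    tokens = tokenize(text)
    found = set()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            found.add(" ".join(tokens[i:i + n]))
    return found


def speech_counts(speeches, min_speeches=MIN_SPEECHES):
    """Count how many speeches contain each phrase, then apply the thresholds."""
    counts = Counter()
    for text in speeches:
        # A set is passed in, so each speech counts at most once per phrase.
        counts.update(ngrams_in_speech(text))
    return {
        phrase: n
        for phrase, n in counts.items()
        if n >= min_speeches[len(phrase.split())]
    }
```

Calling `speech_counts` on a list of 50 speech transcripts would give one count per surviving phrase, comparable in spirit to the total column. The counts are per speech, not per occurrence, which is why each speech is reduced to a set of phrases first.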

Variables

Name                   Description
n-gram                 one-, two- or three-word phrase
category               thematic categories for n-grams hand-coded by FiveThirtyEight staff: economy/fiscal issues, education, health care, energy/environment, crime/justice, mental health/substance abuse
d_speeches             number of Democratic speeches containing the n-gram
r_speeches             number of Republican speeches containing the n-gram
total                  total number of speeches containing the n-gram
percent_of_d_speeches  percent of the 23 Democratic speeches containing the phrase
percent_of_r_speeches  percent of the 27 Republican speeches containing the phrase
chi2                   chi^2 statistic
pval                   p-value for chi^2 test
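To show how the count, percent, and chi^2 columns relate, here is a minimal sketch that derives a chi^2 statistic and p-value from d_speeches and r_speeches. It assumes a Pearson chi^2 test on a 2x2 table (party by whether the speech contains the phrase) with no continuity correction. The record does not say which variant was used, so values computed this way may not match the published chi2 and pval columns. The function names are hypothetical.

```python
import math

N_DEM = 23  # Democratic speeches, per the variable descriptions
N_REP = 27  # Republican speeches, per the variable descriptions


def chi2_2x2(d_speeches, r_speeches, n_dem=N_DEM, n_rep=N_REP):
    """Pearson chi^2 (assumed: no continuity correction) for a 2x2 table."""
    # Rows: party. Columns: speeches with the phrase, speeches without it.
    observed = [
        [d_speeches, n_dem - d_speeches],
        [r_speeches, n_rep - r_speeches],
    ]
    total = n_dem + n_rep
    row_sums = [n_dem, n_rep]
    col_sums = [d_speeches + r_speeches, total - d_speeches - r_speeches]
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_sums[i] * col_sums[j] / total
            # Skip empty columns (phrase in every speech or in none).
            if expected > 0:
                chi2 += (observed[i][j] - expected) ** 2 / expected
    # With 1 degree of freedom, the chi^2 survival function is erfc(sqrt(x / 2)).
    pval = math.erfc(math.sqrt(chi2 / 2))
    return chi2, pval


def percent_of_speeches(count, n_speeches):
    """Share of a party's speeches containing the phrase, in percent."""
    return 100 * count / n_speeches
```

For a row of words.csv, `chi2_2x2(d_speeches, r_speeches)` returns a statistic and p-value to compare against chi2 and pval, and `percent_of_speeches(d_speeches, 23)` corresponds to percent_of_d_speeches. Any mismatch against the published columns would suggest a different test variant, such as one with a continuity correction.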

Details

Resource type Open dataset
Title State of the State
Creators
  • FiveThirtyEight
Formats Comma-separated values (CSV) (.csv)
License(s) no license information available
External Resource https://github.com/fivethirtyeight/data/tree/master/state-of-the-state