State of the State
Creators
Description
Description
We conducted a text analysis of all 50 governors' 2019 state of the state speeches to see what issues were talked about the most and whether there were differences between what Democratic and Republican governors were focusing on.
index.csv contains a listing of each of the 50 speeches, one for each state as well as the name and party of the state's governor and a link to an official source for the speech.
words.csv contains every one-word phrase that was mentioned in at least 10 speeches and every two- or three-word phrase that was mentioned in at least five speeches after a list of stop-words was removed and the word "healthcare" was replaced with "health care" so that they were not counted as distinct phrases. It also contains the results of a chi^2 test that shows the statistical significance of and associated p-value of phrases.
Files
index.csv
Variables
| Name | Description |
|---|---|
| n-gram | one-, two- or three-word phrase |
| category | thematic categories for n-grams hand-coded by FiveThirtyEight staff: economy/fiscal issues, education, health care, energy/environment, crime/justice, mental health/substance abuse |
| d_speeches | number of Democratic speeches containing the n-gram |
| r_speeches | number of Republican speeches containing the n-gram |
| total | total number of speeches containing the n-gram |
| percent_of_d_speeches | percent of the 23 Democratic speeches containing the phrase |
| percent_of_r_speeches | percent of the 27 Republican speeches containing the phrase |
| chi2 | chi^2 statistic |
| pval | p-value for chi^2 test |
Details
| Resource type | Open dataset |
| Title | State of the State |
| Creators |
|
| Size | 134.5 kB |
| Formats | Comma-separated values (CSV) (.csv) |
| License(s) | no license information available |
| External Resource | https://github.com/fivethirtyeight/data/tree/master/state-of-the-state |