A dataset compiled by Tomas Englethaler for his research on humor. https://github.com/tomasengelthaler/HumorNorms Please visit his page for more details on the methodology used to score words.

data(humor_dataset)

Format

A data frame with 4997 rows and 16 variables:

word

string of the actual word

mean

mean of humor rating across all audiences

mean_F

mean of humor rating (women)

mean_M

mean of humor rating (men)

mean_old

mean of humor rating (old)

mean_young

mean of humor rating (young)

n

audience size

n_F

audience size (women)

n_M

audience size (men)

n_old

audience size (old)

n_young

audience size (young)

sd

sd of humor rating across all audiences

sd_F

sd humor rating (women)

sd_M

sd of humor rating (men)

sd_old

sd humor rating (old)

sd_young

sd of humor rating (young)

Source

https://github.com/tomasengelthaler/HumorNorms