A dataset derived from https://www.kaggle.com/vered1986/propernames-categories/version/1. The words are British focused, and I have adjusted some classifications to be easier for users to work with.

data(proper_nouns)

Format

A data frame with 747 rows and 2 variables:

word

string of the actual word

pos

part of speech (aka celebrity, place, etc)

Source

https://www.kaggle.com/vered1986/propernames-categories/version/1