Return consistent version of a city names using stringr::str_*()
functions.
Letters are capitalized, hyphens and underscores are replaced with
whitespace, other punctuation is removed, numbers are removed, and excess
whitespace is trimmed and squished. Optionally, geographic abbreviations
("MT") can be replaced with their long form ("MOUNT"). Invalid addresses from
a vector can be removed (possibly using invalid_city) as well as single
(repeating) character strings ("XXXXXX").
normal_city(city, abbs = NULL, states = NULL, na = c("", "NA"), na_rep = FALSE)
A vector of city names.
A named vector or data frame of abbreviations passed to
expand_abbrev; see expand_abbrev for format of abb
argument or use
the usps_city tibble.
A vector of state abbreviations ("VT") to remove from the end (and only end) of city names ("STOWE VT").
A vector of values to make NA
(useful with the invalid_city
vector).
logical; If TRUE
, replace all single digit (repeating)
strings with NA
.
A vector of normalized city names.
Other geographic normalization functions:
abbrev_full()
,
abbrev_state()
,
check_city()
,
expand_abbrev()
,
expand_state()
,
fetch_city()
,
normal_address()
,
normal_state()
,
normal_zip()
,
str_normal()