  • Make use_diary() more flexible with a custom file name argument.
  • Remove the flawed character vector method for the dplyr::count() function.
  • Update template diary to match changes to TAP GitHub repository.
  • Do not remove “AA” when using normal_state(c("AA", "XX"), na_rep = TRUE).
  • The end argument of abbrev_full() can now be controlled with abb_end in normal_address() (default TRUE).
  • Simplify normal_address() by leaving number/letter mixes alone.
  • Change the punct argument of str_normal() to take a replacement string.
  • Add string argument to guess_delim() to treat the input as a single line of a file, whether or not it contains \n.
  • Have the delim argument of read_names() default to guess_delim().
  • Add pad argument to normal_zip() (default FALSE) to control the use of str_pad() on ZIP codes without a leading zero.
  • abbrev_full() gains an end argument (default FALSE) to target only full values at the end of a string.
  • normal_address() uses end = TRUE in abbrev_full() to target only the street type.
abbrev_full("123 MOUNTAIN ROAD", full = usps_street, end = FALSE)
#> "123 MTN RD"
normal_address("123 MOUNTAIN ROAD", abbs = usps_street)
#> "123 MOUNTAIN RD"
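
The difference between end = FALSE and end = TRUE can be sketched in base R with an end-of-string anchor. This is only an illustration of the idea, not the package's implementation: abbrev_sketch() and the two-entry street_abbs lookup are hypothetical names introduced here.

```r
# Hypothetical two-entry lookup; the real usps_street table is much larger.
street_abbs <- c(MOUNTAIN = "MTN", ROAD = "RD")

# Hypothetical helper: replace full words with their abbreviations.
abbrev_sketch <- function(x, abbs, end = FALSE) {
  for (full in names(abbs)) {
    # With end = TRUE the pattern only matches at the end of the string.
    pattern <- if (end) {
      paste0("\\b", full, "$")
    } else {
      paste0("\\b", full, "\\b")
    }
    x <- gsub(pattern, abbs[[full]], x, perl = TRUE)
  }
  x
}

abbrev_sketch("123 MOUNTAIN ROAD", street_abbs, end = FALSE)
#> "123 MTN RD"
abbrev_sketch("123 MOUNTAIN ROAD", street_abbs, end = TRUE)
#> "123 MOUNTAIN RD"
```

Anchoring to the end of the string leaves words such as MOUNTAIN alone when they are part of the street name rather than the street type.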
  • Deprecate col_date_usa() in favor of col_date_mdy().
  • normal_address() now only puts spaces between letters and numbers if the string either starts with letters or ends with numbers:
normal_address("12east 2nd street, 3rd floor", abbs = usps_street)
#> "12 E 2ND ST 3 RD FL" # old output
#> "12 E 2ND ST 3RD FL" # fixed output
  • normal_address() now keeps the forward slash in C/O.
  • Rename which_in() to what_in() to avoid confusion with which().
  • Remove http_filename().
  • Remove count_vec() in favor of new dplyr::count() method for characters.
  • Valid ZIP codes 22222, 44444, 55555 are not removed with normal_zip(na_rep = TRUE).
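
The ZIP code entry above (22222, 44444 and 55555 are kept when repeated-digit values are otherwise removed) can be sketched in base R as a repeated-character check with a whitelist. This is a minimal sketch under stated assumptions, not the code used by normal_zip(): drop_repeated_zip() and valid_repeats are hypothetical names.

```r
# Hypothetical whitelist built from the three codes named in the entry.
valid_repeats <- c("22222", "44444", "55555")

# Hypothetical helper: blank out repeated-character ZIP codes unless whitelisted.
drop_repeated_zip <- function(zip, keep = valid_repeats) {
  # TRUE where the whole string is one character repeated, e.g. "00000".
  repeated <- grepl("^(.)\\1+$", zip, perl = TRUE)
  zip[repeated & !(zip %in% keep)] <- NA_character_
  zip
}

drop_repeated_zip(c("00000", "22222", "99999", "05401"))
#> NA "22222" NA "05401"
```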
  • file_encoding() now fails on Solaris and tests are skipped.
    • The command line file tool, called via system2(), returns different results on Solaris than on other Unix-alike or Windows systems, so its output cannot be reproduced consistently.
  • Tests and examples run more confidently.
  • Improve the handling of internal data and vignette.
  • Create read_names().
  • flag_dupes() has an argument to flag both duplicates.
  • Add encoding check to use_diary().
  • Improve documentation examples.
  • flag_dupes() now also calls duplicated(fromLast = TRUE) to capture all duplicate rows, including the first occurrence.
  • Separate address [:digit:] from [:alpha:] with a space.
  • Create prop_distinct(), which_in() and which_out().
  • Comment out examples for fetching and checking cities.
  • Use the ‘fs’ package for byte sizes and file paths.
  • Create use_diary() with template diary.
  • Remove print_all().
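
The flag_dupes() entries above rely on a base R idiom: duplicated() alone marks only the later copies of a row, while combining it with duplicated(fromLast = TRUE) marks every copy. A minimal sketch follows; the data frame and the dupe_flag column name are chosen here for illustration and are not taken from the package source.

```r
# Toy data: rows 1 and 3 are identical.
df <- data.frame(
  name = c("a", "b", "a", "c"),
  amount = c(1, 2, 1, 3),
  stringsAsFactors = FALSE
)

# Marks only the second copy (row 3).
later_only <- duplicated(df)

# Marks both copies (rows 1 and 3) by also scanning from the last row.
both <- duplicated(df) | duplicated(df, fromLast = TRUE)

df$dupe_flag <- both
df$dupe_flag
#> TRUE FALSE TRUE FALSE
```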