Package: clustringr 1.0

clustringr: Cluster Strings by Edit-Distance

Returns an edit-distance based clusterization of an input vector of strings. Each cluster will contain a set of strings w/ small mutual edit-distance (e.g., levenshtein, optimum-sequence-alignment, damerau-lev), as computed by stringdist::stringdist(). The set of all mutual edit-distances is then used by g graph algorithms (from package igraph) to single out subsets of high connectivity.

Authors:Dan S. Reznik

clustringr_1.0.tar.gz
clustringr_1.0.zip(r-4.5)clustringr_1.0.zip(r-4.4)clustringr_1.0.zip(r-4.3)
clustringr_1.0.tgz(r-4.4-any)clustringr_1.0.tgz(r-4.3-any)
clustringr_1.0.tar.gz(r-4.5-noble)clustringr_1.0.tar.gz(r-4.4-noble)
clustringr_1.0.tgz(r-4.4-emscripten)clustringr_1.0.tgz(r-4.3-emscripten)
clustringr.pdf |clustringr.html
clustringr/json (API)

# Install 'clustringr' in R:
install.packages('clustringr', repos = c('https://dan-reznik.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/dan-reznik/clustringr/issues

Datasets:

On CRAN:

clusteringgraphsstrings

3.88 score 15 stars 4 scripts 174 downloads 2 exports 56 dependencies

Last updated 6 years agofrom:ad2b777bbf. Checks:OK: 1 NOTE: 6. Indexed: yes.

TargetResultDate
Doc / VignettesOKNov 10 2024
R-4.5-winNOTENov 10 2024
R-4.5-linuxNOTENov 10 2024
R-4.4-winNOTENov 10 2024
R-4.4-macNOTENov 10 2024
R-4.3-winNOTENov 10 2024
R-4.3-macNOTENov 10 2024

Exports:cluster_plotcluster_strings

Dependencies:assertthatcachemclicolorspacecpp11dplyrfansifarverfastmapforcatsgenericsggforceggplot2ggraphggrepelgluegraphlayoutsgridExtragtableigraphisobandlabelinglatticelifecyclemagrittrMASSMatrixmemoisemgcvmunsellnlmepillarpkgconfigpolyclippurrrR6RColorBrewerRcppRcppArmadilloRcppEigenrlangscalesstringdiststringistringrsystemfontstibbletidygraphtidyrtidyselecttweenrutf8vctrsviridisviridisLitewithr