Package: fedmatch 2.1.0

fedmatch: Fast, Flexible, and User-Friendly Record Linkage Methods

Provides a flexible set of tools for matching two un-linked data sets. 'fedmatch' allows for three ways to match data: exact matches, fuzzy matches, and multi-variable matches. It also allows an easy combination of these three matches via the tier matching function.

Authors:Melanie Friedrichs [aut], Chris Webster [aut, cre], Blake Marsh [aut], Jacob Dice [aut], Seung Lee [aut]

fedmatch_2.1.0.tar.gz
fedmatch_2.1.0.zip(r-4.7)fedmatch_2.1.0.zip(r-4.6)fedmatch_2.1.0.zip(r-4.5)
fedmatch_2.1.0.tgz(r-4.6-x86_64)fedmatch_2.1.0.tgz(r-4.6-arm64)fedmatch_2.1.0.tgz(r-4.5-x86_64)fedmatch_2.1.0.tgz(r-4.5-arm64)
fedmatch_2.1.0.tar.gz(r-4.7-arm64)fedmatch_2.1.0.tar.gz(r-4.7-x86_64)fedmatch_2.1.0.tar.gz(r-4.6-arm64)fedmatch_2.1.0.tar.gz(r-4.6-x86_64)
fedmatch_2.1.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html
DESCRIPTION |NEWS
card.svg |card.png
fedmatch/json (API)

# Install 'fedmatch' in R:
install.packages('fedmatch', repos = c('https://c0webster.r-universe.dev', 'https://cloud.r-project.org'))
Uses libs:
  • c++– GNU Standard C++ Library v3
  • openmp– GCC OpenMP (GOMP) support library
Datasets:

On CRAN:

Conda:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

cppopenmp

4.89 score 1 stars 1 packages 103 scripts 843 downloads 14 exports 25 dependencies

Last updated from:3aae39ec12. Checks:13 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-arm64OK205
linux-devel-x86_64OK158
source / vignettesOK182
linux-release-arm64OK198
linux-release-x86_64OK157
macos-release-arm64OK157
macos-release-x86_64OK212
macos-oldrel-arm64OK184
macos-oldrel-x86_64OK292
windows-develOK123
windows-releaseOK129
windows-oldrelOK131
wasm-releaseOK163

Exports:%>%build_clean_settingsbuild_corpusbuild_fuzzy_settingsbuild_multivar_settingsbuild_score_settingsbuild_tiercalculate_weightsclean_stringsmatch_evaluatemerge_plustier_matchwgt_jaccard_distanceword_frequency

Dependencies:BHclidata.tablefarverforcatsgluelabelinglifecyclemagrittrpillarpkgconfigpurrrR6RColorBrewerRcpprlangscalesSnowballCstringdiststringistringrtibbleutf8vctrsviridisLite

Fuzzy Matching
Intro | Fuzzy matching theory | Jaro-Winkler | Weighted Jaccard Similarity | Other similarity metrics | Using fuzzy matching in fedmatch | Basic Syntax | The order of which is data1 and which is data2 matters! | An example - weighted Jaccard match | Weighted Jaccard Match

Last update: 2025-01-31
Started: 2021-07-20

Introduction to fedmatch
Background | Name cleaning | Basics: merge_plus | Exact matching | Fuzzy matching | Multivar matching | Summary and next steps

Last update: 2025-01-31
Started: 2021-07-20

Tier Matching
Overview | Syntax

Last update: 2025-01-31
Started: 2021-07-20

Multivar Matching
Background | Syntax | Return value | Calculating weights from a training set | Using a logit model instead of a linear sum | Parallelization and saving time

Last update: 2024-05-21
Started: 2021-07-20

Using clean_strings
sp_char_words | common_words | Remove characters and words | stem

Last update: 2021-07-20
Started: 2021-07-20