String Processing

Utilities for pre-processing strings

processString()
Process a String

Fuzz Ratios

Fuzzy string matching ratio functions

fuzz_ratio()
Simple Ratio Calculation
fuzz_partial_ratio()
Partial Ratio Calculation
fuzz_token_sort_ratio()
Token Sort Ratio Calculation
fuzz_token_set_ratio()
Token Set Ratio Calculation
fuzz_token_ratio()
Combined Token Ratio
fuzz_partial_token_sort_ratio()
Partial Token Sort Ratio Calculation
fuzz_partial_token_set_ratio()
Partial Token Set Ratio Calculation
fuzz_partial_token_ratio()
Combined Partial Token Ratio
fuzz_WRatio()
Weighted Ratio Calculation
fuzz_QRatio()
Quick Ratio Calculation

Functions for extracting best matches from a set of choices

extract_similar_strings()
Extract Matches
extract_best_match()
Extract Best Match
extract_matches()
Extract Matches with Scoring and Limit

Levenshtein

Levenshtein distance and similarity

levenshtein_distance()
Levenshtein Distance
levenshtein_normalized_distance()
Normalized Levenshtein Distance
levenshtein_normalized_similarity()
Normalized Levenshtein Similarity
levenshtein_similarity()
Levenshtein Similarity
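
To make the relationship between these four functions concrete, here is a minimal standalone Python sketch. It is not the package's implementation or API: the function names merely mirror the entries above, and the definitions of similarity and normalization (based on the longer string's length, with unit edit costs) are assumptions modelled on the usual RapidFuzz conventions.

```python
# Illustrative sketch only; names mirror the index entries but this is
# NOT the package's code. Unit costs for insert/delete/substitute are assumed.

def levenshtein_distance(a: str, b: str) -> int:
    """Minimum number of single-character inserts, deletes and substitutions."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def levenshtein_similarity(a: str, b: str) -> int:
    """Assumed definition: longer length minus distance."""
    return max(len(a), len(b)) - levenshtein_distance(a, b)


def levenshtein_normalized_distance(a: str, b: str) -> float:
    """Assumed definition: distance scaled into [0, 1] by the longer length."""
    longest = max(len(a), len(b))
    return levenshtein_distance(a, b) / longest if longest else 0.0


def levenshtein_normalized_similarity(a: str, b: str) -> float:
    """Complement of the normalized distance."""
    return 1.0 - levenshtein_normalized_distance(a, b)
```

Under these assumptions, "kitten" and "sitting" have distance 3, similarity 4, and normalized distance 3/7.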

Damerau-Levenshtein

Damerau-Levenshtein distance and similarity

damerau_levenshtein_distance()
Damerau-Levenshtein Distance
damerau_levenshtein_normalized_distance()
Normalized Damerau-Levenshtein Distance
damerau_levenshtein_normalized_similarity()
Normalized Damerau-Levenshtein Similarity
damerau_levenshtein_similarity()
Damerau-Levenshtein Similarity

Hamming

Hamming distance and similarity

hamming_distance()
Hamming Distance
hamming_normalized_distance()
Normalized Hamming Distance
hamming_normalized_similarity()
Normalized Hamming Similarity
hamming_similarity()
Hamming Similarity
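
As a rough illustration of what the Hamming functions measure, the Python sketch below counts mismatched positions between two equal-length strings. It is a standalone example, not the package's code; in particular, rejecting strings of different lengths is an assumption made here for simplicity, and the package may handle that case differently (for example by padding).

```python
# Illustrative sketch only; NOT the package's implementation.
# Assumption: inputs must have equal length (a real library may pad instead).

def hamming_distance(a: str, b: str) -> int:
    """Number of positions at which the two strings differ."""
    if len(a) != len(b):
        raise ValueError("this sketch requires strings of equal length")
    return sum(ca != cb for ca, cb in zip(a, b))


def hamming_similarity(a: str, b: str) -> int:
    """Number of positions at which the two strings agree."""
    return len(a) - hamming_distance(a, b)


def hamming_normalized_distance(a: str, b: str) -> float:
    """Fraction of differing positions, in [0, 1]."""
    return hamming_distance(a, b) / len(a) if a else 0.0


def hamming_normalized_similarity(a: str, b: str) -> float:
    """Fraction of matching positions, in [0, 1]."""
    return 1.0 - hamming_normalized_distance(a, b)
```

For "karolin" and "kathrin" this gives a distance of 3 and a similarity of 4.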

Jaro

Jaro distance and similarity

jaro_distance()
Jaro Distance
jaro_normalized_distance()
Normalized Jaro Distance
jaro_normalized_similarity()
Normalized Jaro Similarity
jaro_similarity()
Jaro Similarity

Jaro-Winkler

Jaro-Winkler distance and similarity

jaro_winkler_distance()
Jaro-Winkler Distance
jaro_winkler_normalized_distance()
Normalized Jaro-Winkler Distance
jaro_winkler_normalized_similarity()
Normalized Jaro-Winkler Similarity
jaro_winkler_similarity()
Jaro-Winkler Similarity

Indel

Insertion/deletion distance and similarity

indel_distance()
Indel Distance
indel_normalized_distance()
Normalized Indel Distance
indel_normalized_similarity()
Normalized Indel Similarity
indel_similarity()
Indel Similarity
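
The Indel distance allows only insertions and deletions, so it can be derived from the length of the longest common subsequence (LCS). The Python sketch below shows that relationship. It is a standalone illustration, not the package's implementation, and normalizing by the combined length of both strings is an assumption based on the usual RapidFuzz convention.

```python
# Illustrative sketch only; NOT the package's implementation.

def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of a and b."""
    prev = [0] * (len(b) + 1)
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, start=1):
            if ca == cb:
                curr.append(prev[j - 1] + 1)            # extend a common subsequence
            else:
                curr.append(max(prev[j], curr[j - 1]))  # skip one character
        prev = curr
    return prev[-1]


def indel_distance(a: str, b: str) -> int:
    """Characters to delete from a plus characters to insert to reach b."""
    return len(a) + len(b) - 2 * lcs_length(a, b)


def indel_normalized_distance(a: str, b: str) -> float:
    """Assumed definition: distance divided by the combined length."""
    total = len(a) + len(b)
    return indel_distance(a, b) / total if total else 0.0


def indel_normalized_similarity(a: str, b: str) -> float:
    """Complement of the normalized distance."""
    return 1.0 - indel_normalized_distance(a, b)
```

For "abcde" and "ace" the LCS has length 3, so the Indel distance is 5 + 3 - 6 = 2 and the normalized distance is 2/8 = 0.25.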

LCSseq

Longest Common Subsequence distance and similarity

lcs_seq_distance()
LCSseq Distance
lcs_seq_editops()
LCSseq Edit Operations
lcs_seq_normalized_distance()
Normalized LCSseq Distance
lcs_seq_normalized_similarity()
Normalized LCSseq Similarity
lcs_seq_similarity()
LCSseq Similarity

OSA

Optimal String Alignment distance and similarity

osa_distance()
Distance Using OSA
osa_editops()
Edit Operations Using OSA
osa_normalized_distance()
Normalized Distance Using OSA
osa_normalized_similarity()
Normalized Similarity Using OSA
osa_similarity()
Similarity Using OSA
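
Optimal String Alignment extends Levenshtein distance with transpositions of two adjacent characters, under the restriction that no substring is edited more than once. The Python sketch below is a textbook version of that recurrence with unit costs, offered as an illustration rather than as the package's implementation.

```python
# Illustrative sketch only; NOT the package's implementation.
# Unit costs are assumed for insert, delete, substitute and transpose.

def osa_distance(a: str, b: str) -> int:
    """Optimal String Alignment (restricted Damerau-Levenshtein) distance."""
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete every character of a[:i]
    for j in range(cols):
        d[0][j] = j  # insert every character of b[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,         # deletion
                          d[i][j - 1] + 1,         # insertion
                          d[i - 1][j - 1] + cost)  # substitution or match
            # Adjacent transposition: "xy" in a lines up with "yx" in b.
            if (i > 1 and j > 1
                    and a[i - 1] == b[j - 2]
                    and a[i - 2] == b[j - 1]):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]
```

With this recurrence, "ab" to "ba" costs 1 (one transposition), while "ca" to "abc" costs 3, because the restriction prevents transposing "ca" and then inserting a character between the swapped letters.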

Prefix

Prefix distance and similarity

prefix_distance()
Calculate the prefix distance between two strings
prefix_normalized_distance()
Calculate the normalized prefix distance between two strings
prefix_normalized_similarity()
Calculate the normalized prefix similarity between two strings
prefix_similarity()
Calculate the prefix similarity between two strings

Postfix

Postfix distance and similarity

postfix_distance()
Postfix Distance
postfix_normalized_distance()
Normalized Postfix Distance
postfix_normalized_similarity()
Normalized Postfix Similarity
postfix_similarity()
Postfix Similarity
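
The Prefix and Postfix measures compare only the leading or trailing characters that two strings share. The Python sketch below illustrates one plausible reading: similarity is the length of the common prefix (or suffix), and distance is the longer string's length minus that similarity. Both definitions are assumptions based on the usual RapidFuzz conventions, and the code is a standalone illustration rather than the package's implementation.

```python
# Illustrative sketch only; NOT the package's implementation.

def prefix_similarity(a: str, b: str) -> int:
    """Length of the common prefix of a and b."""
    count = 0
    for ca, cb in zip(a, b):
        if ca != cb:
            break
        count += 1
    return count


def prefix_distance(a: str, b: str) -> int:
    """Assumed definition: longer length minus common-prefix length."""
    return max(len(a), len(b)) - prefix_similarity(a, b)


def postfix_similarity(a: str, b: str) -> int:
    """Length of the common suffix, computed on the reversed strings."""
    return prefix_similarity(a[::-1], b[::-1])


def postfix_distance(a: str, b: str) -> int:
    """Assumed definition: longer length minus common-suffix length."""
    return max(len(a), len(b)) - postfix_similarity(a, b)
```

For example, "prefix" and "preview" share the prefix "pre" (similarity 3, distance 4), and "testing" and "resting" share the suffix "esting" (similarity 6, distance 1).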

Edit Operations

Functions for computing and applying edit operations

get_editops()
Get Edit Operations
editops_apply_str()
Apply Edit Operations to String
editops_apply_vec()
Apply Edit Operations to Vector
opcodes_apply_str()
Apply Opcodes to String
opcodes_apply_vec()
Apply Opcodes to Vector