Does my text get uploaded?

No. This tool runs entirely in your browser. Your text never leaves your device.

What’s the difference between character and word mode?

Character mode compares at the character level (good for short strings). Word mode compares tokens split by whitespace (good for sentences/paragraphs).

How is similarity calculated?

Similarity is 1 − (distance / max(length(A), length(B))), shown as a percentage. In word mode, length means token count.

Levenshtein Distance — Compare Two Strings

Compute edit distance & similarity and see a highlighted visual diff. Private by design—everything runs locally.

Input & Options

String A

String B

Options: Character mode · Word mode · Case sensitive · Trim leading/trailing whitespace

Tips: Ctrl/Cmd + K focuses “A”. Ctrl/Cmd + Enter runs Compare.

Results

Distance: —

Similarity: —

Ops: —

Lengths: —

Legend: Deletion (A only) · Insertion (B only) · Substitution

A with changes highlighted

B with changes highlighted

Show aligned operations

What is Levenshtein Distance?

Levenshtein distance (also called edit distance) is the smallest number of single-character edits needed to turn one string into another. The allowed edits are insertion, deletion, and substitution. For example, turning kitten into sitting takes three edits (k→s, e→i, insert g), so the distance is 3.
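
The definition above can be written down almost word for word as a recurrence. The following is a minimal sketch, not the tool's actual code: the function name `levenshtein_recursive` is our own, and the memoized recursion is chosen for readability, so it is only suitable for short strings.

```python
from functools import lru_cache


def levenshtein_recursive(a: str, b: str) -> int:
    """Edit distance via the textbook recurrence (illustrative, not optimized)."""

    @lru_cache(maxsize=None)
    def dist(i: int, j: int) -> int:
        # dist(i, j) = edits needed to turn a[:i] into b[:j]
        if i == 0:
            return j  # insert the first j characters of b
        if j == 0:
            return i  # delete the first i characters of a
        cost = 0 if a[i - 1] == b[j - 1] else 1
        return min(
            dist(i - 1, j) + 1,         # deletion
            dist(i, j - 1) + 1,         # insertion
            dist(i - 1, j - 1) + cost,  # substitution (free if characters match)
        )

    return dist(len(a), len(b))


print(levenshtein_recursive("kitten", "sitting"))  # 3
```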

Why people use it

Spell checking & fuzzy search: find words that are “close enough” to a query.
Data cleaning: match near-duplicate names, addresses, or product titles.
Natural language & bioinformatics: compare tokens, lines, or genetic sequences.
UX matching: tolerate typos in forms and search boxes.

Character vs. Word mode

Character mode: great for short strings, usernames, variable names, and typos.
Word mode: splits on whitespace and compares tokens, which is clearer for sentences and paragraphs. It answers “how many word-level changes are needed?”
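
To make the two modes concrete, here is a small sketch in which the same distance routine runs over either characters or whitespace-separated tokens. The helper names `edit_distance` and `tokens` are hypothetical, and the tokenizer is an assumption based on the description above, not the tool's own implementation.

```python
from typing import Sequence


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance over any two sequences, keeping two rows of the grid."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def tokens(text: str, mode: str) -> list[str]:
    # Assumption: "word" splits on runs of whitespace; anything else is per character.
    return text.split() if mode == "word" else list(text)


a = "the quick brown fox"
b = "the quick red fox"
print(edit_distance(tokens(a, "char"), tokens(b, "char")))  # 4
print(edit_distance(tokens(a, "word"), tokens(b, "word")))  # 1
```

The same change reads as four character edits but only one word edit, which is why word mode tends to be easier to interpret for longer text.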

Similarity score

We show a simple similarity score, 1 − (distance / max(length(A), length(B))), expressed as a percentage. In word mode, “length” means word count.
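
As a worked example of the formula, the sketch below takes a distance and the two lengths and returns the score. The function name `similarity` is our own, and returning 1.0 for two empty inputs is an assumption, since the formula itself is undefined when both lengths are zero.

```python
def similarity(distance: int, len_a: int, len_b: int) -> float:
    """Return 1 - distance / max(len_a, len_b) as a fraction between 0 and 1."""
    longest = max(len_a, len_b)
    if longest == 0:
        # Assumption: two empty inputs are treated as identical.
        return 1.0
    return 1.0 - distance / longest


# "kitten" vs "sitting": distance 3, lengths 6 and 7
print(f"{similarity(3, 6, 7):.1%}")    # 57.1%

# One edit weighs more on a short string than on a long one
print(f"{similarity(1, 3, 3):.1%}")    # 66.7%
print(f"{similarity(1, 30, 30):.1%}")  # 96.7%
```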

Case, Unicode, and language notes

Case sensitivity: toggle on/off. For case-insensitive matching, we compare lowercase forms but keep your original text for display.
Unicode: the comparison operates on code points, so some grapheme clusters (e.g., an emoji plus a modifier) span several code points and may count as multiple edits.
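
The notes above can be illustrated with a short preprocessing sketch. The function `prepare` and its parameters are hypothetical names, and lowercasing with `str.lower()` is an assumption about how case-insensitive comparison might be done. Python strings iterate by code point, which matches the behavior described here.

```python
def prepare(text: str, case_sensitive: bool = True, trim: bool = False) -> list[str]:
    """Hypothetical preprocessing: optional trim and lowercasing, then code points."""
    if trim:
        text = text.strip()
    if not case_sensitive:
        text = text.lower()  # compare lowercase forms; the caller keeps the original
    return list(text)        # one item per Unicode code point


print(prepare("  Hello ", case_sensitive=False, trim=True))
# ['h', 'e', 'l', 'l', 'o']

thumbs_up = "\U0001F44D"                  # thumbs-up emoji, one code point
thumbs_up_toned = "\U0001F44D\U0001F3FD"  # same emoji plus a skin-tone modifier
print(len(thumbs_up), len(thumbs_up_toned))  # 1 2
```

Because the toned emoji is two code points, changing or removing it can register as two edits even though it looks like a single symbol on screen.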

How it’s computed (light overview)

We use dynamic programming over a (len(A)+1) × (len(B)+1) grid. The bottom-right cell is the distance; backtracking yields the sequence of edits we highlight.
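
The grid and the backtracking step can be sketched as follows. This is an illustrative version under our own naming (`levenshtein_with_ops` and the operation labels are assumptions), and when several optimal paths exist it simply prefers the diagonal move, which may differ from the path the tool highlights.

```python
def levenshtein_with_ops(a: str, b: str) -> tuple[int, list[tuple[str, str, str]]]:
    """Fill the (len(a)+1) x (len(b)+1) grid, then backtrack to recover the edits."""
    m, n = len(a), len(b)
    grid = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        grid[i][0] = i  # delete everything in a[:i]
    for j in range(n + 1):
        grid[0][j] = j  # insert everything in b[:j]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            grid[i][j] = min(
                grid[i - 1][j] + 1,         # deletion
                grid[i][j - 1] + 1,         # insertion
                grid[i - 1][j - 1] + cost,  # match or substitution
            )

    # Walk back from the bottom-right cell to the top-left cell.
    ops: list[tuple[str, str, str]] = []
    i, j = m, n
    while i > 0 or j > 0:
        same = i > 0 and j > 0 and a[i - 1] == b[j - 1]
        if i > 0 and j > 0 and grid[i][j] == grid[i - 1][j - 1] + (0 if same else 1):
            ops.append(("match" if same else "substitute", a[i - 1], b[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and grid[i][j] == grid[i - 1][j] + 1:
            ops.append(("delete", a[i - 1], ""))
            i -= 1
        else:
            ops.append(("insert", "", b[j - 1]))
            j -= 1
    ops.reverse()
    return grid[m][n], ops


distance, ops = levenshtein_with_ops("kitten", "sitting")
edits = [op for op in ops if op[0] != "match"]
print(distance)  # 3
print(edits)     # [('substitute', 'k', 's'), ('substitute', 'e', 'i'), ('insert', '', 'g')]
```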

Related metrics

Damerau–Levenshtein: counts adjacent transpositions as one edit.
Jaro / Jaro–Winkler: score matching characters and transpositions; Jaro–Winkler also favors strings that share a common prefix.
Cosine / Jaccard on tokens: compare word sets or vectors for document similarity.
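
To show how a transposition-aware metric differs from plain Levenshtein, here is a sketch of the optimal string alignment distance, a common restricted form of Damerau–Levenshtein. The function name `osa_distance` is our own, and this variant is named here only for illustration; it is not something this tool computes.

```python
def osa_distance(a: str, b: str) -> int:
    """Optimal string alignment: Levenshtein plus adjacent transpositions."""
    m, n = len(a), len(b)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i
    for j in range(n + 1):
        d[0][j] = j
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # match or substitution
            )
            # Extra case: the last two characters are swapped between a and b.
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[m][n]


print(osa_distance("teh", "the"))  # 1 (plain Levenshtein gives 2)
```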

5 Fun Facts about Levenshtein Distance

DNA to typos

The same edit-distance math powers spellcheckers and genetic alignment—letters or nucleotides, it’s edits all the way down.

1965 origin story

Vladimir Levenshtein introduced the metric in 1965 while working on error-correcting codes; it later became a staple of fuzzy search on the internet.

Transposition needs a cousin

Plain Levenshtein counts “teh”→“the” as two edits. Damerau–Levenshtein treats that swap as one.

Similarity ≠ distance

Distance is raw edits; similarity scales by length. One edit in a 3-letter word hurts more than one edit in a 30-letter string.

DP is the engine

A dynamic-programming grid finds the minimum number of edits in O(m·n) time, where m and n are the lengths of A and B. The highlighted path is the story of how A becomes B.
