# What algorithms are used by Delpha

## What Algorithms Are Used by Delpha to Detect Duplicates?

Delpha uses a set of advanced matching algorithms and normalization techniques to identify duplicate records across different field types in Salesforce. The logic determines how values are compared between records to calculate similarity and duplication scores.

## Matching Algorithms & Normalization by Field Type

| **Field Type**  | **Matching Algorithm**                           | **Normalization**                                                                          |
| --------------- | ------------------------------------------------ | ------------------------------------------------------------------------------------------ |
| **String**      | Exact, Jaro-Winkler, QGram Tokenizer, TF-IDF     | Lowercase, ASCII table                                                                     |
| **Boolean**     | Exact                                            | None                                                                                       |
| **Email**       | Exact, Partial segmentation, Domain name average | Lowercase, Validity checks                                                                 |
| **Phone**       | Exact                                            | ASCII table, e164 normalization                                                            |
| **Number**      | Exact                                            | Rounded                                                                                    |
| **URL**         | Exact                                            | ASCII & percent-encoded triplet, Path ending normalization (`"/"`), Protocol normalization |
| **Address**     | Partial, Euclidian distance                      | Coordinate normalization                                                                   |
| **Coordinates** | –                                                | Coordinate normalization                                                                   |

## Why It Matters

Delpha’s duplicate detection engine uses both **exact and fuzzy matching algorithms** to catch near-duplicates often missed by basic tools. For example:

* Emails are split and scored by domain and format.
* URLs are normalized to avoid false negatives caused by minor syntax differences.
* Addresses use **Euclidian distance** to detect close matches even with typos or format variations.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://help.delpha.io/delpha-for-salesforce/how-to-faq/delpha-duplicate/what-algorithms-are-used-by-delpha.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
