What algorithms are used by Delpha

Explore the advanced duplicate detection algorithms Delpha uses for string, email, phone, and address fields. Learn how Delpha ensures accurate, AI-powered record matching.

What Algorithms Are Used by Delpha to Detect Duplicates?

Delpha uses a set of advanced matching algorithms and normalization techniques to identify duplicate records across different field types in Salesforce. The logic determines how values are compared between records to calculate similarity and duplication scores.

Matching Algorithms & Normalization by Field Type

| Field Type | Matching Algorithm | Normalization |
|---|---|---|
| String | Exact, Jaro-Winkler, QGram Tokenizer, TF-IDF | Lowercase, ASCII table |
| Boolean | Exact | None |
| Email | Exact, Partial segmentation, Domain name average | Lowercase, Validity checks |
| Phone | Exact | ASCII table, E.164 normalization |
| Number | Exact | Rounded |
| URL | Exact | ASCII & percent-encoded triplet, Path ending normalization ("/"), Protocol normalization |
| Address | Partial, Euclidean distance | Coordinate normalization |
| Coordinates | – | Coordinate normalization |
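To make the String row more concrete, here is a minimal Python sketch of lowercase/ASCII normalization followed by two of the listed measures, Jaro-Winkler and a q-gram comparison. It is an illustration only, not Delpha's implementation: the function names, the q-gram size of 2, the Jaccard overlap used to compare q-gram sets, and the 0.1 prefix weight are all assumptions chosen for the example.

```python
import unicodedata

def normalize(text: str) -> str:
    """Lowercase and fold accented characters to plain ASCII (illustrative)."""
    decomposed = unicodedata.normalize("NFKD", text.lower())
    return decomposed.encode("ascii", "ignore").decode("ascii").strip()

def jaro(s1: str, s2: str) -> float:
    """Standard Jaro similarity in [0, 1]."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters match only if they are equal and close enough in position.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    transpositions = out_of_order // 2
    return (matches / len1 + matches / len2
            + (matches - transpositions) / matches) / 3

def jaro_winkler(s1: str, s2: str, prefix_weight: float = 0.1) -> float:
    """Jaro similarity boosted for a shared prefix of up to 4 characters."""
    base = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return base + prefix * prefix_weight * (1 - base)

def qgrams(text: str, q: int = 2) -> set[str]:
    """Set of overlapping q-character substrings (q = 2 is an assumption)."""
    if len(text) < q:
        return {text} if text else set()
    return {text[i:i + q] for i in range(len(text) - q + 1)}

def qgram_similarity(a: str, b: str, q: int = 2) -> float:
    """Jaccard overlap of the q-gram sets of two normalized strings."""
    ga, gb = qgrams(normalize(a), q), qgrams(normalize(b), q)
    if not ga and not gb:
        return 1.0
    return len(ga & gb) / len(ga | gb)
```

For example, `normalize("Société Générale")` yields `"societe generale"`, so accented and unaccented spellings compare as equal, while `jaro_winkler("martha", "marhta")` scores about 0.96 despite the transposed letters.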

Why It Matters

Delpha’s duplicate detection engine uses both exact and fuzzy matching algorithms to catch near-duplicates often missed by basic tools. For example:

  • Emails are split and scored by domain and format.

  • URLs are normalized to avoid false negatives caused by minor syntax differences.

  • Addresses use Euclidean distance to detect close matches even with typos or format variations.
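The three examples above can be sketched in Python as follows. This is a hedged illustration under stated assumptions, not Delpha's code: the function names are hypothetical, the email score (a plain average of local-part and domain equality) is one possible reading of "split and scored", and the address comparison assumes each address has already been geocoded to a coordinate pair upstream.

```python
import math
from urllib.parse import unquote, urlsplit

def normalize_url(url: str) -> str:
    """Drop the protocol, decode percent-encoded triplets, trim a trailing '/'."""
    candidate = url.strip()
    if "://" not in candidate:
        candidate = "//" + candidate  # lets urlsplit treat the host as netloc
    parts = urlsplit(candidate)
    normalized = parts.netloc.lower() + unquote(parts.path).rstrip("/")
    if parts.query:
        normalized += "?" + parts.query
    return normalized

def split_email(email: str) -> tuple[str, str]:
    """Lowercase an email and split it into (local part, domain)."""
    local, _, domain = email.strip().lower().rpartition("@")
    if not local or not domain:
        # Minimal validity check; real validation would be stricter.
        raise ValueError(f"not a valid email address: {email!r}")
    return local, domain

def email_similarity(a: str, b: str) -> float:
    """Assumed scoring: average of local-part match and domain match."""
    local_a, domain_a = split_email(a)
    local_b, domain_b = split_email(b)
    return ((local_a == local_b) + (domain_a == domain_b)) / 2

def euclidean_distance(p: tuple[float, float], q: tuple[float, float]) -> float:
    """Straight-line distance between two coordinate pairs."""
    return math.hypot(p[0] - q[0], p[1] - q[1])
```

With this sketch, `https://Example.com/about/` and `http://example.com/%61bout` both normalize to `example.com/about`, so an exact comparison no longer reports them as different. Two addresses with different spellings can still be treated as close when the distance between their coordinates falls under a chosen threshold.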

