The Problem
How to find duplicate companies using fuzzy matching
You are looking for a way to find duplicate companies using fuzzy matching. Most people would tell you to buy a SaaS subscription for this.
We say: Build it yourself for free.
The Solution
The Automation Blueprint
Copy the logic below into a tool like Gemini CLI or Claude Code. It includes the role, constraints, and multi-step workflow needed to find duplicate companies using fuzzy matching.
# Agent Configuration: The Data Quality Engineer

## Role
You are a **Data Quality Engineer**. You identify duplicate account records by comparing names (e.g., 'Acme Inc.' vs 'Acme Incorporated') and domains. You maximize efficiency and accuracy in Sales Ops.

## Objective
Identify duplicate account records for consolidation.

## Capabilities
* **Fuzzy Matching:** Levenshtein distance.
* **Entity Resolution:** Grouping records.

## Workflow

### Phase 1: Initialization & Seeding
1. **Check:** Does `accounts.csv` exist?
2. **If Missing:** Create `accounts.csv` using the sampleData provided in this blueprint.
3. **If Present:** Load the data for processing.

### Phase 2: The Audit Loop
1. **Read:** `accounts.csv`.
2. **Normalize:** Clean names.
3. **Match:** Exact `Domain` or fuzzy `Name` similarity > 90%.
4. **Output:** Save `merge_candidates.csv`.
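If you want to see roughly what an agent might produce from Phase 2, here is a minimal Python sketch. It is one possible reading of the blueprint, not its definitive implementation. The `Name` and `Domain` columns come from the Match step. Everything else is an assumption made for illustration: the legal-suffix list, the similarity formula (1 minus Levenshtein distance divided by the longer name's length), the output column names, and all function names.

```python
import csv
import re
from itertools import combinations

# Assumed suffix list for the "Normalize" step; extend it for your own data.
LEGAL_SUFFIXES = {"inc", "incorporated", "llc", "ltd", "limited",
                  "corp", "corporation", "co", "company"}


def normalize_name(name: str) -> str:
    """Lowercase, replace punctuation with spaces, drop trailing legal suffixes."""
    tokens = re.sub(r"[^a-z0-9\s]", " ", name.lower()).split()
    while tokens and tokens[-1] in LEGAL_SUFFIXES:
        tokens.pop()
    return " ".join(tokens)


def levenshtein(a: str, b: str) -> int:
    """Edit distance via dynamic programming, keeping one row at a time."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Scale edit distance to 0..1, where 1.0 means identical strings."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def find_merge_candidates(rows, threshold=0.90):
    """Pair rows that share a Domain exactly or whose names score above threshold."""
    prepared = [
        (row["Name"], normalize_name(row["Name"]),
         (row.get("Domain") or "").strip().lower())
        for row in rows
    ]
    candidates = []
    for (name_a, norm_a, dom_a), (name_b, norm_b, dom_b) in combinations(prepared, 2):
        # Names that normalize to nothing are never treated as a fuzzy match.
        score = similarity(norm_a, norm_b) if norm_a and norm_b else 0.0
        if dom_a and dom_a == dom_b:
            reason = "exact_domain"
        elif score > threshold:
            reason = "fuzzy_name"
        else:
            continue
        candidates.append({"name_a": name_a, "name_b": name_b,
                           "score": round(score, 3), "reason": reason})
    return candidates


def run(in_path="accounts.csv", out_path="merge_candidates.csv"):
    """Read accounts, find candidate pairs, and save them for review."""
    with open(in_path, newline="", encoding="utf-8") as f:
        rows = list(csv.DictReader(f))
    candidates = find_merge_candidates(rows)
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["name_a", "name_b", "score", "reason"])
        writer.writeheader()
        writer.writerows(candidates)
    return candidates
```

Calling `run()` in a folder that contains `accounts.csv` writes `merge_candidates.csv` with one row per suspected duplicate pair. The sketch compares every pair of records, so the work grows quadratically with the number of accounts. For large files you would likely want to group records first (for example by the first letter of the normalized name) before comparing.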
Want the Full Library?
I have more than 500 blueprints just like this one for every part of your Sales & Marketing stack.