Skip to main content

Fuzzy Lookup

Matches items against a lookup table using fuzzy matching algorithms.

Overview

The Fuzzy Lookup node finds matching entries in a table using approximate matching. It supports three algorithms — semantic (AI-powered), trigram (character similarity), and exact — with optional AI post-processing for refined results.

Use it to:

  • Match customer names against a master list
  • Find products from a catalog with typos or abbreviations
  • Reconcile data between two sources with inconsistent formatting
  • Match addresses, company names, or any fuzzy text data

Parameters

ParameterDescriptionRequired
Source TypeLookup table source: Lido Table, Excel File, or Google SheetYes
Table / FileThe lookup table to match againstYes
Field MappingsMatch rules with algorithm, input field, and lookup columnYes
Score Combine MethodHow to combine scores: Weighted Average, Max, or MinYes
Minimum ConfidenceConfidence threshold (0-100, default 30)Yes
Output ModeBest Match, All Matches, or Top NYes
Split Matches as ItemsOutput each match as a separate workflow itemNo
No Match BehaviorWhat to do when no match found: Output null, Skip, or ErrorYes
Additional Output ColumnsExtra columns from the lookup table to include in outputNo

Source Types

SourceDescription
Lido TableUse a table from the workflow's Lido spreadsheet
Excel FileUse an .xlsx or .xls file from cloud storage
Google SheetUse a Google Sheets spreadsheet

Match Algorithms

Each field mapping uses one algorithm:

AlgorithmBest ForDescription
SemanticNames, descriptions, meaningAI-powered meaning-based matching. Supports multiple lookup columns combined.
TrigramTypos, abbreviationsCharacter-level similarity (3-character sequences)
ExactIDs, codes, strict valuesExact string matching with optional case sensitivity

Field Mapping Options

Semantic: Select input expression + one or more lookup columns (joined with spaces for matching) + optional PCA settings + weight.

Trigram: Select input expression + single lookup column + weight.

Exact: Select input expression + single lookup column + case sensitivity + weight.

Scoring

When multiple field mappings are used, scores are combined using the Score Combine Method:

MethodDescription
Weighted AverageAverage scores weighted by each field mapping's weight
MaxUse the highest score across all field mappings
MinUse the lowest score across all field mappings

AI Post-Processing (Optional)

ParameterDescription
Enable AI Post-ProcessingTurn on AI re-ranking of match results
Top N for AINumber of candidate matches to send to AI (1-100)
AI InstructionsNatural language instructions for the AI re-ranker
AI Output ColumnsAdditional columns for the AI to generate

Settings

SettingDescription
Execution ModeOnce per item (default) or Once
Output ModeHow to output results when running once
Batch SizeItems to process concurrently (1-5, default 5)
Stop on ErrorStop workflow if a match operation fails

Output

For each match found:

{
"lookupColumn1": "Acme Corporation",
"lookupColumn2": "New York",
"confidence": 85,
"additionalColumn": "value"
}

Access matched data in expressions:

  • Confidence score: {{$item.data.confidence}}
  • Lookup column value: {{$item.data.companyName}}

With Split Matches as Items enabled, each match becomes a separate item.

With Output Mode "Top N", multiple matches are returned ranked by confidence.

Examples

Match Customer Names

Match incoming order names against a customer database:

[Get Table (orders)] → [Fuzzy Lookup (customer table, semantic)] → [Insert Rows]
  1. Source Type: Lido Table
  2. Add Semantic field mapping: input = {{$item.data.customerName}}, lookup column = "Company Name"
  3. Set Minimum Confidence to 60
  4. Set Output Mode to Best Match

Match with Multiple Algorithms

Combine algorithms for better accuracy:

[Data Extractor] → [Fuzzy Lookup (product catalog)] → [If (confidence > 70)] → Insert Rows
  1. Add Semantic mapping for product name (weight 2)
  2. Add Exact mapping for product code (weight 5)
  3. Score Combine Method: Weighted Average

Fuzzy Lookup from Excel File

Match against an external Excel reference file:

[Google Drive Trigger] → [Data Extractor] → [Fuzzy Lookup (Excel file)] → [Insert Rows]
  1. Source Type: Excel File
  2. Upload or reference the Excel lookup table
  3. Select the worksheet and starting cell if needed

Tips

  • Semantic matching works best for natural language fields (names, descriptions)
  • Trigram matching is faster and handles typos well
  • Combine multiple algorithms for more robust matching
  • Use higher weights for more reliable fields
  • Set Minimum Confidence to filter out poor matches — 30 is a good starting point
  • AI post-processing can refine results but adds latency and cost
  • The lookup table data is cached with content hashing — repeated lookups are fast
  • For Lido Tables, data is loaded client-side; for Excel/Google Sheets, fetched server-side