V.Ch.K Overview

V.Ch.K, the list de-duplication server, is a secure, high performance matching and extracting server that includes phonetic and geographical analysis of names & addresses. By working with files directly without loading them into a database, it optimizes the usage of memory and computer resources to handle millions of records at a very high speed.

V.Ch.K :: Merge

The merging process includes a phase of standardizing geographical locations by querying a knowledge database and correcting common typing and syntax errors.

Merging is at least as crucial as the purge process and a good preparation of the data improves dramatically the final results.

Features

  • Country and City verification against V.Ch.K. knowledge servers
  • Reformats all the files in a Common Postal Address file format.
  • Extracts addresses with errors for manual correction according to rules defined by the system administrator.

Roadmap

  • Increasing the granularity of address analysis to provide a more accurate Common Postal Address format.
  • Linking to the Postman Beat encoding process.

V.Ch.K :: Purge

The purge process can be adapted per job and file batch to provide best end results. Most common processes involve comparisons of countries, Full name and address of the postal address using a combination of phonetic and text encoding algorithms.

Features

  • Multi-pass matching process to analyze data using a different algorithm at each pass
  • Support for major phonetic algorithms (Soundex, Soundex 2, Metaphon)
  • Reverse name analysis
  • While processing a special attention is given to Asian addresses
  • Secure data server
  • Reports: intra & extra file duplicates, file similarity, country breakdown

Roadmap

  • Support for Asian Character sets (Chinese, Japanese).