서비스 검색

도구와 페이지를 빠르게 찾아 엽니다

제품 가이드 · 약 5분 읽기

Finding duplicates in CSV files without a database

Sort by candidate keys, scan runs, and use filters, lightweight dedup recon before SQL DISTINCT.

게시 2025년 3월 21일 · FreeLocalConverter

Without SQL, sort by the natural key (email, order_id, device_id) and look for adjacent identical keys. For composite keys, concatenate in a scratch column or sort by multiple columns.

Limits

  • Case sensitivity can hide dupes, normalize case upstream.
  • Trailing spaces break key equality.

← 모든 글