Deduplication
Encyclopedia
The term deduplication refers generally to eliminating duplicate or redundant information.
- Data deduplicationData deduplicationIn computing, data deduplication is a specialized data compression technique for eliminating coarse-grained redundant data. The technique is used to improve storage utilization and can also be applied to network data transfers to reduce the number of bytes that must be sent across a link...
, in computer storage, refers to the elimination of redundant data - Record linkageRecord linkageRecord linkage refers to the task of finding records in a data set that refer to the same entity across different data sources...
, in databases, refers to the task of finding entries that refer to the same entity in two or more files