Deduplication
Encyclopedia
The term deduplication refers generally to eliminating duplicate or redundant information.
  • Data deduplication
    Data deduplication
    In computing, data deduplication is a specialized data compression technique for eliminating coarse-grained redundant data. The technique is used to improve storage utilization and can also be applied to network data transfers to reduce the number of bytes that must be sent across a link...

    , in computer storage, refers to the elimination of redundant data
  • Record linkage
    Record linkage
    Record linkage refers to the task of finding records in a data set that refer to the same entity across different data sources...

    , in databases, refers to the task of finding entries that refer to the same entity in two or more files
The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK