A way to identify documents that are duplicated across multiple custodians or other production data sets. See De-Duplication..”