nucleus.local_deduplication¶
Local pHash-based deduplication utilities.
- class nucleus.local_deduplication.LocalDeduplicationResult¶
Output of a local pHash deduplication run.
- unique¶
Input objects that survived deduplication. If you passed rows from
items_and_annotation_generator(), this contains those same row dictionaries. If you passed DatasetItem objects, it contains DatasetItem objects.
- unique_dataset_items¶
The DatasetItem for each object in unique.
- unique_reference_ids¶
Reference IDs for the DatasetItems in unique. Entries can be None if a DatasetItem has no reference ID.
- stats¶
Summary statistics for the run.
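The attributes above might be consumed along the following lines. This is a minimal sketch, not the library's own code: StandInDatasetItem and StandInResult are hypothetical stand-ins that only mirror the documented attribute names so the example runs without the SDK, and items_by_reference_id is an illustrative helper. It also assumes that unique_dataset_items and unique_reference_ids are positionally aligned, which the description suggests but does not state.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Optional


@dataclass
class StandInDatasetItem:
    """Hypothetical stand-in for DatasetItem (only the field used here)."""
    reference_id: Optional[str] = None


@dataclass
class StandInResult:
    """Hypothetical stand-in mirroring the four documented attributes."""
    unique: List[Any]
    unique_dataset_items: List[StandInDatasetItem]
    unique_reference_ids: List[Optional[str]]
    stats: Any = None  # the shape of the stats object is not described here


def items_by_reference_id(result: Any) -> Dict[str, Any]:
    """Map each non-None reference ID to its item.

    Assumes the two lists are positionally aligned (an assumption,
    not something the documentation states).
    """
    return {
        ref_id: item
        for ref_id, item in zip(
            result.unique_reference_ids, result.unique_dataset_items
        )
        if ref_id is not None  # entries can be None, so skip them
    }


# Build a small result by hand; a real run would return this object instead.
items = [
    StandInDatasetItem("img-001"),
    StandInDatasetItem(None),
    StandInDatasetItem("img-003"),
]
result = StandInResult(
    unique=items,  # DatasetItem inputs come back as DatasetItem objects
    unique_dataset_items=items,
    unique_reference_ids=[item.reference_id for item in items],
)

lookup = items_by_reference_id(result)
missing = sum(ref_id is None for ref_id in result.unique_reference_ids)
```

Filtering out None before building the lookup matters because several items without a reference ID would otherwise collide on the same dictionary key.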