nucleus.local_deduplication

Local pHash-based deduplication utilities.

LocalDeduplicationResult

Output of a local pHash deduplication run.

class nucleus.local_deduplication.LocalDeduplicationResult

Output of a local pHash deduplication run.

unique

Input objects that survived deduplication. If you passed rows from items_and_annotation_generator(), this contains those same row dictionaries. If you passed DatasetItem objects, it contains DatasetItem objects.

unique_dataset_items

The DatasetItem for each object in unique.

unique_reference_ids

Reference IDs for the DatasetItems in unique. Entries can be None if a DatasetItem has no reference ID.

stats

Summary statistics for the run.