😎 Finding duplicate images made easy!
Updated Aug 15, 2025 - Python
Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.
Image similarity in Golang. Version 4 (LATEST)
Tool to detect (and get rid of) similar images using perceptual hashing (pHash lib)
A utility for locating near duplicate photos irrespective of image resolution, compression settings or file format.
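A high-performance image deduplication tool: locates duplicates in massive image collections using deep-learning features and cosine similarity.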
A Python tool to identify and remove similar-looking images from a dataset. Utilizes image preprocessing and hashing techniques for efficient comparison.
Downloader with custom wildcard system: cherry-picking internet with asterisks for HTML or right-carets for API, whether it's for time-critical website moments or just for laziness. Features directory listing and serve, alarm (essentially in-stock tracker), file sorter (organizer), image duplicate finder and tools for naked eyes.
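An efficient Python tool for finding duplicate images, supporting duplicate detection across millions of image files. Integrates multiple algorithms, including MD5 hashing, perceptual hashing (dHash/pHash/aHash), and a C++ acceleration library. It can identify duplicates that are exactly identical, resized, partially cropped, or altered by watermark changes.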
🏍️ A clustering tool providing exact and near de-duplication of images using vector embeddings.
A CLI tool for image analysis: checking image integrity, image deduplication, and image retrieval.
A Python command-line tool that identifies and groups similar images using average hashing. It supports single-level and recursive directory scanning, an adjustable similarity threshold, and presents results in JSON format. Ideal for image deduplication, organization, and content-based retrieval tasks.
The extended version of simhash supports fingerprint extraction of documents and images.
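An end-to-end, closed-loop data system designed for AI asset management and YOLO model training, taking productivity from "intelligent deduplication" through "interactive annotation" to "local/cloud multi-scenario compute scheduling".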
CLIP image deduplication toolkit
Sobel Gradient Image Deduplication
This Python script identifies duplicate images within a specified directory and moves them to a designated duplicates folder.
A utility for testing the performance of de-duplication algorithms by randomly generating “noisy” images in a dataset.
A Python program to detect duplicate images in a specified folder.
Finds duplicates across every file type, not just photos. Compares images by appearance, audio by acoustic fingerprint, video by keyframes, and documents and source code by content similarity, catching renames, re-encodes, and format conversions that byte-comparison tools miss. Native app for macOS, Windows, and Linux.
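Several of the tools listed above describe grouping similar images by average hashing with an adjustable similarity threshold. As a rough illustration of that general idea, and not the implementation of any listed project, here is a minimal sketch. It assumes each image has already been reduced to a small grayscale grid (real tools typically downscale to something like 8x8 with an imaging library first). The names `average_hash`, `hamming`, and `group_similar` are hypothetical.

```python
from typing import Dict, List, Sequence


def average_hash(pixels: Sequence[Sequence[float]]) -> int:
    """Pack a small grayscale grid into an integer, one bit per pixel.

    A bit is 1 when the pixel is brighter than the grid's mean brightness.
    Assumption: the caller has already downscaled the image to this grid.
    """
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for p in flat:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Count the bit positions where two hashes differ."""
    return bin(a ^ b).count("1")


def group_similar(hashes: Dict[str, int], threshold: int) -> List[List[str]]:
    """Greedy grouping: an image joins the first group whose first member
    is within `threshold` differing bits, otherwise it starts a new group."""
    groups: List[List[str]] = []
    for name, h in hashes.items():
        for group in groups:
            if hamming(hashes[group[0]], h) <= threshold:
                group.append(name)
                break
        else:
            groups.append([name])
    return groups


# Toy 2x2 "images": a and b differ only by slight brightness noise,
# while c has the inverted light/dark pattern.
images = {
    "a": [[10, 200], [220, 15]],
    "b": [[12, 190], [210, 20]],
    "c": [[200, 10], [15, 220]],
}
hashes = {name: average_hash(grid) for name, grid in images.items()}
groups = group_similar(hashes, threshold=1)
```

A lower threshold makes grouping stricter. The greedy pass compares only against each group's first member, which keeps the sketch short but is cruder than the tree-based or embedding-based searches some of the listed projects mention.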