API Documentation
Duplicate Retrieval
Learn how to retrieve visually duplicate media items from a dataset using the API.
How This Helps
Identify duplicate media in your dataset using internal media IDs. This helps streamline cleanup, reduce redundancy, and improve data quality before training or export.
Step 1: Retrieve Internal Media ID
To begin, retrieve the internal media ID for a file based on its original_media_uri
.
Example
Response:
- ✅
200 OK
: Returns the internal media ID as text - ❌
404
: Media not found in the dataset
Step 2: Retrieve Duplicates Using Media ID
Once you have the media ID, use this endpoint to find duplicates:
Example
Response:
Returns a JSON array containing 0
or more duplicate media IDs.
This feature is currently under development and subject to change.