How This Helps
Identify duplicate media in your dataset using internal media IDs. This helps streamline cleanup, reduce redundancy, and improve data quality before training or export.
Step 1: Retrieve Internal Media ID
To begin, retrieve the internal media ID for a file based on its original_media_uri.
Example
- ✅ 200 OK: Returns the internal media ID as text
- ❌ 404: Media not found in the dataset
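The lookup and its two documented responses can be sketched in Python as follows. This is a minimal sketch, not the actual API: the endpoint path, the dataset_id parameter, and passing original_media_uri as a query parameter are all assumptions made for illustration, since the real endpoint is not shown here. Only the 200 (ID returned as text) and 404 (not found) behaviors come from the description above.

```python
from typing import Optional
from urllib.parse import urlencode

# Hypothetical path: the real endpoint is not shown in this section.
MEDIA_ID_PATH = "/datasets/{dataset_id}/media-id"


def build_media_id_url(base_url: str, dataset_id: str, original_media_uri: str) -> str:
    """Build the lookup URL, percent-encoding the original media URI."""
    path = MEDIA_ID_PATH.format(dataset_id=dataset_id)
    # Assumption: the URI is sent as a query parameter named original_media_uri.
    query = urlencode({"original_media_uri": original_media_uri})
    return f"{base_url.rstrip('/')}{path}?{query}"


def parse_media_id_response(status: int, body: str) -> Optional[str]:
    """Map the documented responses to a Python value.

    200 -> the internal media ID (response body as text)
    404 -> None (media not found in the dataset)
    """
    if status == 200:
        return body.strip()
    if status == 404:
        return None
    raise RuntimeError(f"Unexpected status {status}")
```

Returning None for 404 keeps "not in the dataset" distinct from transport or server errors, which raise instead.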
Step 2: Retrieve Duplicates Using Media ID
Once you have the media ID, use this endpoint to find duplicates:
Example
The response contains 0 or more duplicate media IDs.
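A client for this step might look like the sketch below. The endpoint path and the response format (a JSON array of media ID strings) are assumptions, as neither is specified here, and the feature itself may change. The fetch argument is a hypothetical helper that performs the HTTP GET and returns the status code and body text, which keeps the sketch independent of any particular HTTP library.

```python
import json
from typing import Callable, List, Tuple
from urllib.parse import quote

# Hypothetical path: the real endpoint is not shown in this section.
DUPLICATES_PATH = "/media/{media_id}/duplicates"

# A fetch function takes a URL and returns (status_code, body_text).
Fetch = Callable[[str], Tuple[int, str]]


def get_duplicate_ids(base_url: str, media_id: str, fetch: Fetch) -> List[str]:
    """Return the duplicate media IDs for media_id (possibly an empty list)."""
    path = DUPLICATES_PATH.format(media_id=quote(media_id, safe=""))
    status, body = fetch(base_url.rstrip("/") + path)
    if status != 200:
        raise RuntimeError(f"Unexpected status {status}")
    # Assumption: the body is a JSON array of media ID strings.
    ids = json.loads(body)
    if not isinstance(ids, list):
        raise ValueError("Expected a JSON array of media IDs")
    return [str(item) for item in ids]
```

An empty list means the media has no known duplicates; callers can treat it the same as any other successful result.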
This feature is currently under development and subject to change.