Data Leakage Detection and De-duplication in Large Scale Geospatial Image Datasets
The paper “Data Leakage Detection and De-duplication in Large Scale Geospatial Image Datasets” by Yeshwanth Kumar Adimoolam, Charalambos Poullis, and Melinos Averkiou will appear in IEEE/CVF Computer Vision and Pattern Recognition (CVPR), 2026. TL;DR: The AICrowd Mapping Challenge dataset is riddled with problems - ~90% duplicate training images