Skip to content

Latest commit

 

History

History
16 lines (9 loc) · 798 Bytes

README.md

File metadata and controls

16 lines (9 loc) · 798 Bytes

ImageNet URL Downloader

Python utility script for downloading subtrees of ImageNet using the URLs provided by the ImageNet API.

example usage:

./imagenetDownloader.py n03489162 ../dataset --humanreadable -F --images=50 --minsize=7000 -j10

In order to download 50 images from any category under any subtree (-F) of the handtools (n03489162), running 10 threads in parallel, and only downloading images larger than 7kB.

  • Image downloads don't fail if the URL (or rather IRI) includes non-ASCII characters
  • Retry getting list of URLs if the ImageNet API does not respond temporarily (this happens very frequently as of now!)
  • Resume downloads

Please note that this project is currently WIP (as of June 2019) and is expected to receive major updates during the next days!