Upload ZIP file (under 4MB) with images. Training takes ~3min for 2.2MB ZIP.
Optional caption/mask files supported