Massive 4K Resolution Woman & Man Class Ground Truth Stable Diffusion Regularization Images Dataset (Patreon)
Downloads
Content
Join discord and tell me your discord username to get a special rank : SECourses Discord
27 November 2023 Huge Update
- The dataset is improved and expanded to 5200 images for both Woman and Man dataset
- The cropping and resize scripts are further improved and all images are processed again
- Moreover all images are now sorted according to the face quality of the images
- So the training scripts will use the very best ones
- Naming is made starting from man_10001 or woman_10001 so when training script starts using reg images, the very best ones will be used
- Please re-download all of the new images for best quality
- Total time took is over 10 full days to prepare all reg images
- Both woman and man new datasets are added to the resources below
20 September 2023 Massive Update
- All of the images are reprocessed with a newer face detection algorithm RetinaFace
- RetinaFace is much better to detect and focus faces but it is really really slow
- Newest processing scripts are shared here (YOLO V7 cropper and RetinaFace resizer) : https://www.patreon.com/posts/sota-subject-and-88391247
- So with this update the datasets are much better. Please redownload them before using
- Processing all of the datasets took like 6 days with 13900K CPU and 3090 TI
How To Download All On A RunPod Or A Unix System
- download_man_reg_imgs.sh file will download and automatically extract 512x512, 768x768 and 1024x1024 man images. You can edit the file and add other resolutions if you need.
- download_woman_reg_imgs.sh file will download and automatically extract 512x512, 768x768 and 1024x1024 woman images. You can edit the file and add other resolutions if you need.
- These files can be used for Unix and possibly for MacOs systems as well. Don't forget to comment (put # beginning of a link) the links that you don't want to download and change folder paths if you wish.
- Upload into workspace folder of RunPod and execute below command
- cd /workspace
- chmod +x download_man_reg_imgs.sh
- ./download_woman_reg_imgs.sh
- cd /workspace
- chmod +x download_woman_reg_imgs.sh
- ./download_woman_reg_imgs.sh
How Datasets Are Prepared
I have gathered 40k images for woman and man class from unsplash . com. So total gathered images count is above 80k.
They are all real images. 0 AI image are used.
Then I post processed them with several AI models to clean the dataset. At the end, finally I checked each one of the images manually. Whole process took about 70 (for woman) + 70 (for man) hours.
The final output is 5200 perfect images for woman and 5200 for man. Minimum resolution of images are above 1536 x 1536 pixels and max resolution is up to 14999 x 9999 pixels.
The raw images and exact resolution having images are shared below. If you also need any other specific resolution let me know and hopefully I will update this post.
To use them on Windows you only need to extract zip images. If you can't make it install Winrar from https://www.rarlab.com/
Man Dataset
- Raw dataset is between 4 megapixels to 100 megapixels
- man_5200_imgs_raw (org resolutions - no cropped).zip - 15.3 GB
- man_5200_imgs_512x512.zip - 924 MB - 512x512 pixels
- man_5200_imgs_768x768.zip - 1.97 GB - 768x768 pixels
- man_5200_imgs_1024x1024.zip - 3.4 GB - 1024x1024 pixels
- man_5200_imgs_768x512.zip - 1.3 GB - 768x512 pixels
- man_5200_imgs_512x768.zip - 1.37 GB - 512x768 pixels
- man_5200_imgs_1024x768.zip - 2.52 GB - 1024x768 pixels
- man_5200_imgs_768x1024.zip - 2.63 GB - 768x1024 pixels
- man_5200_imgs_1536x640.zip - 2.99 GB - 1536x640 pixels
- man_5200_imgs_1216x832.zip - 3.19 GB - 1216x832 pixels
- man_5200_imgs_1344x768.zip - 3.2 GB - 1344x768 pixels
- man_5200_imgs_1368x768.zip - 3.26 GB - 1368x768 pixels
- man_5200_imgs_1152x896.zip - 3.28 GB - 1152x896 pixels
- man_5200_imgs_640x1536 - 3.32 GB - 640x1536 pixels
- man_5200_imgs_832x1216 - 3.35 GB - 832x1216 pixels
- man_5200_imgs_896x1152 - 3.4 GB - 896x1152 pixels
- man_5200_imgs_768x1344 - 3.44 GB - 768x1344 pixels
- man_5200_imgs_768x1368 - 3.5 GB - 768x1368 pixels
- man_5200_imgs_1280x1024 - 4.14 GB - 1280x1024 pixels
- man_5200_imgs_1024x1280 - 4.26 GB - 1024x1280 pixels
- man_5200_imgs_1536x1024 - 4.86 GB - 1536x1024 pixels
- man_5200_imgs_1024x1536 - 5.11 GB - 1024x1536 pixels
- man_5200_imgs_1536x1280 - 6.12 GB - 1536x1280 pixels
- man_5200_imgs_1280x1536 - 6.26 GB - 1280x1536 pixels
- man_5200_imgs_1536x1536 - 7.38 GB - 1536x1536 pixels
Woman Dataset
- woman_5200_imgs_raw_dataset (org resolutions not cropped).zip - 14 GB
- woman_5200_imgs_512x512.zip - 955 MB - 512x512 pixels
- woman_5200_imgs_768x768.zip - 2.03 GB - 768x768 pixels
- woman_5200_imgs_1024x1024.zip - 3.49 GB - 1024x1024 pixels
- woman_5200_imgs_1536x1536.zip - 7.49 GB - 1536x1536 pixels
- woman_5200_imgs_768x512.zip - 1.34 GB
- woman_5200_imgs_512x768.zip - 1.42 GB
- woman_5200_imgs_1024x768.zip - 2.6 GB
- woman_5200_imgs_768x1024.zip - 2.71 GB
- woman_5200_imgs_1536x640.zip - 3.07 GB
- woman_5200_imgs_640x1536.zip - 3.37 GB
- woman_5200_imgs_1216x832.zip - 2.58 GB
- woman_5200_imgs_832x1216.zip - 3.44 GB
- woman_5200_imgs_1344x768.zip - 3.29 GB
- woman_5200_imgs_768x1344.zip - 3.52 GB
- woman_5200_imgs_1368x768.zip - 3.34 GB
- woman_5200_imgs_768x1368.zip - 3.58 GB
- woman_5200_imgs_1152x896.zip - 3.37 GB
- woman_5200_imgs_896x1152.zip - 3.49 GB
- woman_5200_imgs_1024x1280.zip - 4.36 GB
- woman_5200_imgs_1024x1536.zip - 5.21 GB
- woman_5200_imgs_1280x1024.zip - 4.24 GB
- woman_5200_imgs_1280x1536.zip - 6.38 GB
- woman_5200_imgs_1536x1024.zip - 4.97 GB
- woman_5200_imgs_1536x1280.zip - 6.24 GB
How To Use On RunPod Or Other Cloud or Linux
To use these files unrunpod
First you need to install 7zip
- yes | apt-get install p7zip-full
Then download them with wget. Copy their link with right click and copy link then as below
wget
e.g. man:
- wget https://huggingface.co/MonsterMMORPG/SECourses/resolve/main/man_5200_imgs_512x512.zip
- or another one
- wget https://huggingface.co/MonsterMMORPG/SECourses/resolve/main/man_5200_imgs_1024x1024.zip
Then use below command to extract them
- 7z x man_5200_imgs_512x512.zip
- or another one
- 7z x man_5200_imgs_1024x1024.zip
e.g. woman :
- wget https://huggingface.co/MonsterMMORPG/SECourses/resolve/main/woman_5200_imgs_512x512.zip
- or another one
- wget https://huggingface.co/MonsterMMORPG/SECourses/resolve/main/woman_5200_imgs_1024x1024.zip
Then use below command to extract them
- 7z x woman_5200_imgs_512x512.zip
- or another one
- 7z x woman_5200_imgs_1024x1024.zip