Furkan Gözükara

Massive 4K Resolution Woman & Man Class Ground Truth Stable Diffusion Regularization Images Dataset (Patreon)

Published:

2023-08-15 00:37:09

Edited:

2023-11-26 22:49:48

Imported:

Tags:

class classification data set dataset image images man picture pictures reg regularization training woman

Downloads

Content

Patreon exclusive posts index

Join discord and tell me your discord username to get a special rank : SECourses Discord

27 November 2023 Huge Update

The dataset is improved and expanded to 5200 images for both Woman and Man dataset
The cropping and resize scripts are further improved and all images are processed again
Moreover all images are now sorted according to the face quality of the images
So the training scripts will use the very best ones
Naming is made starting from man_10001 or woman_10001 so when training script starts using reg images, the very best ones will be used
Please re-download all of the new images for best quality
Total time took is over 10 full days to prepare all reg images
Both woman and man new datasets are added to the resources below

20 September 2023 Massive Update

All of the images are reprocessed with a newer face detection algorithm RetinaFace
RetinaFace is much better to detect and focus faces but it is really really slow
Newest processing scripts are shared here (YOLO V7 cropper and RetinaFace resizer) : https://www.patreon.com/posts/sota-subject-and-88391247
So with this update the datasets are much better. Please redownload them before using
Processing all of the datasets took like 6 days with 13900K CPU and 3090 TI

How To Download All On A RunPod Or A Unix System

download_man_reg_imgs.sh file will download and automatically extract 512x512, 768x768 and 1024x1024 man images. You can edit the file and add other resolutions if you need.
download_woman_reg_imgs.sh file will download and automatically extract 512x512, 768x768 and 1024x1024 woman images. You can edit the file and add other resolutions if you need.
These files can be used for Unix and possibly for MacOs systems as well. Don't forget to comment (put # beginning of a link) the links that you don't want to download and change folder paths if you wish.
Upload into workspace folder of RunPod and execute below command
cd /workspace
chmod +x download_man_reg_imgs.sh
./download_woman_reg_imgs.sh
cd /workspace
chmod +x download_woman_reg_imgs.sh
./download_woman_reg_imgs.sh

How Datasets Are Prepared

I have gathered 40k images for woman and man class from unsplash . com. So total gathered images count is above 80k.

They are all real images. 0 AI image are used.

Then I post processed them with several AI models to clean the dataset. At the end, finally I checked each one of the images manually. Whole process took about 70 (for woman) + 70 (for man) hours.

The final output is 5200 perfect images for woman and 5200 for man. Minimum resolution of images are above 1536 x 1536 pixels and max resolution is up to 14999 x 9999 pixels.

The raw images and exact resolution having images are shared below. If you also need any other specific resolution let me know and hopefully I will update this post.

To use them on Windows you only need to extract zip images. If you can't make it install Winrar from https://www.rarlab.com/

Man Dataset

Raw dataset is between 4 megapixels to 100 megapixels
man_5200_imgs_raw (org resolutions - no cropped).zip - 15.3 GB
man_5200_imgs_512x512.zip - 924 MB - 512x512 pixels
man_5200_imgs_768x768.zip - 1.97 GB - 768x768 pixels
man_5200_imgs_1024x1024.zip - 3.4 GB - 1024x1024 pixels
man_5200_imgs_768x512.zip - 1.3 GB - 768x512 pixels
man_5200_imgs_512x768.zip - 1.37 GB - 512x768 pixels
man_5200_imgs_1024x768.zip - 2.52 GB - 1024x768 pixels
man_5200_imgs_768x1024.zip - 2.63 GB - 768x1024 pixels
man_5200_imgs_1536x640.zip - 2.99 GB - 1536x640 pixels
man_5200_imgs_1216x832.zip - 3.19 GB - 1216x832 pixels
man_5200_imgs_1344x768.zip - 3.2 GB - 1344x768 pixels
man_5200_imgs_1368x768.zip - 3.26 GB - 1368x768 pixels
man_5200_imgs_1152x896.zip - 3.28 GB - 1152x896 pixels
man_5200_imgs_640x1536 - 3.32 GB - 640x1536 pixels
man_5200_imgs_832x1216 - 3.35 GB - 832x1216 pixels
man_5200_imgs_896x1152 - 3.4 GB - 896x1152 pixels
man_5200_imgs_768x1344 - 3.44 GB - 768x1344 pixels
man_5200_imgs_768x1368 - 3.5 GB - 768x1368 pixels
man_5200_imgs_1280x1024 - 4.14 GB - 1280x1024 pixels
man_5200_imgs_1024x1280 - 4.26 GB - 1024x1280 pixels
man_5200_imgs_1536x1024 - 4.86 GB - 1536x1024 pixels
man_5200_imgs_1024x1536 - 5.11 GB - 1024x1536 pixels
man_5200_imgs_1536x1280 - 6.12 GB - 1536x1280 pixels
man_5200_imgs_1280x1536 - 6.26 GB - 1280x1536 pixels
man_5200_imgs_1536x1536 - 7.38 GB - 1536x1536 pixels

Woman Dataset

woman_5200_imgs_raw_dataset (org resolutions not cropped).zip - 14 GB
woman_5200_imgs_512x512.zip - 955 MB - 512x512 pixels
woman_5200_imgs_768x768.zip - 2.03 GB - 768x768 pixels
woman_5200_imgs_1024x1024.zip - 3.49 GB - 1024x1024 pixels
woman_5200_imgs_1536x1536.zip - 7.49 GB - 1536x1536 pixels
woman_5200_imgs_768x512.zip - 1.34 GB
woman_5200_imgs_512x768.zip - 1.42 GB
woman_5200_imgs_1024x768.zip - 2.6 GB
woman_5200_imgs_768x1024.zip - 2.71 GB
woman_5200_imgs_1536x640.zip - 3.07 GB
woman_5200_imgs_640x1536.zip - 3.37 GB
woman_5200_imgs_1216x832.zip - 2.58 GB
woman_5200_imgs_832x1216.zip - 3.44 GB
woman_5200_imgs_1344x768.zip - 3.29 GB
woman_5200_imgs_768x1344.zip - 3.52 GB
woman_5200_imgs_1368x768.zip - 3.34 GB
woman_5200_imgs_768x1368.zip - 3.58 GB
woman_5200_imgs_1152x896.zip - 3.37 GB
woman_5200_imgs_896x1152.zip - 3.49 GB
woman_5200_imgs_1024x1280.zip - 4.36 GB
woman_5200_imgs_1024x1536.zip - 5.21 GB
woman_5200_imgs_1280x1024.zip - 4.24 GB
woman_5200_imgs_1280x1536.zip - 6.38 GB
woman_5200_imgs_1536x1024.zip - 4.97 GB
woman_5200_imgs_1536x1280.zip - 6.24 GB

How To Use On RunPod Or Other Cloud or Linux

To use these files unrunpod

First you need to install 7zip

yes | apt-get install p7zip-full

Then download them with wget. Copy their link with right click and copy link then as below

wget

e.g. man:

Then use below command to extract them

7z x man_5200_imgs_512x512.zip
or another one
7z x man_5200_imgs_1024x1024.zip

e.g. woman :

Then use below command to extract them

7z x woman_5200_imgs_512x512.zip
or another one
7z x woman_5200_imgs_1024x1024.zip