safebooru 2022.11 rip + addons from yande-re, danbooru, e621, zerochan :: Nyaa ISS

safebooru 2022.11 rip + addons from yande-re, danbooru, e621, zerochan

Category:
Date:
2023-02-06 09:43
Submitter:
Seeders:
4
Information:
No information.
Leechers:
1
File size:
326.0 GiB
Completed:
46
Info hash:
b962161033c64c191b5066fe867611be7cb4c434

This is unusually big volume V2022D for interval 08.2022-11.2022 in series of composite safebooru-based rips
05.2022 - 08.2022 volume V2022C
01.2022 - 05.2022 volume V2022B double-size
11.2021 - 01.2022 volume V2022A
08.2021 - 11.2021 volume V2021D
06.2021 - 08.2021 volume V2021C
03.2021 - 06.2021 volume V2021B
12.2020 - 03.2021 volume V2021A
09.2020 - 12.2020 volume V2020D
06.2020 - 09.2020 volume V2020C
02.2020 - 05.2020 volume V2020B
08.2019 - 01.2020 volume V2020A
11.2018 - 08.2019 volume V2019
aimed to feed BOORU-CHARS OPEN DATASET 2021 and 2022 and later

This rips not intended to be “complete and maximum quality” but rather “representative the best of” to help users
not to loose interesting fandom, artist or even single prominent picture and get all stuff with several clicks.
Another reason to build this megalythe is neural network training over art images. There are promising results, stay tuned.

Sources used (priorities high to low when deduplicating):

207.609 images sorted and zipped according aspect ratio (dimensions 2 folders) priorities high to low :

  • 62718 “artbook pages” 7x10 (+/- 4%)
  • 32212 “wide pages” 3x4 (+/- 10%)
  • 39468 “squares” 1x1 (+/- 20%)
  • 39126 “wallpapers and computer screens” 3x2 (+/- 40%)
  • 34085 “high pages” 2x3 (+/- 40%)

and also for source and (sometimes) ID range, mentioned in folder/archive name.
You can browse pictures directly in archives with FastStone MaxView of something like it.

File names structure : %website% - %id% - %up_to_3_copyrights% ~ %up_to_5_characters% (%up_to_2_artists%).%ext% where

  • %copyright% , %character% and %artist% may be used as filter for search on source booru
  • %website% + %id% is unique and also may be used to get direct booru url

so you can extract subsets of interest with xcopy (from already unzipped images) or unzipping (from release on the fly) e.g.

for %%F in ("d:\Safebooru 2022d\*.zip") do 7z x -r -o"e:\sortarea\" "%%F" *lycoris*
xcopy /s d:\Safebooru 2022d\*lycoris* e:\sortarea

Transformations and filters:

  • initially filtered Mpixels >= 1.2, width >= 900, height >= 900
  • PNG converted to JPG (quality 94%), no animations
  • downsize to 60MPix and/or maxsize 9000 px, stripes dropped or adjusted to aspect ratio 0.4 … 2.1
  • manually (yep, plenty of handjob behind this release)
    • comic and 4koma, segmented scans and overtexted covers filtered out
    • real-life photos, no-character landscapes, most of line-arts and primitive chibi thrown away
    • too explicit images (uncensored nipples or vulva, obvious adult actions etc) excluded from “questionable” downloads
    • crops done when large simple or dirty background, most artbooks de-bordered
    • occationally gamma correction, denoise and other nontrivial improvements made
  • carefully deduplicatied (with AntiDupl NET up to 4% similarity) along with several past releases

Some meta-information included in tab delimited files :

  • V2022D_files.TSV post info (size, resolution, MD5 etc) with concatenated copyrights / characters / artists tags (Excel capable)
  • V2022D_tags.TSV all tags (incl. general and meta) one tag per line (not fit into Excel)

Using some database you can play with SQL and xcopy (from already unzipped images, copypasting query result) anything you want, e.g.

select 'xcopy "d:\'||torr_path||'\'||file_name||'" e:\sortarea ' xc
from files f
join tags t on t.booru=f.booru and t.fid=f.fid
where t.tag like '%chisato%takina%kicking%meme%' -- from the opening

File list

  • Safebooru 2022d
    • 1x1.d.556.zip (1.9 GiB)
    • 1x1.d.562.zip (1.7 GiB)
    • 1x1.d.567.zip (1.7 GiB)
    • 1x1.d.572.zip (1.3 GiB)
    • 1x1.d.577.zip (1.7 GiB)
    • 1x1.d.q.zip (880.3 MiB)
    • 1x1.e.34.q.zip (963.8 MiB)
    • 1x1.e.34.zip (1.4 GiB)
    • 1x1.e.35.q.zip (1.4 GiB)
    • 1x1.e.35.zip (2.2 GiB)
    • 1x1.e.36.q.zip (1.5 GiB)
    • 1x1.e.36.zip (2.6 GiB)
    • 1x1.sb.410.zip (2.4 GiB)
    • 1x1.sb.412.zip (2.4 GiB)
    • 1x1.sb.414.zip (2.4 GiB)
    • 1x1.sb.416.zip (2.4 GiB)
    • 1x1.sb.418.zip (2.6 GiB)
    • 1x1.y.q.zip (739.3 MiB)
    • 1x1.y.zip (570.5 MiB)
    • 1x1.z.372.zip (1.7 GiB)
    • 1x1.z.374.zip (1.7 GiB)
    • 1x1.z.376.zip (1.7 GiB)
    • 1x1.z.378.zip (1.8 GiB)
    • 1x1.z.380.zip (1.8 GiB)
    • 2x3.d.556.zip (5.0 GiB)
    • 2x3.d.562.zip (4.6 GiB)
    • 2x3.d.567.zip (4.4 GiB)
    • 2x3.d.572.zip (3.9 GiB)
    • 2x3.d.577.zip (4.2 GiB)
    • 2x3.d.q.zip (1.9 GiB)
    • 2x3.e.q.zip (1.4 GiB)
    • 2x3.e.zip (1.9 GiB)
    • 2x3.sb.410.zip (2.8 GiB)
    • 2x3.sb.412.zip (2.9 GiB)
    • 2x3.sb.414.zip (3.0 GiB)
    • 2x3.sb.416.zip (2.8 GiB)
    • 2x3.sb.418.zip (2.9 GiB)
    • 2x3.y.q.zip (6.4 GiB)
    • 2x3.y.zip (2.3 GiB)
    • 2x3.z.372.zip (2.7 GiB)
    • 2x3.z.374.zip (3.0 GiB)
    • 2x3.z.376.zip (2.7 GiB)
    • 2x3.z.378.zip (2.5 GiB)
    • 2x3.z.380.zip (3.3 GiB)
    • 3x2.d.556.zip (3.5 GiB)
    • 3x2.d.562.zip (2.7 GiB)
    • 3x2.d.567.zip (2.7 GiB)
    • 3x2.d.572.zip (2.1 GiB)
    • 3x2.d.577.zip (2.5 GiB)
    • 3x2.d.q.zip (1.3 GiB)
    • 3x2.e.34.zip (3.7 GiB)
    • 3x2.e.36.zip (2.3 GiB)
    • 3x2.e.q.zip (3.0 GiB)
    • 3x2.sb.410.zip (4.4 GiB)
    • 3x2.sb.412.zip (3.8 GiB)
    • 3x2.sb.414.zip (4.7 GiB)
    • 3x2.sb.416.zip (3.9 GiB)
    • 3x2.sb.418.zip (4.0 GiB)
    • 3x2.y.q.zip (3.0 GiB)
    • 3x2.y.zip (2.2 GiB)
    • 3x2.z.372.zip (3.1 GiB)
    • 3x2.z.374.zip (2.8 GiB)
    • 3x2.z.376.zip (3.1 GiB)
    • 3x2.z.378.zip (3.1 GiB)
    • 3x2.z.380.zip (3.1 GiB)
    • 3x4.d.556.zip (2.2 GiB)
    • 3x4.d.562.zip (2.2 GiB)
    • 3x4.d.567.zip (2.0 GiB)
    • 3x4.d.572.zip (1.8 GiB)
    • 3x4.d.577.zip (1.9 GiB)
    • 3x4.d.q.zip (1.1 GiB)
    • 3x4.e.q.zip (2.7 GiB)
    • 3x4.e.zip (3.5 GiB)
    • 3x4.sb.410.zip (2.4 GiB)
    • 3x4.sb.412.zip (2.3 GiB)
    • 3x4.sb.414.zip (2.5 GiB)
    • 3x4.sb.416.zip (2.5 GiB)
    • 3x4.sb.418.zip (2.4 GiB)
    • 3x4.y.q.zip (1.4 GiB)
    • 3x4.y.zip (817.7 MiB)
    • 3x4.z.372.zip (2.1 GiB)
    • 3x4.z.374.zip (1.9 GiB)
    • 3x4.z.376.zip (1.7 GiB)
    • 3x4.z.378.zip (1.9 GiB)
    • 3x4.z.380.zip (2.4 GiB)
    • 7x10.d.556.zip (8.0 GiB)
    • 7x10.d.562.zip (7.3 GiB)
    • 7x10.d.567.zip (6.6 GiB)
    • 7x10.d.572.zip (6.2 GiB)
    • 7x10.d.577.zip (5.9 GiB)
    • 7x10.d.q.zip (3.9 GiB)
    • 7x10.e.q.zip (2.8 GiB)
    • 7x10.e.zip (3.5 GiB)
    • 7x10.sb.410.zip (6.2 GiB)
    • 7x10.sb.412.zip (5.4 GiB)
    • 7x10.sb.414.zip (6.0 GiB)
    • 7x10.sb.416.zip (6.3 GiB)
    • 7x10.sb.418.zip (5.6 GiB)
    • 7x10.y.q.zip (7.5 GiB)
    • 7x10.y.zip (3.6 GiB)
    • 7x10.z.372.zip (5.5 GiB)
    • 7x10.z.374.zip (5.6 GiB)
    • 7x10.z.376.zip (5.3 GiB)
    • 7x10.z.378.zip (5.5 GiB)
    • 7x10.z.380.zip (6.0 GiB)
    • V2022D_files.tsv (58.8 MiB)
    • V2022D_tags.tsv (262.9 MiB)