New servers and file optimisation request!

Status
Not open for further replies.
Group Leader
Joined
Jun 20, 2018
Messages
307
@Rugid Thank you, I managed to get it to work.

@Ou_Ki Wow, didn't know this existed. This is even better than using actions. Thank you!
 
Group Leader
Joined
Jul 18, 2019
Messages
486
@Rugid um ... could you be kinder, my heart is a little disturbed by your sentence, if you want to give advice please use a better sentence, you know. And I just used Photoshop one year ago by self-taught learning, so I still don't understand. but thanks for your advice, maybe it's just me being stupid so I can't understand that ...😞
 
Power Uploader
Joined
Dec 16, 2019
Messages
225
Wow, I thought PNGGauntlet'd be good but it took 3min for a compressed size 300KB bigger than Pngyu, both lossless. Guess Pngyu's (it uses pngquant) indeed the best when it's able to go lossless, unless PingaGUI is better than PNGGauntlet
 

lrb

Joined
Jan 18, 2018
Messages
12
@ixlone is there any way I could download batches of images to process and re-upload? I'm not a scanlator, but I'd like to contribute if I can.
 
Aggregator gang
Joined
Jan 25, 2018
Messages
521
@Santiasan
Okay, I see what's going on. You are working with low-resolution, heavily-compressed JPGs of full-color images. The very first link in your post, which is supposed to be the original image (but clearly isn't because the text has already been removed from it), is already destroyed by compression artifacts. PNG will not help, and neither will reducing the color depth to 8 bit.

First off, where do you (or the cleaner who removes the text) get the original image from? I ripped out a page from chapter 1 hosted on Kakao which I assume to be the raw (720×1098, 66 KB), using the browser's built in Inspect Element tool. By itself, the image is already well-optimized (Kakao clearly cares about preserving bandwidth!), but since we still need to typeset it, we'll have to decide how to compress the result when we're done editing the image.

Doing so in Photoshop CS6, here's what I got by recompressing it with JPG (Save for Web... → JPEG, 80%, progressive) and running it through File Optimizer to trim off the fat: 91.2 KB. Yeah, we got a 38% size bloat, but we got it to look nearly indistinguishable from the raw. As the original was already well-compressed with no leeway for safe size reduction, if we had tried to match its size, we wouldn't be able to avoid noticeable quality loss. So that's how JPG fares.

How about PNG? Since the source is full-color and has soft color gradients, the only option to avoid major quality loss would be to save it as 24-bit, fully preserving fidelity at the cost of bloating the size to 302 KB (a massive 358% increase!). Worth it? I think not. Reducing the color depth to 8 bit would allow us to just barely beat the 80% quality JPG's size but immediately produce horrible color banding, so that is out of question. Even dithering won't help avoid very noticeable grain, not to mention it would explode the filesize again in this case (131 KB). Don't bother with it. Finally, there's the unpopular option of using the lossy mode of PNG which can be accessed by e.g. checking "allow lossy optimizations" on File Optimizer's PNG tab. This way we can reduce the size by a 1/3 to 203 KB at the cost of a minor fidelity loss (which in this case, somewhat ironically, smoothened out some of the artifacts originally present in the raw). Worth it? Not at all.

Hence, use JPG with high quality settings (80–85%, optimized or progressive) for full color digital images. And don't use second-hand originals; take the raws from the source if possible. Free webtoons allow easy (if a bit tedious) extraction of images from the online reader pages.
 
MD@Home
Joined
Jan 18, 2018
Messages
249
don't know if it was already said before, but shouldn't this be done on the server side when uploaded? youtube does this after every upload too.
 
Joined
Mar 29, 2019
Messages
27
This shouldn't be optional, it should be an automatic part of what MangaDex does when images are uploaded - lossless file size optimization. Wouldn't hurt to carefully go apply that logic to all current images too.

If it's not automatic, than uploads should FAIL if the file size is > 5% more than the optimized version.

This policy is to not allow people to carelessly harm the bandwidth of everyone else. Even before the bandwidth problems some images were clearly 10x larger than they needed to be, and some chapters would take a couple minutes to load.
 
Aggregator gang
Joined
Jan 25, 2018
Messages
521
@SirDuncan
If it's not automatic, than uploads should FAIL if the file size is > 5% more than the optimized version.
It's impossible to tell how far off the mark the uploaded files are unless you attempt to optimize them upon retrieval. So both of your suggested options presume server-side optimization.

I believe the reason why this is a call to uploaders and not a mandatory server-side procedure is that the server-side resources are limited and could create upload congestion on busy days if every chapter is processed this way upon uploading. Besides, it's better if uploaders pay more attention to this, too, and look into other aspects of size optimization on their own (such as the ones I posted earlier).
 
Joined
Mar 29, 2019
Messages
27
@moozooh
Yes, it needs to be detected eventually either way. I'd favor it being automatic, it should just happen like every other image posting service on the internet.

If posters want control of the minimization process settings, they should take change of the minimization ahead of time. The 5% (or 10%) allows MD to see that the user already did the minimization and to let the file pass unmodified.

Then uploadee sees either "Files are minimized, thank you, and will not be modified" OR "Your files are not minimized and have been automatically shrunk by 43%". Everyone wins.

But yes I realizes compute and programming resources are in short supply.
 
Fed-Kun's army
Joined
Jan 19, 2018
Messages
860
@Holo

VERY VERY NOICE! This needs to be a switch on the side panel!

I can now read native longstrip manhwa! yaaas yaas yaas!

The only glitch I've found so far is that GIFs won't load. e.g. https://mangadex.org/chapter/872834/3
 

e97

Joined
Jan 25, 2020
Messages
3
https://www.similarweb.com/website/mangadex.org#overview says about 40M page views per month = ~1.3M per day which is easily doable on micro AWS servers.

If using WordPress, caching can be a huge benefit!

Also what about a free tier CDN? Then could help alleviate bandwidth costs, though hitting limits is probably likely.
 
Miku best girl
Admin
Joined
May 29, 2012
Messages
1,442
Similarweb is wrong - we're approaching 200 million page views per month, and that's traffic that google analytics can detect. It doesn't include all the users with adblock.

AWS costs an arm and leg for bandwidth.
 
Aggregator gang
Joined
Jan 25, 2018
Messages
521
@SirDuncan
I mean if MD optimizes on their end, there is actually no reason to check if uploaders do that. It won't decrease the amount of work in any way.
 
Joined
Dec 15, 2019
Messages
30
@Holo

Bunny CDN overs a volume plan CDN with lesser pricing than most, but with less point of presence. You can outsource some of your traffic to that.
 
Supporter
Joined
Oct 2, 2018
Messages
527
@Santiasan He doesn't mean any harm. I work with people like that, blunt to a point but not malicious about it. Don't let it get to you, dude's helping people and helping the community regardless of how it comes across.
 
Joined
Oct 23, 2018
Messages
26
I think I see some scanners in here, so I just wanted to thank everyone for their work. As someone who has none of the technical skills to do any of this, it means a lot that you all put your all into getting all the manga you do translated for us.
 
Fed-Kun's army
Joined
Jan 20, 2018
Messages
1,405
It's crazy to me that PNG usage was known 10-15 years ago in the scanlation community and it's been forgotten over time 😱

or you know, young people just not looking at how it was done before because they don't care.
 
MD@Home
Joined
Jan 18, 2018
Messages
249
@moozooh

I believe the reason why this is a call to uploaders and not a mandatory server-side procedure is that the server-side resources are limited and could create upload congestion on busy days if every chapter is processed this way upon uploading.

chapters aren't uploaded that frequently... go to the front page and keep refreshing the page until you see a new chapter was uploaded, they're usually several minutes between each other, plenty of time to compress the images.
 
Status
Not open for further replies.

Users who are viewing this thread

Top