PSA: You can upload images to a Lemmy instance without anyone knowing

bmygsbvur@lemmy.ca · edit-2 1 year ago

PSA: You can upload images to a Lemmy instance without anyone knowing

sabreW4K3@lemmy.tf · 1 year ago

Perhaps someone should create a script to purge orphan images

Danc4498@lemmy.ml · 1 year ago

Seems like the logical fix

gravitas_deficiency@sh.itjust.works · edit-2 1 year ago

Or, just tighten up the api such that uploaded pictures have a relatively short TTL unless they become attached to a post or otherwise linked somewhere.

A script is a fine stopgap measure, but we should try to treat the cause wherever possible, instead of simply addressing the symptom.

chaorace@lemmy.sdf.org · edit-2 1 year ago

What’s the practical difference? In both cases you’re culling images based on whether they’re orphaned or not.

If you’re suggesting that the implementation be based on setting individual timers instead of simply validating the whole database at regular intervals, consider whether or not the complexity of such a system is actually worth the tradeoff.

“Complexity comshmexity”, you might say. “Surely it’s not a big deal!”. Well… what about an image that used to belong to a valid post that later got deleted? Guess you have to take that edge case into account and add a deletion trigger there as well! But what if there were other comments/posts on the same instance hotlinking the same image? Guess you have to scan the whole DB every time before running the deletion trigger to be safe! Wait… wasn’t the whole purpose of setting this up with individual jobs to avoid doing a scripted DB scan?

gravitas_deficiency@sh.itjust.works · 1 year ago

There are mechanisms that exist in a LOT of services for handling TTL expiry and any relevant purging that needs to be done.

That said, a cursory look at the pict-rs project doesn’t appear to have any provision for TTL, so it’s probably going to have to be done as a cron job anyways - or at least triggered by the lemmy service when an image upload isn’t used in an instance-local lemmy post within some reasonable interval.

Note that I’m specifically including “in an an instance-local post” because I am assuming admins don’t want to provide free cloud image hosting to random internet people for arbitrary non-lemmy use.

chaorace@lemmy.sdf.org · 1 year ago

Note that I’m specifically including “in an an instance-local post” because I am assuming admins don’t want to provide free cloud image hosting to random internet people for arbitrary non-lemmy use.

Note that I at no point allude to hotlinking from outside of the instance. Unless you want it to be possible to create an image post, delete the post, and then have an orphaned image forever (thereby creating an attack vector), you do need to solve that problem. If you solve that problem without considering crossposts and comment hotlinks within the scope of your own instance, you’re going to cause breakage. If you’re forced to consider these things before triggering the deletion regardless, then you’re not saving much on performance.

bmygsbvur@lemmy.ca · 1 year ago

Very much needed.

sunaurus@lemm.ee · edit-2 1 year ago

FYI to all admins: with the next release of pict-rs, it should be much easier to detect orphaned images, as the pict-rs database will be moved to postgresql. I am planning to build a hashtable of “in-use” images by iterating through all posts and comments by lemm.ee users (+ avatars and banners of course), and then I will iterate through all images in the pict-rs database, and if they are not in the “in-use” hash table, I will purge them.

Of course, Lemmy can be improved to handle this case better as well!

homesnatch@lemmy.one · 1 year ago

Because anyone can upload illegal images without the admin knowing and the admin will be liable for it.

The admin/company isn’t liable until it is reported to them and they don’t do anything about it… That’s how all social media sites work, Google isn’t immediately liable if you upload illegal materials to GDrive and share it anonymously.

bmygsbvur@lemmy.ca · 1 year ago

Doesn’t change the fact that this is an issue. Besides, do you think American law applies everywhere?

cwagner@lemmy.cwagner.me · edit-2 1 year ago

deleted by creator

bmygsbvur@lemmy.ca · 1 year ago

Good point but also consider disabling pictrs until they fix the caching problem!

infamousbelgian@waste-of.space · 1 year ago

Is there a description on how to disable pictrs?

bmygsbvur@lemmy.ca · 1 year ago

Remove it from docker compose.

cwagner@lemmy.cwagner.me · edit-2 1 year ago

deleted by creator

Dandroid@dandroid.app · 1 year ago

It didn’t work that easily for me. I had to redirect all pictrs traffic to 404 in my nginx config.

rektifier@sh.itjust.works · 1 year ago

Wasn’t facebook also found to store images that were uploaded but not posted? This is just a resource leak . I can’t believe no one has mentioned this phrase yet. I’m more concerned about DoS attacks that fill up the instance’s storage with unused images. I think the issue of illegal content is being blown out of proportion. As long as it’s removed promptly (I believe the standard is 1 hour) when the mods/admins learn about it, there should be no liabilities. Otherwise every site that allows users to post media would be dead by now.

newline@feddit.nl · 1 year ago

I’m a pentester and security consultant. From my point of view, this vulnerability has more impact than just a resource leak or DOS. We all know how often CSAM or other illegal material is uploaded to communities here as actual posts (where hundreds of viewers run into it to report it). Now imagine them uploading it and spreading it like this, and only the admin can catch it if they goes out of their way to check it?

I wouldn’t call this a high risk issue for sure. But a significant security risk regardless.

bmygsbvur@lemmy.ca · 1 year ago

Whether it’s illegal content or storage-filling DoS attacks, the issue needs to be addressed.

Nerd02@lemmy.basedcount.com · 1 year ago

I’m an instance administrator, what the fuck do I do?

There’s one more option. The awesome @[email protected] has made this tool to detect and automatically remove CSAM content from a pict-rs object storage.

https://github.com/db0/lemmy-safety

bmygsbvur@lemmy.ca · 1 year ago

This is a nice tool but orphaned images still need to be purged. Mentioned on the other thread that bad actors can upload spam to fill up object storage space.

PuppyOSAndCoffee@lemmy.ml · 1 year ago

wouldn’t it be just as easy to whitelist DNS?

shagie@programming.dev · 1 year ago

Try turning wifi off on your phone, getting the IP address, and then looking up the DNS entry for that and consider if you want to whitelist that? And then do this again tomorrow and check to see if it has a different value.

Once you get to the point of “whitelist everything in *.mobile.att.net” it becomes pointless to maintain that as a whitelist.

Likewise *.dhcp.dorm.college.edu is not useful to whitelist.

PuppyOSAndCoffee@lemmy.ml · edit-2 1 year ago

Yes. I am well aware and that would be by design.

remember - if someone on a major mobile network is uploading child photography, that device is radioactive and an instance admin is going to have options they may not have in other situations.

The idea is give instance admins control over who uploads content. Perhaps they don’t want mobile users to upload content, or perhaps they do but only major carriers, by their own definition of major.

Somewhere between “everyone” and “nobody” is an answer.

giving the instance administrator tools to help quarantine bad actors only helps, which will require layers. Reverse DNS is a cost, however; perhaps the tax is worth it when hosting images, where there is already a pause point in the end user experience, and the ramifications so severe.

Larger instances may dilligaf but a smaller instance may need to be very careful…

Just sayin…

shagie@programming.dev · 1 year ago

Do you have a good and reasonable reverse DNS entry for the device you’re writing this from?

FWIW, my home network comes nat’ed out as {ip-addr}.res.provider.com.

Under your approach, I wouldn’t have any system that I’d be able to upload a photo from.

PuppyOSAndCoffee@lemmy.ml · 1 year ago

why do you say that, knowing full well DNS whitelists rely on wildcards?

shagie@programming.dev · 1 year ago

If you’re whitelisting *.res.provider.com and *.mobile.att.com the whitelist is rather meaningless because you’ve whitelisted almost everything.

If you are not going to whitelist those, do you have any systems available to you (because I don’t) that would pass a theoretical whitelist that you set up?

PuppyOSAndCoffee@lemmy.ml · 1 year ago

Why does it matter? Read some of my other posts.

shagie@programming.dev · 1 year ago

Would you be able to post an image if neither *.res.provider.com nor *.mobile.att.com were whitelisted and putting 10-11-23-45.res.provider.com (and whatever it will be tomorrow) was considered to be too onerous to put in the whitelist each time your address changed?

cwagner@lemmy.cwagner.me · edit-2 1 year ago

deleted by creator

PuppyOSAndCoffee@lemmy.ml · 1 year ago

An option to prevent users to upload unless their DNS has been whitelisted. It would require explicit permission to upload, which could be handy for smaller instances.

cwagner@lemmy.cwagner.me · edit-2 1 year ago

deleted by creator

PuppyOSAndCoffee@lemmy.ml · 1 year ago

Not IP. DNS whitelist. This way if a geography or subnet is responsible for illegal material they are only allowed in if an instance granted +w.

cwagner@lemmy.cwagner.me · edit-2 1 year ago

deleted by creator

PuppyOSAndCoffee@lemmy.ml · 1 year ago

Every person on the internet has a DNS record that loops back to them. The DNS has a topography so that various elements of a domain could be whitelisted, or not.

It would be trivial to queue a request to white list, where an administrator could decide if it is worth it, having it auto expire over time.

Instance admins could share sources of bad actors.

heuristics could help determine the risk of an approval action.

cwagner@lemmy.cwagner.me · edit-2 1 year ago

deleted by creator

bmygsbvur@lemmy.ca · 1 year ago

Explain.

Ricaz@lemmy.ml · 1 year ago

He probably means whitelisting domains when posting already uploaded images, clearly not having read the post

bmygsbvur@lemmy.ca · 1 year ago

That’s another issue. Also a necessary feature.

PuppyOSAndCoffee@lemmy.ml · 1 year ago

No I mean the user’s DNS should be whitelisted to permit uploads. If DNS not on whitelist then no upload, period.

Ricaz@lemmy.ml · 1 year ago

What do you mean by “the user’s DNS” exactly??

Krapulaolut@sopuli.xyz · 1 year ago

Yes it would, if the problem had anything to do with the DNS.

PuppyOSAndCoffee@lemmy.ml · 1 year ago

The problem is people and people have DNS.

Dandroid@dandroid.app · 1 year ago

'm an instance administrator, what the fuck do I do?

Check your pictrs images (good luck) or nuke it. Disable pictrs, restrict sign ups, or watch your database like a hawk. You can also delete your instance.

How? I have checked, and there doesn’t seem to be any way to see the photos on my server.

I actually shut down pictrs entirely on my instance. Running pictrs in its current state is criminally negligent imo.

bmygsbvur@lemmy.ca · 1 year ago

They are stored in the pctrs folder. They don’t have file extensions but are viewable with many image programs.

Kecessa@sh.itjust.works · 1 year ago

Pedo trolls will be the death of Lemmy, you heard it here first!

bmygsbvur@lemmy.ca · 1 year ago

Which is why we need to act now.

AphoticDev@lemmy.dbzer0.com · 1 year ago

This can be solved very easily by a cron job to clean out the folder periodically, if you’re worried about it.

bmygsbvur@lemmy.ca · 1 year ago

Very easily you say? Maybe tell us what this cron job is so we can all add it?

AphoticDev@lemmy.dbzer0.com · 1 year ago

Just make a cron that runs the rm command every day or whatever to clean out the files. Then run a SQL query at the same time to truncate any draft posts in the database. There’s no logic to this method, it just clears out the files and records related to draft posts, but it’s fast and effective.

There’s a small chance it might fuck somebody up if they were writing a post at that exact moment, but you can schedule the cron for when your instance is the quietest.

30021190@lemmy.cloud.aboutcher.co.uk · 1 year ago

Other than fulling up storage, what is the actual issue? If the image is orphaned then surely nobody can actually access the content? Sure you could be blind hosting things but if nobody can get the content back out then the abuse is surely minimal apart from say a complex cyber and physical targetted campaign or simply fulling up storage…

bmygsbvur@lemmy.ca · 1 year ago

The issue is that you can share the image link to other people. People CAN get the content back out and admins or moderators WILL NOT KNOW about it.

So if someone uploads an illegal image in the comments, copies the link and does not post the comment, then they have a link of an illegal image hosted on someone’s Lemmy instance. They can share this image to other people or report it to the FBI. Admins won’t know about this UNLESS they look at their pictrs database. Nobody else can see it so nobody can report it.

@ryona.agency · 1 year ago

@bmygsbvur Pleroma is exactly the same and no one cared in six years.

bmygsbvur@lemmy.ca · 1 year ago

Doesn’t change the fact that this is an issue.

Rentlar@lemmy.ca · 1 year ago

How come this hasn’t been addressed before?

Because pictrs and most other components of Lemmy was designed for a much smaller use case by a very small development team. It was designed primarily by people volunteering their time and expertise. Most of the contributors have other things to do on a full-time basis. If you really want to see a change like this implemented NOW, then code it in yourself, file a new issue directly on their page with potential solutions, or donate to the people working on it.

Your post is good for the most part, but my patience is limited for the kind of entitled attitude you show under that heading specifically. Thanks for hearing me out.

bmygsbvur@lemmy.ca · 1 year ago

Entitled attitude? I’m just bringing it up again. It was brought up some time ago but wasn’t given attention so I’m bringing it up again after the recent CSAM attacks.

I didn’t demand anything in the post. I brought up the issue, explained why it’s important, and what admins could do about it.

I don’t know how to code but that doesn’t mean I’m not allowed to bring this issue to light…

Rentlar@lemmy.ca · edit-2 1 year ago

I have no issue with your post itself and discussing this issue it is important to highlight things like this. Thank you for bringing it up, and sorry if I sound mad at you for doing that.

I will point out, the specific thing that bothers me is that the heading

How come this hasn’t been addressed before?

contains an incomplete answer that ignores work that is currently in progress by devs to address. I don’t blame you for not knowing the answer but for including and answering that question when you don’t know the answer. To me it’s reminiscent of Tucker Carlson-style questioning, where some issue is brought up, questions are asked but then the answer is sparsely researched and the viewer is expected to come to some conclusion of who to blame. This specifically is what gets on my nerves.

If you can include where work to rectify the issue had been discussed and is in progress like github issues, discussion throughout Lemmy and other things, I’ll edit my first reply to note my concern is assuaged.

E: Here are some of the relevant issues and discussion:

This sort of thing has been discussed since 2020
Adding admin purging of DB items and pictures. #904 #1331 #1809
#3504, #2277, these and related issues are still open.

bmygsbvur@lemmy.ca · 1 year ago

I don’t care if you don’t like my English writing. I brought up the issue and if people don’t care about it then whatever. We’ll just have to wait until it’s abused then maybe people will be actually concerned.

habanhero@lemmy.ca · 1 year ago

OP is flagging a legitimate issue that can actually put instance owners at risk. Raising the issue that instance owners can unwittingly host illegal content and be liable for it - how is that entitled?

Totally understand that Lemmy devs are a small team, but the growth of use of the software is exploding now, and not being able to keep up is a problem of scale - gatekeeping others from raising issues does not help it get better and in fact discourages issue reports and promotes a head-in-the-sand culture.

Rentlar@lemmy.ca · edit-2 1 year ago

I understand and raising the issue and discussion is fine. With all due respect to OP, I take it personally when the discussion is framed with the implication that the developers should not have released a project with some bugs and they should have put more effort here or there. I’ve contributed to Lemmy both in coding, translation and small donations, but I’m not here for people to push blame on devs. This is why bringing up the question “Why hasn’t anything been done?”, while I recognize it is a question on some people’s minds, it gets on my nerves. It bothers me like a clickbait/ragebait title does for many.

I would rather the discussion focus on where efforts are made or will be made to mitigate and fix the problem.

Coki91@dormi.zone · 1 year ago

Entitlement? The “Subtitles” are acting as a Question the reader may have, and below the answer, OP is not demanding anything

Rentlar@lemmy.ca · 1 year ago

Fair point. That question itself is what bothers me even if it is a valid one people have on their minds. The answer to that question should highlight more clearly what has been done, and if OP doesn’t know, then IMO it would best be to not include that question/answer.

I have no problems with OP’s post and the fact to bring up this issue and dicsuss it. Including that question with an incomplete answer bothers me like a clickbait headline for an article does, or how Tucker Carlson’s show asks questions. This serves little purpose but put the people working on fixes in a bad light acting like they haven’t been working on anything.