He allegedly used Stable Diffusion, a text-to-image generative AI model, to create “thousands of realistic images of prepubescent minors,” prosecutors said.
This is tough, the goal should be to reduce child abuse. It’s unknown if AI generated CP will increase or reduce child abuse. It will likely encourage some individuals to abuse actual children while for others it may satisfy their urges so they don’t abuse children. Like everything else AI, we won’t know the real impact for many years.
I don’t know if we can say for certain it needs to be in the dataset, but I do wonder how many of the other models used to create CSAM are also trained on CSAM.
Generative Machine Learning models have been well documented as being able to produce explicit adult content, including child sexual abuse material (CSAM)
You can’t generate CSAM because there’s no C to A.
It should be illegal either way, to be clear. But you think theyre not training models on CSAM? Youre trusting in the morality/ethics of people creating AI generated child pornography?
The training doesn’t use csam, 0% chance big tech would use that in their dataset. The models are somewhat able to link concept like red and car, even if it had never seen a red car before.
Well, with models like SD at least, the datasets are large enough and the employees are few enough that it is impossible to have a human filter every image. They scrape them from the web and try to filter with AI, but there is still a chance of bad images getting through. This is why most companies install filters after the model as well as in the training process.
You make it sound like it is so easy to even find such content on the www. The point is, they do not need to be trained on such material. They are trained on regular kids, so they know their sizes, faces, etc. They’re trained on nude bodies, so they also know how hairless genitals or flat chests look like. You don’t need to specifically train a model on nude children to generate nude children.
We’re trusting that billion-dollar corporate efforts don’t possess and label hyper-illegal images, specifically so people can make more of them. Because why the fuck would they.
‘Google would love to be in the child pornography business’ is quite a fucking take.
These assholes are struggling to stop their networks from generating Mickey Mouse even when someone specifically asks for Mickey Mouse. Why would any organization that size want radioactive criminal-to-possess inputs stirred into their venture-capital cash cow?
They’re fine with platforming fascists for a buck. Why would they have a problem with kid porn, especially if they can maintain a veneer of plausible deniability
I suggest you actually download stable diffusion and try for yourself because it’s clear that you don’t have any clue what you’re talking about. You can already make tiny people, shaved, genitals, flat chests, child like faces, etc. etc. It’s all already there. Literally no need for any LoRAs or very specifically trained models.
This is tough, the goal should be to reduce child abuse. It’s unknown if AI generated CP will increase or reduce child abuse. It will likely encourage some individuals to abuse actual children while for others it may satisfy their urges so they don’t abuse children. Like everything else AI, we won’t know the real impact for many years.
How do you think they train models to generate CSAM?
Some of yall need to lookup what an LoRA is
Lol you don’t need to train it ON CSAM to generate CSAM. Get a clue.
https://purl.stanford.edu/kh752sm9123
I don’t know if we can say for certain it needs to be in the dataset, but I do wonder how many of the other models used to create CSAM are also trained on CSAM.
You can’t generate CSAM because there’s no C to A.
It should be illegal either way, to be clear. But you think theyre not training models on CSAM? Youre trusting in the morality/ethics of people creating AI generated child pornography?
The use of CSAM in training generative AI models is an issue no matter how these models are being used.
The training doesn’t use csam, 0% chance big tech would use that in their dataset. The models are somewhat able to link concept like red and car, even if it had never seen a red car before.
Well, with models like SD at least, the datasets are large enough and the employees are few enough that it is impossible to have a human filter every image. They scrape them from the web and try to filter with AI, but there is still a chance of bad images getting through. This is why most companies install filters after the model as well as in the training process.
You make it sound like it is so easy to even find such content on the www. The point is, they do not need to be trained on such material. They are trained on regular kids, so they know their sizes, faces, etc. They’re trained on nude bodies, so they also know how hairless genitals or flat chests look like. You don’t need to specifically train a model on nude children to generate nude children.
We’re trusting that billion-dollar corporate efforts don’t possess and label hyper-illegal images, specifically so people can make more of them. Because why the fuck would they.
If there was more money to be made than the cost of defending it they most definitely would.
‘Google would love to be in the child pornography business’ is quite a fucking take.
These assholes are struggling to stop their networks from generating Mickey Mouse even when someone specifically asks for Mickey Mouse. Why would any organization that size want radioactive criminal-to-possess inputs stirred into their venture-capital cash cow?
They’re fine with platforming fascists for a buck. Why would they have a problem with kid porn, especially if they can maintain a veneer of plausible deniability
I suggest you actually download stable diffusion and try for yourself because it’s clear that you don’t have any clue what you’re talking about. You can already make tiny people, shaved, genitals, flat chests, child like faces, etc. etc. It’s all already there. Literally no need for any LoRAs or very specifically trained models.
Does an AI image of Shrek riding an avocado motorcycle imply there’s a bunch of images of that, in the data set?