Setting aside the usual arguments on the anti- and pro-AI art debate and the nature of creativity itself, perhaps the negative reaction that the Redditor encountered is part of a sea change in opinion among many people that think corporate AI platforms are exploitive and extractive in nature because their datasets rely on copyrighted material without the original artists’ permission. And that’s without getting into AI’s negative drag on the environment.

  • Even_Adder@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    10
    arrow-down
    10
    ·
    edit-2
    8 months ago

    Yes, using existing works as reference is obviously something that real human artists do all the time, there’s no arguing that is the case. That’s how people learn to create art to begin with.

    But, the fact is, generative AI is not creative, nor does it understand what creativity is, nor will it ever. Because all it is doing is performing complex data statistical analysis algorithms to generate a matrix of pixels or a string of words.

    Im sorry, but the person entering in the prompt to instruct the algorithm is also not doing anything creative either. Do you think it is art to go through a fast food drive through and place an order? That’s what people are objecting to - people calling themselves artists because they put some nonsense word salad together and then think what they get out of it is some unique thing that they feel they created and take ownership of. If not for the AI model they are using and the creative works it was trained on, they could not have created it or likely even imagined it without it.

    I’d like to ask you what experience you have with generative art, because I’d like to explain a bit of what I know,

    There’s also a spectrum of involvement depending on what tool you’re using. I know with web based interfaces don’t allow for a lot of freedom due to wanting to keep users from generating things outside their terms of use, but with open source models based on Stable Diffusion you can get a lot more involved and get a lot more freedom. We’re in a completely different world from March 2023 as far as generative tools go. Take a quick look at things work.

    Let’s take these generation parameters for instance: sarasf, 1girl, solo, robe, long sleeves, white footwear, smile, wide sleeves, closed mouth, blush, looking at viewer, sitting, tree stump, forest, tree, sky, traditional media, 1990s \(style\), <lora:sarasf_V2-10:0.7>

    Negative prompt: (worst quality, low quality:1.4), FastNegativeV2

    Steps: 21, VAE: kl-f8-anime2.ckpt, Size: 512x768, Seed: 2303584416, Model: Based64mix-V3-Pruned, Version: v1.6.0, Sampler: DPM++ 2M Karras, VAE hash: df3c506e51, CFG scale: 6, Clip skip: 2, Model hash: 98a1428d4c, Hires steps: 16, "sarasf_V2-10: 1ca692d73fb1", Hires upscale: 2, Hires upscaler: 4x_foolhardy_Remacri, "FastNegativeV2: a7465e7cc2a2",

    ADetailer model: face_yolov8n.pt, ADetailer version: 23.11.1, Denoising strength: 0.38, ADetailer mask blur: 4, ADetailer model 2nd: Eyes.pt, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur 2nd: 4, ADetailer confidence 2nd: 0.3, ADetailer inpaint padding: 32, ADetailer dilate erode 2nd: 4, ADetailer denoising strength: 0.42, ADetailer inpaint only masked: True, ADetailer inpaint padding 2nd: 32, ADetailer denoising strength 2nd: 0.43, ADetailer inpaint only masked 2nd: True

    To break down a bit of what’s going on here, I’d like to explain some of the elements found here. sarasf is the token for the LoRA of the character in this image, and <lora:sarasf_V2-10:0.7> is the character LoRA for Sarah from Shining Force II. LoRA are like supplementary models you use on top of a base model to capture a style or concept, like a patch. Some LoRA don’t have activation tokens, and some with them can be used without their token to get different results.

    The 0.7 in <lora:sarasf_V2-10:0.7> refers to the strength at which the weights from the LoRA are applied to the output. Lowering the number causes the concept to manifest weaker in the output. You can blend styles this way with just the base model or multiple LoRA at the same time at different strengths. Furthurmore you can adjust the UNet and Text Encoder by adding another colon like so : <lora:sarasf_V2-10:1:0.7> for even more varied results. Doing this allows you to separate the “idea” from the “look” of the LoRA. You can even use a monochrome LoRA and take the weight into the negative to get some crazy colors.

    The Negative Prompt is where you include things you don’t want in your image. (worst quality, low quality:1.4), here are quality tags and have their attention set to 1.4. Attention is sort of like weight, but for tokens. LoRA bring their own weights to add onto the model, whereas attention on tokens works completely inside the weights they’re given. In this negative prompt FastNegativeV2 is an embedding known as a Textual Inversion. It’s sort of like a crystallized collection of tokens that tell the model something precise you want without having to enter the tokens yourself or mess around with the attention manually. Embeddings you put in the negative prompt are known as Negative Embeddings.

    In the next part, Steps stands for how many steps you want the model to take to solve the starting noise into an image. More steps take longer. VAE is the name of the Variational Autoencoder used in this generation. The VAE is responsible for working with the weights to make each image unique. A mismatch of VAE and model can yield blurry and desaturated images, so some models opt to have their VAE baked in, Size are the dimensions in pixels the image will be generated at. Seed is the number representation of the starting noise for the image. You need this to be able to reproduce a specific image.

    Model is the name of the model used, and Sampler is the name of the algorithm that solves the noise into an image. There are a few different samplers, also known as schedulers, each with their own trade-offs for speed, quality, and memory usage. CFG is basically how close you want the model to follow your prompt. Some models can’t handle high CFG values and flip out, giving over-exposed or nonsense output. Hires steps represents the amount of steps you want to take on the second pass to upscale the output. This is necessary to get higher resolution images without visual artifacts. Hires upscaler is the name of the model that was used during the upscaling step, and again there are a ton of those with their own trade-offs and use cases.

    After ADetailer are the parameters for Adetailer, an extension that does a post-process pass to fix things like broken anatomy, faces, and hands. We’ll just leave it at that because I don’t feel like explaining all the different settings found there.

    https://youtu.be/-JQDtzSaAuA?t=97

    https://youtu.be/1d_jns4W1cM

    https://www.youtube.com/watch?v=HtbEuERXSqk

    Not all selfies are art, but you can make art with cameras. I think the same applies here.

    People are actively losing their livelihoods because AI tech is being oversold and overhyped as something that it’s not. Execs are all jumping on the bandwagon and because they see AI as something that will save them a bunch of money, they are laying off people they think aren’t needed anymore. So, just try to incorporate that sentiment into your understanding of why people are also upset about AI. You may not be personally affected, but there are countless that are. In fact, over the next two years, as many as 203,000 entertainment workers in the US alone could be affected

    This EFF article by Katharine Trendacosta and Cory Doctorow touches on this. I think it’s worth a read.

    You want to have fun creating fancy kitbashed images based off of other people’s work, go right ahead.

    This is misinformation, and not how the technology works. Here’s a quick video explanation,

    Just don’t call it art and call yourself an artist, unless you could actually make it yourself using practical skills.

    This is just snobbery that people have always used to devalue the efforts of others. Punching down and gatekeeping won’t solve your problems, the people you’re really mad at are above you.

    Art is about bringing your ideas into the world, anything beyond that is fetish. Spending hundreds of hours learning a skill isn’t art, it’s work. While I believe the effort invested in a work can contribute to its depth and meaning, that doesn’t make them better than works without as much effort.

    cont.

    • deepblueseas
      link
      fedilink
      English
      arrow-up
      8
      arrow-down
      5
      ·
      8 months ago

      I appreciate that you taken time to explain the technical aspects into what generative AI is processing under the hood, but the reality is that no amount of programming will ever be able to recreate the uniqueness and infinite variability of human creativity, emotion, imagination or consciousness. There is an immeasurable difference between true creativity and producing variations on a data set. I say this as both an artist and a programmer. I’m not just talking out of my ass.

      I agree with you that a goal of art is to express ideas and that there’s are a lot of people in the art world that fetishize art in to being something more important than it is in certain contexts, but art is also a core component and something unique to humanity(and sometimes even to other species.) In that way, it’s something to be cherish and regarded - and throughout history it has been extremely culturally significant. Trying to translate these concepts into an algorithm, in my mind, nothing but an extremely arrogant waste of effort and time. Why not spend your time automating the boring shit no one wants to do rather than the creative things people actually enjoy doing?

      I am not gatekeeping. I am just stating simple facts. I find it offensive and demeaning that you are devaluing the immense amount of effort that artists undergo to hone their crafts and produce art. You’re damn right it’s work - if you want to get proficient at something, that’s what it takes. I don’t care how boomer-ey that sounds. Yes, some artists have natural talent and don’t need as much effort as others. But, nonetheless, effort is required to create. Anyone can create art, not just some elite select few. But, not everyone can create art that is universally recognized as great or masterful, and it’s not a problem that need to be solved by technology. Unfortunately, art is subjective, so not everything one creates is perceived the same. That’s why some are more successful than others. You may argue that AI levels the playing field, but the fact is that it leverages the work of “successful” artists or artworks, and generates results that are perceived as successful or appealing as a result. It’s a shortcut. You are bypassing the effort otherwise needed by using a tool, which allows most users to to be totally ignorant of the basic knowledge required to create an art work - shape language, color theory, composition, lighting, appeal, posing, etc.

      Entering a prompt into an AI model is akin to directing, producing or acting as a muse. It’s a very similar argument as to the validity or artistic merits of factory artists like Andy Warhol or Jeff Koons - While you are responsible for the idea that produces a result, you are still relying on the work and effort of not only the numerous team of people creating the AI model and its algorithms , but also the immeasurable amount of man-hours and creativity involved in creating the source content for the model training materials.

      It’s one thing to use generative AI as tool, with intent to make use of the output as reference for your own work in a larger context. But to take the direct output and call it art is morally and ethically wrong. In my eyes, it makes you look like a total hack who doesn’t want to put the effort in to make things for themselves…no matter how much time you put into coming up with the prompt for the output.

      I still stand by my original arguments - coming up with a prompt or a training data set to create an image is not art, because you are not actively involved with the creation of the imagery, itself. What an AI model generates is not a creative work and it is not your creation. If that is offensive to you, there’s nothing I can do about that, because it’s apparent that your arguments only serve to make yourself feel better about using generative AI.

      It’s also apparent that you have an extremely skewed view of what art is and what it means to be an artist. Art, at its base level is about expressing HUMAN creativity, not what an algorithm interprets it to be. It’s about making countless, specific choices for each step of the creative process and having complete control of the final outcome. It’s those choices that make your art truly unique and an expression of your creative vision. It doesn’t matter if it is objectively bad or good, just that it came from you, and that every detail, every color, every line, was your choice, not an interpretation of your words.

      Unless you are creating your own AI model from scratch and training it purely on your own artworks, I don’t see how you can, in good conscience, claim the results to be your own.

      Any one can create art, but an artist is someone who dedicates themselves to their craft, as with any other craftsman. That passion is what separates an artisan from a hobbyist. You may view this as snobbery, but I view it as respect and honoring a tradition that spans all of human kind, back to the earliest cave paintings tens of thousands of years ago. I know my limits and what I’m capable of and I have come to terms with those deficiencies in my work. I’m not delusional enough to think that by generating an image through AI, it somehow makes up for those shortcomings and makes me into something I am not.

      • Even_Adder@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        4
        ·
        8 months ago

        I appreciate that you taken time to explain the technical aspects into what generative AI is processing under the hood, but the reality is that no amount of programming will ever be able to recreate the uniqueness and infinite variability of human creativity, emotion, imagination or consciousness. There is an immeasurable difference between true creativity and producing variations on a data set. I say this as both an artist and a programmer. I’m not just talking out of my ass.

        The goal here isn’t to replace human uniqueness, creativity, emotion, imagination, or consciousness, but to give people a robust tool to help them explore concepts and express themselves.

        I agree with you that a goal of art is to express ideas and that there’s are a lot of people in the art world that fetishize art in to being something more important than it is in certain contexts, but art is also a core component and something unique to humanity(and sometimes even to other species.) In that way, it’s something to be cherish and regarded - and throughout history it has been extremely culturally significant. Trying to translate these concepts into an algorithm, in my mind, nothing but an extremely arrogant waste of effort and time. Why not spend your time automating the boring shit no one wants to do rather than the creative things people actually enjoy doing?

        If it allows people to more effectively communicate, express themselves, learn, and come together, it’s worth the trouble. The more people can participate in these conversations, the more we can all learn.

        I am not gatekeeping. I am just stating simple facts. I find it offensive and demeaning that you are devaluing the immense amount of effort that artists undergo to hone their crafts and produce art. You’re damn right it’s work - if you want to get proficient at something, that’s what it takes. I don’t care how boomer-ey that sounds. Yes, some artists have natural talent and don’t need as much effort as others. But, nonetheless, effort is required to create. Anyone can create art, not just some elite select few. But, not everyone can create art that is universally recognized as great or masterful, and it’s not a problem that need to be solved by technology. Unfortunately, art is subjective, so not everything one creates is perceived the same. That’s why some are more successful than others. You may argue that AI levels the playing field, but the fact is that it leverages the work of “successful” artists or artworks, and generates results that are perceived as successful or appealing as a result. It’s a shortcut. You are bypassing the effort otherwise needed by using a tool, which allows most users to to be totally ignorant of the basic knowledge required to create an art work - shape language, color theory, composition, lighting, appeal, posing, etc.

        You are putting words in my mouth. I never devalued anyone’s work, unless you read that when I said I don’t think pieces that take more work or skill aren’t inherently worth more than those that were easier to produce. Is it even possible for there to be shortcuts in art? It’s harder to erase a line and fix it on canvas than it is to draw an incorrect one and just resize it. Let’s not even talk about things like shrinking a whole head to make it fit. Where in the gradient from canvas to AI does creating become cheating?

        That reminds me of a quote from over a hundred years ago:

        As the photographic industry was the refuge of every would-be painter, every painter too ill-endowed or too lazy to complete his studies, this universal infatuation bore not only the mark of a blindness, an imbecility, but had also the air of a vengeance. I do not believe, or at least I do not wish to believe, in the absolute success of such a brutish conspiracy, in which, as in all others, one finds both fools and knaves; but I am convinced that the ill-applied developments of photography, like all other purely material developments of progress, have contrib­uted much to the impoverishment of the French artistic genius, which is already so scarce. It is nonetheless obvious that this industry, by invading the territories of art, has become art’s most mor­tal enemy, and that the confusion of their several func­tions prevents any of them from being properly fulfilled.

        ― Charles Baudelaire, On Photography, from The Salon of 1859

        It sounds a lot like what you’re trying to say.

        Entering a prompt into an AI model is akin to directing, producing or acting as a muse. It’s a very similar argument as to the validity or artistic merits of factory artists like Andy Warhol or Jeff Koons - While you are responsible for the idea that produces a result, you are still relying on the work and effort of not only the numerous team of people creating the AI model and its algorithms , but also the immeasurable amount of man-hours and creativity involved in creating the source content for the model training materials.

        This can be said of many tools, from graphics rendering engines and art software to mass-produced pigments and tools. It took us 100,000 years to get from cave drawings to Leonard Da Vinci. This is just another step for artists, like Camera Obscura was in the past. It’s important to remember that early man was as smart as we are, they just lacked the interconnectivity and tools that we get.

        It’s one thing to use generative AI as tool, with intent to make use of the output as reference for your own work in a larger context. But to take the direct output and call it art is morally and ethically wrong. In my eyes, it makes you look like a total hack who doesn’t want to put the effort in to make things for themselves…no matter how much time you put into coming up with the prompt for the output.

        I still stand by my original arguments - coming up with a prompt or a training data set to create an image is not art, because you are not actively involved with the creation of the imagery, itself. What an AI model generates is not a creative work and it is not your creation. If that is offensive to you, there’s nothing I can do about that, because it’s apparent that your arguments only serve to make yourself feel better about using generative AI.

        This is just personal option. And I never felt bad about using it in the first place. It feels like you’re projecting your own feelings onto me.

        It’s also apparent that you have an extremely skewed view of what art is and what it means to be an artist. Art, at its base level is about expressing HUMAN creativity, not what an algorithm interprets it to be. It’s about making countless, specific choices for each step of the creative process and having complete control of the final outcome. It’s those choices that make your art truly unique and an expression of your creative vision. It doesn’t matter if it is objectively bad or good, just that it came from you, and that every detail, every color, every line, was your choice, not an interpretation of your words.

        It is still a human making generative art, and they use their emotions and learned experiences to guide the creation of works. You should familiarize yourself with all the different forms of guidance available with generative art. I think you’ll be pleasantly surprised.

        Any one can create art, but an artist is someone who dedicates themselves to their craft, as with any other craftsman. That passion is what separates an artisan from a hobbyist. You may view this as snobbery, but I view it as respect and honoring a tradition that spans all of human kind, back to the earliest cave paintings tens of thousands of years ago. I know my limits and what I’m capable of and I have come to terms with those deficiencies in my work. I’m not delusional enough to think that by generating an image through AI, it somehow makes up for those shortcomings and makes me into something I am not.

        This is a no-true-scotsman fallacy, you’re attempting to narrowly define artists to serve your needs, when no definition of “true artists” has ever existed. The rest of this is personal perspective that you shouldn’t force onto others. Let them create how they want, and in time, I think we’ll all come to benefit from their labors.

      • barsoap@lemm.ee
        link
        fedilink
        English
        arrow-up
        7
        arrow-down
        6
        ·
        8 months ago

        Unless you are creating your own AI model from scratch and training it purely on your own artworks, I don’t see how you can, in good conscience, claim the results to be your own.

        Did you create all the textures you put onto your 3d models? Did you use substance painter? Any sort of asset library? If you’re working in 2d, did you create your own brush textures?

        Did you create colour and perspective theory from scratch? If not, how can you call yourself a painter?

        Did Duchamp study the manufacture of ceramics before putting a factory-made urinal on a pedestal and called it a piece of art?

        • deepblueseas
          link
          fedilink
          English
          arrow-up
          6
          arrow-down
          6
          ·
          8 months ago

          Wow, nice rhetorical questions you got there, bud.

          What the fuck do you think?

          If you had enough reading comprehension and read through my whole response, you would have got to the part where I said creating art is about the culmination of choices you make in each part of the process.

          Maybe you can point it out to me, but I don’t recall the part where I said you have to recreate the fucking wheel every time you create something.

          That particular quote you pointed out, was specific to generative AI, because you don’t make those same choices. The model and the training data is what produces those results for you.

          But since you asked, yes I do have the knowledge to create textures by hand without Substance Painter. I’ve been doing 3d art since 2003, before that shit even existed and we hand to do it all manually in Photoshop.

          No, I didn’t fucking create color and perspective theory. What do you think I am… like a fucking immortal from ancient times? But I did have to learn that shit and took multiple classes dedicated to each of those topics.

          Lastly, you must have skipped on your art history for the last one, because the whole concept of that particular piece was that it was absurdist - an every day object raised to status of art by the artist. He didn’t fucking sculpt the urinal himself. So it would have been more appropriate to say he was a janitor that got lucky. Nice try, though.

          • barsoap@lemm.ee
            link
            fedilink
            English
            arrow-up
            9
            arrow-down
            4
            ·
            8 months ago

            That particular quote you pointed out, was specific to generative AI, because you don’t make those same choices. The model and the training data is what produces those results for you.

            And for a photographer, their surroundings is what produces many results, leading them to not make choices about those things. They focus on other things, don’t express themselves in the arrangement of leaves on a tree, leave that stuff to chance.

            The important part is not that choices are made for you, but that you do make, at the very least, a choice. One single choice suffices to have intent. It is not even necessary to make that choice during the creation of the piece, splattering five buckets of paint onto five canvases and choosing the one that sparks the right impression a choice.

            because the whole concept of that particular piece was that it was absurdist - an every day object raised to status of art by the artist.

            Yes, precisely. That one concept, the single choice, “yep a urinal should be both provocative and banal enough”, is what made it art.

            There is no minimum level of craft necessary for art.

            • deepblueseas
              link
              fedilink
              English
              arrow-up
              4
              arrow-down
              3
              ·
              8 months ago

              Ah, very interesting that you want to focus on photography as a comparison. To me, this just infers that you are not familiar with the type of choices that photographers do make, creatively. Just because they have endless amounts of subject matter readily available at their disposal, does not make the process any easier or different than other types of art.

              Photographers still consider composition, lighting, area of focus, color, etc. Along with a large amount of other factors such as camera body, filmback, lens, fstop, iso, flash, supplemental lighting, post-processing, the list goes on.

              Again, all of these choices are actively made when creating the work - using one’s critical thinking, decision making, experience and knowledge to inform each choice and how it will affect the outcome.

              Generative AI is not that and will never be that, no matter how much you argue otherwise. You are entering a prompt, the model is interpreting that and generating a result that it calculates to be most statistically accurate. Your choice of words are not artistic choices, they are at most, requests or instructions. If you iterate, you are not in control of what changes. You only find out what has changed after the result has been generated.

              Again, you are totally missing the point to the Fountain and using it as a false equivalence. It was made as a critique of the art world, to show the absurdity of what art critics said was valid art at the time. Whereas today, generative AI is not being made as a critique to anything. It’s being made for profit, to replaced skilled labor and using the work of the same people it’s trying to replace. Hopefully you can see how the two are different.

              • barsoap@lemm.ee
                link
                fedilink
                English
                arrow-up
                6
                arrow-down
                4
                ·
                8 months ago

                If you iterate, you are not in control of what changes. You only find out what has changed after the result has been generated.

                If you think that’s the case then you don’t understand the medium. Once you’ve explored a model, seen into its mind, understand how it understands things, you can direct it quite precisely. At least as precisely as a photographer taking a picture of a tree – yes, if you care about the arrangement of leaves then it might take a couple of tries until the wind moves them just right but you’ve made a point of going to the right tree, in the right season, on a day with the right weather, at a time with the right light.

                Whereas today, generative AI is not being made as a critique to anything.

                I’m not claiming that. There’s an incidental artistry in the sense that now some progressives have their underwear in a twist just as conservatives had theirs in a twist about Fountain but I’ll readily grant that there was no human intent behind it. Sometimes it’s not artists who troll people but the general machinations of the world. Still worthy of appreciation but calling it “art” is not a hill I would die on.

                What I’m claiming is that you can’t judge art by the level of craft involved: It can be zero and still be art. Any argument involving craft is literally missing the point of what art is.

              • Deceptichum
                link
                fedilink
                English
                arrow-up
                5
                arrow-down
                6
                ·
                edit-2
                8 months ago

                Photographers still consider composition, lighting, area of focus, color, etc. Along with a large amount of other factors such as camera body, filmback, lens, fstop, iso, flash, supplemental lighting, post-processing, the list goes on.

                You do know many of those things are considered in AI generated images as well, right?

                And there is so much more to it than a simple text prompt, even something as basic as what nodes i feed into what else and in what order/ weight can have vast impacts, do i want to use a depth map based on a 3d mannequin I’ve rigged up in blender to use as my pose or go with a canny line filter to keep the form as the focus, should i overlay the image cutout layer before filling in the background and running a detailer node on top or merge them together and see how that goes, etc.

    • Pips@lemmy.sdf.org
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      6
      ·
      8 months ago

      Mate, that’s not art, that’s coding. Congratulations on learning a new coding skill and how inputs can affect outputs. Frankly, it’s barely coding, it’s adding degrees of specification so a program can do all the work. I get that it took you a while to learn what all of it means and how it works, sort of, but something being hard to do doesn’t make it art.

      And don’t cheapen photography by comparing it to generating an AI image. There’s physical labor involved in photography on top of composition and patience.