• TʜᴇʀᴀᴘʏGⒶʀʏ
    link
    fedilink
    22 months ago

    Photography is widely recognized as an art form, even though the scene exists independently and the photographer “simply” frames and captures the shot.

    A better driving analogy might be Tesla’s current level of self-driving, where you have to keep a hand on the wheel and eyes on the road the whole time, and remain in charge of all the critical decisions. When someone arrives in a Tesla and says, “I drove here,” no one goes “ackchyually…” Even if we follow your analogy, it’s the individual’s idea to reach that destination- often a novel place no one has even been to before.

    Creative individuals curate unique datasets, which can take countless hours of manual work, to create LoRAs. They often draw from their own photographs, drawings, paintings, etc., and then coordinate prompts and parameters to blend their custom LoRAs with other creators’ LoRAs/models/checkpoints to craft something unique. These creations exist only because they had the vision and put in the effort to realize it. The process can be even more involved with tools like ControlNet, where artists might even sketch an outline of the scene by hand.

    A quick selfie might not be considered art, but intentional expression through creatively capturing a scene is (photography). Similarly, a quick generation via Copilot for a meme might not qualify as art, but intentional expression through creative generation certainly does.

    • Hobthrob
      link
      fedilink
      12 months ago

      I disagree with your analogy, as I find it overstates the active involvement of the driver (prompter) during the drive (actual image generation).

      Preparation is it’s own process, whether you’re curating art you made yourself/stole from non-consensual artists, or have been finding references as an artist. Different skillset. They help the process of making the final image, but they are not a direct part of that process.

      And let’s not kid ourselves about theses datasets. There’s no accountability so there’s no way to ensure that any dataset you’re getting from other people aren’t comprised of, at least partially, stolen art.

      ControlNet let’s you add visuals to your prompt for greater control, but you’re still generating the image externally, and leaving the vast majority of the decision making to the model you’re using. Even if someone is happy with the result they get from a generative model and find it visually pleasant, that doesn’t make it art. The model is doing the work and the model cannot have artistic intent, so it cannot make art. It can make images and people can enjoy those, but those images aren’t something new.

      They are amalgamations of most basic common denominator of existing things. It is much more like a really advanced collage that is great at hiding the seams.