We Asked A.I. to Create the Joker. It Generated a Copyrighted Image.::Artists and researchers are exposing copyrighted material hidden within A.I. tools, raising fresh legal questions.

  • @[email protected]
    link
    fedilink
    English
    10110 months ago

    What it proves is that they are feeding entire movies into the training data. It is excellent evidence for when WB and Disney decides to sue the shit out of them.

    • @[email protected]
      link
      fedilink
      English
      11810 months ago

      Does it really have to be entire movies when theres a ton of promotional images and memes with similar images?

      • @[email protected]
        link
        fedilink
        English
        510 months ago

        Yes. Thats what these things are, extremely large catalogues of data. As much data as possible is their goal.

        • @[email protected]
          link
          fedilink
          English
          1010 months ago

          True but it didn’t pick some random frame somewhere in the movie it chose a extremely memorable shot that is posted all over the place. I won’t deny that they are probably feeding it movies but this is not a sign of that.

          This image is literally the top result on Google images for me.

          • @[email protected]
            link
            fedilink
            English
            2
            edit-2
            10 months ago

            Why would it pick some random frame in the middle of its data set instead of a frame it has the most to reference. It can still use all those other frames to then pick the frame if has the most references to.

            But im starting to think maybe i misunderstood the comment i replied to.

            Sorry, im way out of context with my reply, totally my fault for reflexively replying.

            Uhhh would you accept i didnt have my coffee yet and hadnt got out of bed yet as an explanation?

    • @[email protected]
      link
      fedilink
      English
      60
      edit-2
      10 months ago

      I think it’s much more likely whatever scraping they used to get the training data snatched a screenshot of the movie some random internet user posted somewhere. (To confirm, I typed “joaquin phoenix joker” into Google and this very image was very high up in the image results) And of course not only this one but many many more too.

      Now I’m not saying scraping copyrighted material is morally right either, but I’d doubt they’d just feed an entire movie frame by frame (or randomly spaced screenshots from throughout a movie), especially because it would make generating good labels for each frame very difficult.

      • @[email protected]
        link
        fedilink
        English
        2410 months ago

        I just googled “what does joker look like” and it was the fourth hit on image search.

        Well, it was actually an article (unrelated to AI) that used the image.

        But then I went simpler – googling “joker” gives you the image (from the IMDb page) as the second hit.

    • @[email protected]
      link
      fedilink
      English
      2810 months ago

      WB and Disney would lose, at least without an amendment to copyright law. That in fact just happened in one court case. It was ruled that using a copyrighted work to train AI does not violate that works copyright.

          • @[email protected]
            link
            fedilink
            English
            410 months ago

            Something transformative from the original works. And arguably not being being distributed. The model producing and distributing derivative works is entirely different though. No one really gives a shit about data being used to train models - there’s nothing infringing about that which is exactly why they won their case. The example in the post is an entirely different situation though.

      • @[email protected]
        link
        fedilink
        English
        810 months ago

        And if you tried to sell that, you would be breaking the law.

        Which is what these AI models are doing

          • @[email protected]
            link
            fedilink
            English
            510 months ago

            No, they are selling you time in a digital room with a machine, and all of the things it spits out at you.

            You dont own the program generating these images. You are buying these images and the time to tinker with the AI interface.

              • @[email protected]
                link
                fedilink
                English
                110 months ago

                Youre pretty young, huh. When something on the internet from a big company is free, youre the product.

                Youre bug and stress testing their hardware, and giving them free advertising. While using the cheapest, lowest quality version that exists, and only for as long as they need the free QA.

                The real AI, and the actual quality outputs, cost money. And once they are confident in their server stability, the scraps youre picking over will get a price tag too.